A Unifying Information-theoretic Perspective on Evaluating Generative Models
Journal:
arXiv
Published Date:
Dec 18, 2024
Abstract
Considering the difficulty of interpreting generative model output, there is
significant current research focused on determining meaningful evaluation
metrics. Several recent approaches utilize "precision" and "recall," borrowed
from the classification domain, to individually quantify the output fidelity
(realism) and output diversity (representation of the real data variation),
respectively. With the increase in metric proposals, there is a need for a
unifying perspective, allowing for easier comparison and clearer explanation of
their benefits and drawbacks. To this end, we unify a class of
kth-nearest-neighbors (kNN)-based metrics under an information-theoretic lens
using approaches from kNN density estimation. Additionally, we propose a
tri-dimensional metric composed of Precision Cross-Entropy (PCE), Recall
Cross-Entropy (RCE), and Recall Entropy (RE), which separately measure fidelity
and two distinct aspects of diversity, inter- and intra-class. Our
domain-agnostic metric, derived from the information-theoretic concepts of
entropy and cross-entropy, can be dissected for both sample- and mode-level
analysis. Our detailed experimental results demonstrate the sensitivity of our
metric components to their respective qualities and reveal undesirable
behaviors of other metrics.