Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms
Journal:
arXiv
Published Date:
Dec 23, 2024
Abstract
The availability of multiple training algorithms and architectures for
generative models requires a selection mechanism to form a single model over a
group of well-trained generation models. The selection task is commonly
addressed by identifying the model that maximizes an evaluation score based on
the diversity and quality of the generated data. However, such a best-model
identification approach overlooks the possibility that a mixture of available
models can outperform each individual model. In this work, we numerically show
that a mixture of generative models on benchmark image datasets can indeed
achieve a better evaluation score (based on FID and KID scores), compared to
the individual models. This observation motivates the development of efficient
algorithms for selecting the optimal mixture of the models. To address this, we
formulate a quadratic optimization problem to find an optimal mixture model
achieving the maximum of kernel-based evaluation scores including kernel
inception distance (KID) and R\'enyi kernel entropy (RKE). To identify the
optimal mixture of the models using the fewest possible sample queries, we view
the selection task as a multi-armed bandit (MAB) problem and propose the
Mixture Upper Confidence Bound (Mixture-UCB) algorithm that provably converges
to the optimal mixture of the involved models. More broadly, the proposed
Mixture-UCB can be extended to optimize every convex quadratic function of the
mixture weights in a general MAB setting. We prove a regret bound for the
Mixture-UCB algorithm and perform several numerical experiments to show the
success of Mixture-UCB in finding the optimal mixture of text and image
generative models. The project code is available at
https://github.com/Rezaei-Parham/Mixture-UCB.