Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model
Journal:
arXiv
Published Date:
Mar 16, 2025
Abstract
Image generative models, particularly diffusion-based models, have surged in
popularity due to their remarkable ability to synthesize highly realistic
images. However, since these models are data-driven, they inherit biases from
the training datasets, frequently leading to disproportionate group
representations that exacerbate societal inequities. Traditionally, efforts to
debiase these models have relied on predefined sensitive attributes,
classifiers trained on such attributes, or large language models to steer
outputs toward fairness. However, these approaches face notable drawbacks:
predefined attributes do not adequately capture complex and continuous
variations among groups. To address these issues, we introduce the Debiasing
Diffusion Model (DDM), which leverages an indicator to learn latent
representations during training, promoting fairness through balanced
representations without requiring predefined sensitive attributes. This
approach not only demonstrates its effectiveness in scenarios previously
addressed by conventional techniques but also enhances fairness without relying
on predefined sensitive attributes as conditions. In this paper, we discuss the
limitations of prior bias mitigation techniques in diffusion-based models,
elaborate on the architecture of the DDM, and validate the effectiveness of our
approach through experiments.