Total: 1

#1 On Fairness of Unified Multimodal Large Language Model for Image Generation

Authors: Ming Liu, Hao Chen, Jindong Wang, Liwen Wang, Bhiksha Raj, Wensheng Zhang

Unified multimodal large language models (U-MLLMs) have demonstrated impressive performance in end-to-end visual understanding and generation tasks. However, compared to generation-only systems (e.g., Stable Diffusion), the unified architecture of U-MLLMs introduces new risks of propagating demographic stereotypes. In this paper, we benchmark several state-of-the-art U-MLLMs and show that they exhibit significant gender and race biases in their generated outputs. To diagnose the source of these biases, we propose a locate-then-fix framework: we first audit the vision and language components, using techniques such as linear probing and controlled generation, and find that the language model appears to be a primary origin of the observed generative bias. Moreover, we observe a "partial alignment" phenomenon, in which U-MLLMs exhibit less bias in understanding tasks yet produce substantially biased images. To address this, we introduce a novel balanced preference loss that enforces uniform generation probabilities across demographic groups by leveraging a synthetically balanced dataset. Extensive experiments show that our approach significantly reduces demographic bias while preserving semantic fidelity and image quality. Our findings underscore the need for targeted debiasing strategies in unified multimodal systems and offer a practical approach to mitigating such biases.

Subject: NeurIPS.2025 - Poster