Text-to-image diffusion models are typically trained on large-scale web data, often yielding outputs that are misaligned with human preferences. Inspired by preference learning in large language models, we propose ABC (Alignment by Classification), a simple yet effective framework for aligning diffusion models with human preferences. In contrast to prior DPO-based methods that depend on suboptimal supervised fine-tuned (SFT) reference models, ABC assumes access to an ideal reference model perfectly aligned with human intent and reformulates alignment as a classification problem. Under this view, preference data naturally forms a semi-supervised classification setting. To obtain full supervision, we propose a data augmentation strategy that transforms preference comparisons into fully supervised training signals, and we introduce a classification-based ABC loss to guide alignment. This alignment-by-classification approach effectively steers the diffusion model toward the behavior of the ideal reference. Experiments on various diffusion models show that ABC consistently outperforms existing baselines, offering a scalable and robust solution for preference-based text-to-image fine-tuning.
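To make the classification view concrete, below is a minimal PyTorch-style sketch of how a preference pair might be turned into a supervised classification signal for a diffusion model. It is an illustrative assumption, not the paper's actual ABC loss: the helper names (`denoising_error`, `abc_classification_loss`), the use of per-sample denoising error as an implicit alignment logit, and the cross-entropy form are all hypothetical.

```python
import torch
import torch.nn.functional as F

def denoising_error(model, x0, cond, t, noise):
    """Per-sample noise-prediction MSE; lower error roughly means the model
    assigns higher likelihood to the sample. (Assumption: we treat this as
    an implicit 'alignment logit'. `model.q_sample` is a hypothetical
    forward-diffusion helper.)"""
    x_t = model.q_sample(x0, t, noise)   # forward-diffuse the clean latents
    pred = model(x_t, t, cond)           # predict the injected noise
    return F.mse_loss(pred, noise, reduction="none").mean(dim=(1, 2, 3))

def abc_classification_loss(model, x_win, x_lose, cond, beta=1.0):
    """Hypothetical classification-style preference loss: the preferred image
    is the positive class, the dispreferred one the negative class, and the
    model is trained with cross-entropy on error-derived logits."""
    b = x_win.shape[0]
    t = torch.randint(0, model.num_timesteps, (b,), device=x_win.device)
    noise = torch.randn_like(x_win)
    err_w = denoising_error(model, x_win, cond, t, noise)
    err_l = denoising_error(model, x_lose, cond, t, noise)
    # Lower denoising error -> higher logit for that sample.
    logits = -beta * torch.stack([err_w, err_l], dim=1)
    labels = torch.zeros(b, dtype=torch.long, device=x_win.device)  # class 0 = winner
    return F.cross_entropy(logits, labels)
```

In this sketch, the supervised labels stand in for the decisions of the ideal reference model, which is one way the "alignment as classification" reformulation could be instantiated in practice.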