2508.03481

Total: 1

#1 Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models [PDF7] [Copy] [Kimi6] [REL]

Authors: Hyungjin Kim, Seokho Ahn, Young-Duk Seo

Personalized generation in T2I diffusion models aims to naturally incorporate individual user preferences into the generation process with minimal user intervention. However, existing studies primarily rely on prompt-level modeling with large-scale models, often leading to inaccurate personalization due to the limited input token capacity of T2I diffusion models. To address these limitations, we propose DrUM, a novel method that integrates user profiling with a transformer-based adapter to enable personalized generation through condition-level modeling in the latent space. DrUM demonstrates strong performance on large-scale datasets and seamlessly integrates with open-source text encoders, making it compatible with widely used foundation T2I models without requiring additional fine-tuning.

Subjects: Computer Vision and Pattern Recognition , Artificial Intelligence , Computation and Language

Publish: 2025-08-05 14:14:55 UTC