CASteer: Steering Diffusion Models for Controllable Generation

#1 CASteer: Steering Diffusion Models for Controllable Generation [PDF] [Copy] [Kimi¹] [REL]

Authors: Tatiana Gaintseva, Chengcheng Ma, Ziquan Liu, Martin Benning, Gregory Slabaugh, Jiankang Deng, Ismail Elezi

Diffusion models have transformed image generation, yet controlling their outputs for diverse applications, including content moderation and creative customization, remains challenging. Existing approaches usually require task-specific training and struggle to generalize across both concrete (e.g., objects) and abstract (e.g., styles) concepts. We propose CASteer (Cross-Attention Steering) a training-free framework for controllable image generation using steering vectors to influence a diffusion model$'$s hidden representations dynamically. CASteer computes these vectors offline by averaging activations from concept-specific generated images, then applies them during inference via a dynamic heuristic that activates modifications only when necessary, removing concepts from affected images or adding them to unaffected ones. This approach enables precise control over a wide range of tasks, including removing harmful content, adding desired attributes, replacing objects, or altering styles, all without model retraining. CASteer handles both concrete and abstract concepts, outperforming state-of-the-art techniques across multiple diffusion models while preserving unrelated content and minimizing unintended effects.

Subject: Graphics

Publish: 2025-03-11 18:20:20 UTC

2503.09630

#1 CASteer: Steering Diffusion Models for Controllable Generation [PDF] [Copy] [Kimi1] [REL]

#1 CASteer: Steering Diffusion Models for Controllable Generation [PDF] [Copy] [Kimi¹] [REL]