LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

#1 LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer [PDF⁵] [Copy] [Kimi¹] [REL]

Authors: Yiren Song, Danze Chen, Mike Zheng Shou

Generating cognitive-aligned layered SVGs remains challenging due to existing methods' tendencies toward either oversimplified single-layer outputs or optimization-induced shape redundancies. We propose LayerTracer, a DiT based framework that bridges this gap by learning designers' layered SVG creation processes from a novel dataset of sequential design operations. Our approach operates in two phases: First, a text-conditioned DiT generates multi-phase rasterized construction blueprints that simulate human design workflows. Second, layer-wise vectorization with path deduplication produces clean, editable SVGs. For image vectorization, we introduce a conditional diffusion mechanism that encodes reference images into latent tokens, guiding hierarchical reconstruction while preserving structural integrity. Extensive experiments show that LayerTracer surpasses optimization-based and neural baselines in generation quality and editability.

Subject: ICCV.2025 - Oral

Song_LayerTracer_Cognitive-Aligned_Layered_SVG_Synthesis_via_Diffusion_Transformer@ICCV2025@CVF

#1 LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer [PDF5] [Copy] [Kimi1] [REL]

#1 LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer [PDF⁵] [Copy] [Kimi¹] [REL]