Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture

#1 Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture [PDF¹] [Copy] [Kimi] [REL]

Authors: Xuanchen Li, Jianyu Wang, Yuhao Cheng, Yikun Zeng, Xingyu Ren, Wenhan Zhu, Weiming Zhao, Yichao Yan

Significant progress has been made for speech-driven 3D face animation, but most works focus on learning the motion of mesh/geometry, ignoring the impact of dynamic texture. In this work, we reveal that dynamic texture plays a key role in rendering high-fidelity talking avatars, and introduce a high-resolution 4D dataset TexTalk4D, consisting of 100 minutes of audio-synced scan-level meshes with detailed 8K dynamic textures from 100 subjects. Based on the dataset, we explore the inherent correlation between motion and texture, and propose a diffusion-based framework TexTalker to simultaneously generate facial motions and dynamic textures from speech. Furthermore, we propose a novel pivot-based style injection strategy to capture the complicity of different texture and motion styles, which allows disentangled control. TexTalker, as the first method to generate audio-synced facial motion with dynamic texture, not only outperforms the prior arts in synthesising facial motions, but also produces realistic textures that are consistent with the underlying facial movements.

Subject: CVPR.2025 - Poster

Li_Towards_High-fidelity_3D_Talking_Avatar_with_Personalized_Dynamic_Texture@CVPR2025@CVF

#1 Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture [PDF1] [Copy] [Kimi] [REL]

#1 Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture [PDF¹] [Copy] [Kimi] [REL]