2502.12672

Total: 1

#1 Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability [PDF3] [Copy] [Kimi1] [REL]

Authors: Tzu-Quan Lin, Wei-Ping Huang, Hao Tang, Hung-yi Lee

Speech representation models are highly effective at extracting general features for various tasks. While fine-tuning can enhance these representations for specific applications, it often compromises their generalization ability. To address this challenge, we propose Speech-FT, a fine-tuning strategy for speech representation models that leverages model merging to preserve generalization ability while still benefiting from fine-tuning. Speech-FT is effective across different fine-tuning scenarios and is compatible with various types of speech representation models, providing a versatile solution. Speech-FT offers an efficient and practical approach to further improving general speech representations after pre-training.

Subjects: Computation and Language , Artificial Intelligence

Publish: 2025-02-18 09:23:42 UTC