2409.10429


Meta-Whisper: Speech-Based Meta-ICL for ASR on Low-Resource Languages

Authors: Ming-Hao Hsu, Kuan Po Huang, Hung-yi Lee

This paper presents Meta-Whisper, a novel approach to improve automatic speech recognition (ASR) for low-resource languages using the Whisper model. By leveraging Meta In-Context Learning (Meta-ICL) and a k-Nearest Neighbors (KNN) algorithm for sample selection, Meta-Whisper enhances Whisper's ability to recognize speech in unfamiliar languages without extensive fine-tuning. Experiments on the ML-SUPERB dataset show that Meta-Whisper significantly reduces the Character Error Rate (CER) for low-resource languages compared to the original Whisper model. This method offers a promising solution for developing more adaptable multilingual ASR systems, particularly for languages with limited resources.
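The two technical ingredients named in the abstract, KNN-based selection of in-context examples and Character Error Rate, can be illustrated with a minimal sketch. The abstract does not state which features or distance metric the KNN step uses, so cosine similarity over fixed-length embeddings is an assumption here, and the function names `select_knn_examples` and `character_error_rate` are hypothetical. CER follows the standard definition: character-level edit distance divided by the reference length.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_knn_examples(
    query: Sequence[float], pool: Sequence[Sequence[float]], k: int
) -> List[int]:
    """Return indices of the k pool embeddings most similar to the query.

    ASSUMPTION: similarity is cosine over utterance embeddings; the paper's
    actual features and metric are not given in the abstract.
    """
    scored = [(cosine_similarity(query, emb), i) for i, emb in enumerate(pool)]
    # Sort by descending similarity, breaking ties by original index.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:k]]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Character-level Levenshtein distance divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    # prev[j] holds the edit distance between reference[:i-1] and hypothesis[:j].
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(reference)
```

In a pipeline of this shape, the indices returned by `select_knn_examples` would pick which labeled utterances to place in the model's context before the test utterance, and `character_error_rate` would score the resulting transcript against the reference.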

Subjects: Audio and Speech Processing, Computation and Language, Sound

Published: 2024-09-16 16:04:16 UTC