2511.18774

Total: 1

#1 Context-Aware Whisper for Arabic ASR Under Linguistic Varieties [PDF] [Copy] [Kimi] [REL]

Authors: Bashar Talafha, Amin Abu Alhassan, Muhammad Abdul-Mageed

Low-resource ASR remains a challenging problem, especially for languages like Arabic that exhibit wide dialectal variation and limited labeled data. We propose context-aware prompting strategies to adapt OpenAI's Whisper for Arabic speech recognition without retraining. Our methods include decoder prompting with first-pass transcriptions or retrieved utterances, and encoder prefixing using speech synthesized in the target speaker's voice. We introduce techniques such as prompt reordering, speaker-aware prefix synthesis, and modality-specific retrieval (lexical, semantic, acoustic) to improve transcription in real-world, zero-shot settings. Evaluated on nine Arabic linguistic conditions, our approach reduces WER by up to 22.3% on Modern Standard Arabic and 9.2% on dialectal speech, significantly mitigating hallucinations and speaker mismatch.

Subject: Computation and Language

Publish: 2025-11-24 05:16:04 UTC