2508.07563

Total: 1

#1 Exploring Efficient Directional and Distance Cues for Regional Speech Separation [PDF1] [Copy] [Kimi2] [REL]

Authors: Yiheng Jiang, Haoxu Wang, Yafeng Chen, Gang Qiao, Biao Tian

In this paper, we introduce a neural network-based method for regional speech separation using a microphone array. This approach leverages novel spatial cues to extract the sound source not only from specified direction but also within defined distance. Specifically, our method employs an improved delay-and-sum technique to obtain directional cues, substantially enhancing the signal from the target direction. We further enhance separation by incorporating the direct-to-reverberant ratio into the input features, enabling the model to better discriminate sources within and beyond a specified distance. Experimental results demonstrate that our proposed method leads to substantial gains across multiple objective metrics. Furthermore, our method achieves state-of-the-art performance on the CHiME-8 MMCSG dataset, which was recorded in real-world conversational scenarios, underscoring its effectiveness for speech separation in practical applications.

Subjects: Sound , Audio and Speech Processing

Publish: 2025-08-11 02:52:46 UTC