edraki21@interspeech_2021@ISCA

Total: 1

#1 A Spectro-Temporal Glimpsing Index (STGI) for Speech Intelligibility Prediction [PDF] [Copy] [Kimi1]

Authors: Amin Edraki ; Wai-Yip Chan ; Jesper Jensen ; Daniel Fogerty

We propose a monaural intrusive speech intelligibility prediction (SIP) algorithm called STGI based on detecting glimpses in short-time segments in a spectro-temporal modulation decomposition of the input speech signals. Unlike existing glimpse-based SIP methods, the application of STGI is not limited to additive uncorrelated noise; STGI can be employed in a broad range of degradation conditions. Our results show that STGI performs consistently well across 15 datasets covering degradation conditions including modulated noise, noise reduction processing, reverberation, near-end listening enhancement, checkerboard noise, and gated noise.