ferrer21@interspeech_2021@ISCA

Total: 1

#1 Modeling Dysphonia Severity as a Function of Roughness and Breathiness Ratings in the GRBAS Scale [PDF] [Copy] [Kimi1]

Authors: Carlos A. Ferrer ; Efren Aragón ; María E. Hdez-Díaz ; Marc S. de Bodt ; Roman Cmejla ; Marina Englert ; Mara Behlau ; Elmar Nöth

Dysphonia comprises many perceptually deviating aspects of voice, and its overall severity perception is made by the listener according to methods of aggregating the single dimensions which are personally conceived and not well studied. Roughness and breathiness are constituent dimensions in most devised rating scales in clinical use. In this paper, we evaluate several ways to model the mapping of the overall severity as a function of the particular ratings of roughness and breathiness. The models include the simple linear averaging as well as several non-linear variants suggested elsewhere, and some minor adjustments. The models are evaluated on four datasets from different countries, allowing a more global evaluation of how the mapping is conceived. Results show the limitations of the most widely assumed linear approach, while also hinting at a need for a more uniform coverage of the sample space in voice pathology datasets. The models explored in this paper can be expanded to higher-dimensional scales.