Disfluencies are a characteristic of speech. We focus on the impact of a specific class of disfluency, whole-word speech substitution errors (WSSE), on the performance of LLM-based conversational recommender systems. We develop Syn-WSSE, a psycholinguistically grounded framework for synthetically injecting genre-based WSSE at varying ratios, and use it to study their effect on conversational recommendation performance. We find that LLMs are affected differently: Llama and Mixtral improve in the presence of these errors, while Gemini, GPT-4o, and GPT-4o-mini deteriorate. We hypothesize that this difference in resilience stems from differences in pre- and post-training methods and data, and that the performance gains arise from the genre diversity the errors introduce. Our findings underscore the importance of choosing the LLM carefully for these systems and, more broadly, show that disfluencies must be explicitly designed for, as they can have unforeseen impacts.
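As a rough, hypothetical illustration of the core idea behind injecting whole-word substitution errors at a configurable ratio (not the authors' actual Syn-WSSE implementation; the function name, substitution pool, and ratio handling here are all assumptions), a minimal sketch in Python might look like this:

```python
import random


def inject_wsse(tokens, substitution_pool, error_ratio, seed=0):
    """Replace a fraction of whole words with substitutes drawn from a pool.

    tokens: list of words from the user utterance.
    substitution_pool: candidate replacement words (e.g., drawn from a
        psycholinguistically plausible lexicon of whole-word errors).
    error_ratio: fraction of words to corrupt, e.g., 0.2 for 20%.
    """
    rng = random.Random(seed)
    corrupted = list(tokens)
    # Number of words to corrupt, bounded by the utterance length.
    n_errors = min(round(error_ratio * len(tokens)), len(tokens))
    for idx in rng.sample(range(len(tokens)), k=n_errors):
        corrupted[idx] = rng.choice(substitution_pool)
    return corrupted


# Example: corrupt roughly 20% of the words in a recommendation request.
utterance = "can you recommend a light hearted comedy movie for tonight".split()
pool = ["comet", "melody", "token", "ladder"]  # stand-in substitutes
print(" ".join(inject_wsse(utterance, pool, error_ratio=0.2)))
```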