Conversations Gone Awry, But Then? Evaluating Conversational Forecasting Models

Authors: Son Quoc Tran, Tushaar Gangavarapu, Nicholas Chernogor, Jonathan P. Chang, Cristian Danescu-Niculescu-Mizil

We often rely on our intuition to anticipate the direction of a conversation. Endowing automated systems with similar foresight can enable them to assist human-human interactions. Recent work on developing models with this predictive capacity has focused on the Conversations Gone Awry (CGA) task: forecasting whether an ongoing conversation will derail. In this work, we revisit this task and introduce the first uniform evaluation framework, creating a benchmark that enables direct and reliable comparisons between different architectures. This allows us to present an up-to-date overview of the current progress in CGA models, in light of recent advancements in language modeling. Our framework also introduces a novel metric that captures a model's ability to revise its forecast as the conversation progresses.

Subjects: Computation and Language, Human-Computer Interaction

Published: 2025-07-25 17:55:13 UTC

arXiv: 2507.19470
