Before the Labels: How Dataset Construction Shapes Suicidality Detection in Clinical Text

Authors: Priyanshi Garg, Ishita Rao, Jieqiong Ding, Amandalynne Paullada

Clinical NLP increasingly relies on electronic health record (EHR) data to detect suicidal behaviors, treating clinical documentation as more reliable ground truth than social media. We argue that this framing obscures how EHR-based suicidality datasets encode a particular operationalization of suicidality, shaped by who authors the data, how episodes are bounded, and how ambiguity is resolved. We ground this argument in a case study of the ScAN dataset, built over MIMIC-III clinical notes. We show how governance constraints, ICD-based cohort selection, single-annotator labeling, and hospital-stay-level aggregation produce labels that reflect clinician-documented judgments, treat suicidality as a bounded episode, and assume that intent can be reliably inferred from documentation. A linguistic analysis demonstrates that identical labels subsume heterogeneous clinical framings differing in temporality, negation, and uncertainty. We argue that clinical NLP should examine the assumptions embedded in suicidality datasets before interpreting their labels as ground truth.

Subjects: Computation and Language, Artificial Intelligence

Publish: 2026-06-17 22:31:27 UTC

2606.19637
