IEEE Transactions on Affective Computing, 2026 (SCI-Expanded, Scopus)
Clinical records are inaccessible due to legal restrictions, and genuine suicide notes are scarce, limiting the availability of verified suicidal expression data. Consequently, most Natural Language Processing (NLP) studies rely on widely accessible social media datasets containing expert annotations or heuristic labels. However, it remains unclear how expressions of suicidal ideation in these datasets relate to the ideation expressed in genuine suicide notes. In this study, we address this gap by comparing manually annotated social media datasets in English and Turkish, as well as automatically labeled English datasets, directly against genuine suicide notes using a combination of linguistic and statistical analyses, classification models, and zero-shot embedding-based representations. Results show that expert-labeled social media data exhibit limited overlap with suicide notes in both languages while remaining distinguishable from them, whereas automatically labeled data diverge substantially from the notes in all analyses. These findings indicate that the annotation method critically shapes the signals learned by computational models and that social media data capture different stages or forms of ideation from those found in suicide notes. Overall, our findings emphasize the importance of careful dataset evaluation and caution against misinterpreting model performance in suicide risk detection research, given the shared objective of supporting global suicide prevention efforts.
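To illustrate the kind of zero-shot embedding-based comparison the abstract refers to, the sketch below embeds two sets of texts with a pretrained sentence encoder and measures their pairwise cosine similarity. It is a minimal example, not the paper's pipeline: it assumes the sentence-transformers library and the public all-MiniLM-L6-v2 model, and the text strings are placeholders rather than data from the study.

```python
# Minimal sketch: zero-shot embedding comparison between two text collections.
# Assumes the sentence-transformers library; all texts here are placeholders.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # any multilingual encoder could be swapped in

social_media_posts = [
    "placeholder post annotated as expressing suicidal ideation",
    "placeholder post annotated as not expressing suicidal ideation",
]
suicide_notes = [
    "placeholder text standing in for a genuine suicide note",
]

# Encode both collections into a shared embedding space without any task-specific training.
post_emb = model.encode(social_media_posts, convert_to_tensor=True, normalize_embeddings=True)
note_emb = model.encode(suicide_notes, convert_to_tensor=True, normalize_embeddings=True)

# Pairwise cosine similarities: row i, column j = similarity of post i to note j.
similarities = util.cos_sim(post_emb, note_emb)
print("Mean post-to-note similarity:", similarities.mean().item())
```

In practice, a distributional comparison (e.g., the full similarity matrix or per-dataset similarity distributions) is more informative than a single mean, since it can expose whether only a subset of posts resembles the notes.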