Suicide Note Classification Using Natural Language Processing: A Content Analysis.

John Pestian, Henry Nasrallah, Pawel Matykiewicz, Aurora Bennett, Antoon Leenaars
{"title":"Suicide Note Classification Using Natural Language Processing: A Content Analysis.","authors":"John Pestian, Henry Nasrallah, Pawel Matykiewicz, Aurora Bennett, Antoon Leenaars","doi":"10.4137/bii.s4706","DOIUrl":null,"url":null,"abstract":"<p><p>Suicide is the second leading cause of death among 25-34 year olds and the third leading cause of death among 15-25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes.</p>","PeriodicalId":88397,"journal":{"name":"Biomedical informatics insights","volume":"2010 3","pages":"19-28"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.4137/bii.s4706","citationCount":"210","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical informatics insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4137/bii.s4706","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 210

Abstract

Suicide is the second leading cause of death among 25-34 year olds and the third leading cause of death among 15-25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes.

Abstract Image

Abstract Image

Abstract Image

使用自然语言处理的遗书分类:内容分析。
在美国,自杀是25-34岁人群死亡的第二大原因,也是15-25岁人群死亡的第三大原因。在有自杀倾向的病人经常出现的急诊科,估计反复尝试的风险通常留给临床判断。本文提出了我们的第二次尝试,以确定计算算法在理解自杀患者思想中的作用,如自杀遗书所代表的。我们专注于开发自然语言处理方法,以区分真实的和诱导的遗书。我们假设机器学习算法可以对遗书进行分类,就像心理健康专家和精神科实习医生所做的那样。所使用的数据包括来自33名自杀完成者的遗书,并与来自健康对照组成员的33份遗书相匹配。11名心理健康专业人员和31名精神病学受训人员被要求判断一张纸条是真实的还是被诱骗的。他们的决定与九种不同的机器学习算法进行了比较。结果表明,受训者在49%的时间里准确地分类了笔记,心理健康专业人员在63%的时间里准确地分类了笔记,而最好的机器学习算法在78%的时间里准确地分类了笔记。这是开发基于证据的反复自杀企图预测器的重要一步,因为它表明自然语言处理可以帮助区分自杀笔记的类别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信