Agnieszka Ewa Krautz, Julia Volkening, Janik Raue, Christian Otte, Simon B Eickhoff, Eike Ahlers, Jörg Langner
{"title":"Prediction of suicide using web based voice recordings analyzed by artificial intelligence.","authors":"Agnieszka Ewa Krautz, Julia Volkening, Janik Raue, Christian Otte, Simon B Eickhoff, Eike Ahlers, Jörg Langner","doi":"10.1038/s41598-025-08639-2","DOIUrl":null,"url":null,"abstract":"<p><p>The integration of machine learning (ML) and deep learning models in suicide risk assessment has advanced significantly in recent years. In this study, we utilized ML in a case-control design, we predicted completed suicides using publicly available, web-based, real-world voice data, and treating speech as a biomarker. Our model demonstrated high accuracy in distinguishing between individuals who died by suicide and carefully matched controls achieving an area under the curve (AUC) of 0.74. This improved to an AUC of 0.85 and an accuracy of 76% when analyzing the subset of individuals who died by suicide within 12 months of the audio recording. The best predictive performance was observed with the Multilayer perceptron model, particularly when using the all Bene, Q + U Bene, and Q + U Raw feature sets-highlighting the importance of combining structured and unstructured paralinguistic features. The findings highlight the critical temporal proximity of voice biomarkers to suicide risk. The model's robustness is further evidenced by its resilience to perturbations in the analytical pipeline. This is the first study to successfully predict actual suicidal behavior rather than surrogate markers, marking a major step forward in suicide prevention. By demonstrating that speech can serve as a non-invasive and objective biomarker for suicide risk, this research opens new avenues for diagnostic and prognostic applications.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"23855"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12227525/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-08639-2","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The integration of machine learning (ML) and deep learning models in suicide risk assessment has advanced significantly in recent years. In this study, we utilized ML in a case-control design, we predicted completed suicides using publicly available, web-based, real-world voice data, and treating speech as a biomarker. Our model demonstrated high accuracy in distinguishing between individuals who died by suicide and carefully matched controls achieving an area under the curve (AUC) of 0.74. This improved to an AUC of 0.85 and an accuracy of 76% when analyzing the subset of individuals who died by suicide within 12 months of the audio recording. The best predictive performance was observed with the Multilayer perceptron model, particularly when using the all Bene, Q + U Bene, and Q + U Raw feature sets-highlighting the importance of combining structured and unstructured paralinguistic features. The findings highlight the critical temporal proximity of voice biomarkers to suicide risk. The model's robustness is further evidenced by its resilience to perturbations in the analytical pipeline. This is the first study to successfully predict actual suicidal behavior rather than surrogate markers, marking a major step forward in suicide prevention. By demonstrating that speech can serve as a non-invasive and objective biomarker for suicide risk, this research opens new avenues for diagnostic and prognostic applications.
近年来,机器学习(ML)和深度学习模型在自杀风险评估中的整合取得了显著进展。在这项研究中,我们在病例对照设计中使用ML,我们使用公开可用的、基于网络的、真实世界的语音数据来预测自杀,并将语音作为生物标志物。我们的模型在区分自杀死亡的个体和精心匹配的对照方面显示出很高的准确性,曲线下面积(AUC)达到0.74。当分析录音后12个月内自杀的个体子集时,这一方法的AUC提高到0.85,准确率达到76%。多层感知器模型的预测性能最好,特别是在使用所有Bene、Q + U Bene和Q + U Raw特征集时,这突出了结合结构化和非结构化副语言特征的重要性。这些发现强调了声音生物标志物与自杀风险的关键时间接近性。该模型的鲁棒性进一步证明了其对分析管道中扰动的弹性。这是第一个成功预测实际自杀行为的研究,而不是替代标记,标志着自杀预防向前迈出了重要一步。通过证明语言可以作为自杀风险的非侵入性和客观的生物标志物,本研究为诊断和预后应用开辟了新的途径。
期刊介绍:
We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections.
Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021).
•Engineering
Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live.
•Physical sciences
Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics.
•Earth and environmental sciences
Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems.
•Biological sciences
Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants.
•Health sciences
The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.