Sheriff Tolulope Ibrahim , Madeline Li , Jamin Patel , Tarun Reddy Katapally
{"title":"Utilizing natural language processing for precision prevention of mental health disorders among youth: A systematic review","authors":"Sheriff Tolulope Ibrahim , Madeline Li , Jamin Patel , Tarun Reddy Katapally","doi":"10.1016/j.compbiomed.2025.109859","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The global mental health crisis has created barriers to youth mental healthcare, leaving many disorders unaddressed. Precision prevention, which identifies individual risks, offers the potential for tailored interventions. While natural language processing (NLP) has shown promise in the early detection of mental health disorders, no review has examined its role in youth mental health detection. We hypothesize that NLP can improve early detection and personalized care in mental healthcare among youth.</div></div><div><h3>Methodology</h3><div>After screening 1197 articles from 5 databases, 12 papers were included covering six categories: (1) mental health disorders, (2) data sources, (3) NLP objective for mental health detection, (4) annotation and validation techniques, (5) linguistic markers, and (6) performance and evaluation. Study quality was assessed using Hawker's checklist for disparate study designs.</div></div><div><h3>Results</h3><div>Most studies focused on suicide risk (42 %), depression (25 %), and stress (17 %). Social media (42 %) and interviews (33 %) were the most common data sources, with linguistic inquiry and word count and support vector machines frequently used for analysis. While most studies were exploratory, one implemented a real-time tool for detecting mental health risks. Validation methods, including precision and recall metrics, showed strong predictive performance.</div></div><div><h3>Conclusions</h3><div>This review highlights the potential of NLP in youth mental health detection, addressing challenges such as bias, data quality, and ethical concerns. Future research should refine NLP models using diverse, multimodal datasets, addressing data imbalance, and improving real-time detection. Exploring transformer-based models and ensuring ethical, inclusive data handling will be key to advancing NLP-driven interventions.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"188 ","pages":"Article 109859"},"PeriodicalIF":7.0000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525002094","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
The global mental health crisis has created barriers to youth mental healthcare, leaving many disorders unaddressed. Precision prevention, which identifies individual risks, offers the potential for tailored interventions. While natural language processing (NLP) has shown promise in the early detection of mental health disorders, no review has examined its role in youth mental health detection. We hypothesize that NLP can improve early detection and personalized care in mental healthcare among youth.
Methodology
After screening 1197 articles from 5 databases, 12 papers were included covering six categories: (1) mental health disorders, (2) data sources, (3) NLP objective for mental health detection, (4) annotation and validation techniques, (5) linguistic markers, and (6) performance and evaluation. Study quality was assessed using Hawker's checklist for disparate study designs.
Results
Most studies focused on suicide risk (42 %), depression (25 %), and stress (17 %). Social media (42 %) and interviews (33 %) were the most common data sources, with linguistic inquiry and word count and support vector machines frequently used for analysis. While most studies were exploratory, one implemented a real-time tool for detecting mental health risks. Validation methods, including precision and recall metrics, showed strong predictive performance.
Conclusions
This review highlights the potential of NLP in youth mental health detection, addressing challenges such as bias, data quality, and ethical concerns. Future research should refine NLP models using diverse, multimodal datasets, addressing data imbalance, and improving real-time detection. Exploring transformer-based models and ensuring ethical, inclusive data handling will be key to advancing NLP-driven interventions.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.