José Tomás García-Molina , Maximiliano Downey , Emmanuel Méndez , Alicia Figueroa-Barra
{"title":"Diagnostic and transition accuracy of natural language processing in high risk for psychosis individuals: A systematic review","authors":"José Tomás García-Molina , Maximiliano Downey , Emmanuel Méndez , Alicia Figueroa-Barra","doi":"10.1016/j.ajp.2025.104695","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Schizophrenia spectrum disorders often emerge in adolescence or early adulthood and are a leading cause of global disability. Early identification of clinical high‑risk for psychosis (CHR‑P) can reduce comorbidity and shorten untreated psychosis duration, yet clinician‑administered tools (e.g., SIPS/SOPS, CAARMS, PSYCHS) are time‑consuming and only moderately predictive.</div></div><div><h3>Objective</h3><div>To systematically review the diagnostic and prognostic accuracy of natural language processing (NLP) applied to speech in CHR‑P populations, and to map methodological trends and gaps.</div></div><div><h3>Methods</h3><div>We searched PubMed, Scopus, and Embase through May 2025 for English or Spanish studies enrolling CHR‑P individuals by validated criteria, applying NLP to speech transcripts, and reporting quantitative metrics (accuracy, sensitivity, specificity, AUC‑ROC). Two reviewers independently screened studies, extracted data, and assessed bias with QUADAS‑2; disagreements were resolved by a third reviewer.</div></div><div><h3>Results</h3><div>Results: From 356 records, nine studies (eight unique cohorts; N = 353 CHR-P, 197 controls) met inclusion. Four case–control studies and one prospective cohort assessed cross-sectional discrimination of CHR-P from healthy controls, reporting accuracies of 56–95 % (AUC-ROC 0.86–0.99). Four prospective studies examined transition prediction, with accuracies ranging from 83 % to 100 %. Studies covered five languages and employed diverse NLP pipelines (e.g., LSA, Word2Vec, USE, SBERT, graph metrics, sentiment analysis). However, feature heterogeneity, small samples (≤ 50 CHR-P), varied speech tasks, and inconsistent validation limited comparability.</div></div><div><h3>Conclusions</h3><div>NLP‑based speech analysis shows promise as an objective biomarker for early psychosis detection and risk stratification. To advance clinical utility, future research should adopt standardized protocols, recruit larger and more diverse cohorts, and implement multicenter validation.</div></div>","PeriodicalId":8543,"journal":{"name":"Asian journal of psychiatry","volume":"112 ","pages":"Article 104695"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian journal of psychiatry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1876201825003387","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Schizophrenia spectrum disorders often emerge in adolescence or early adulthood and are a leading cause of global disability. Early identification of clinical high‑risk for psychosis (CHR‑P) can reduce comorbidity and shorten untreated psychosis duration, yet clinician‑administered tools (e.g., SIPS/SOPS, CAARMS, PSYCHS) are time‑consuming and only moderately predictive.
Objective
To systematically review the diagnostic and prognostic accuracy of natural language processing (NLP) applied to speech in CHR‑P populations, and to map methodological trends and gaps.
Methods
We searched PubMed, Scopus, and Embase through May 2025 for English or Spanish studies enrolling CHR‑P individuals by validated criteria, applying NLP to speech transcripts, and reporting quantitative metrics (accuracy, sensitivity, specificity, AUC‑ROC). Two reviewers independently screened studies, extracted data, and assessed bias with QUADAS‑2; disagreements were resolved by a third reviewer.
Results
Results: From 356 records, nine studies (eight unique cohorts; N = 353 CHR-P, 197 controls) met inclusion. Four case–control studies and one prospective cohort assessed cross-sectional discrimination of CHR-P from healthy controls, reporting accuracies of 56–95 % (AUC-ROC 0.86–0.99). Four prospective studies examined transition prediction, with accuracies ranging from 83 % to 100 %. Studies covered five languages and employed diverse NLP pipelines (e.g., LSA, Word2Vec, USE, SBERT, graph metrics, sentiment analysis). However, feature heterogeneity, small samples (≤ 50 CHR-P), varied speech tasks, and inconsistent validation limited comparability.
Conclusions
NLP‑based speech analysis shows promise as an objective biomarker for early psychosis detection and risk stratification. To advance clinical utility, future research should adopt standardized protocols, recruit larger and more diverse cohorts, and implement multicenter validation.
期刊介绍:
The Asian Journal of Psychiatry serves as a comprehensive resource for psychiatrists, mental health clinicians, neurologists, physicians, mental health students, and policymakers. Its goal is to facilitate the exchange of research findings and clinical practices between Asia and the global community. The journal focuses on psychiatric research relevant to Asia, covering preclinical, clinical, service system, and policy development topics. It also highlights the socio-cultural diversity of the region in relation to mental health.