Nils Hentati Isacsson , Lucía Gómez-Zaragozá , Fehmi Ben Abdesslem , Magnus Boman , Viktor Kaldo
{"title":"Natural language processing models for predicting treatment outcomes in internet-delivered cognitive behavioral therapy","authors":"Nils Hentati Isacsson , Lucía Gómez-Zaragozá , Fehmi Ben Abdesslem , Magnus Boman , Viktor Kaldo","doi":"10.1016/j.invent.2025.100879","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Predicting treatment outcome has the potential to enhance Internet-delivered Cognitive Behavioral Therapy (ICBT). One aspect of guided ICBT is patient-therapist interaction through written messages. With Natural language processing (NLP) these could be leveraged to predict outcome; however current evidence is limited. This study investigates the predictive accuracy of NLP models for treatment outcomes and evaluates whether NLP provides additional predictive value beyond symptom variables.</div></div><div><h3>Methods</h3><div>Patient-therapist messages from 6613 patients undergoing 12 weeks of treatment were used to train three types of NLP models: Term Frequency-Inverse Document Frequency (TF-IDF), Bidirectional Encoder Representations from transformers (BERT), and BERT for Longer Text (BELT). These were trained both with and without symptom variables from the initial treatment period to predict post-treatment symptoms. A dummy model was also used, and a linear regression model acted as a symptoms only benchmark. Multiple imputation addressed missing data, and nested cross-validation was used.</div></div><div><h3>Results</h3><div>The symptom only model performed best. Only BERT outperformed the dummy model, achieving a Root Mean Squared Error (RMSE) of 0.17 compared to RMSE of 0.18. Adding symptom variables to the BERT model significantly increased its accuracy, but not the RMSE metric. The best linear regression benchmark based on symptoms only had a BACC of 70 % (F<sub>1</sub>-score of 0.66) which outperformed the BERT model with 60 % (F<sub>1</sub>: 0.55) and the combined BERT plus symptoms model achieved 68 % (F<sub>1</sub>: 0.62).</div></div><div><h3>Conclusion</h3><div>These initial findings indicate a small predictive value from patient-therapist written message interaction but added no value beyond using only symptoms to predict post-treatment symptoms. Further research is needed to refine NLP-methods for use in psychological treatment, and more accurately assess the predictive potential of text-based interactions during ICBT.</div></div>","PeriodicalId":48615,"journal":{"name":"Internet Interventions-The Application of Information Technology in Mental and Behavioural Health","volume":"42 ","pages":"Article 100879"},"PeriodicalIF":4.1000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Interventions-The Application of Information Technology in Mental and Behavioural Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214782925000806","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Predicting treatment outcome has the potential to enhance Internet-delivered Cognitive Behavioral Therapy (ICBT). One aspect of guided ICBT is patient-therapist interaction through written messages. With Natural language processing (NLP) these could be leveraged to predict outcome; however current evidence is limited. This study investigates the predictive accuracy of NLP models for treatment outcomes and evaluates whether NLP provides additional predictive value beyond symptom variables.
Methods
Patient-therapist messages from 6613 patients undergoing 12 weeks of treatment were used to train three types of NLP models: Term Frequency-Inverse Document Frequency (TF-IDF), Bidirectional Encoder Representations from transformers (BERT), and BERT for Longer Text (BELT). These were trained both with and without symptom variables from the initial treatment period to predict post-treatment symptoms. A dummy model was also used, and a linear regression model acted as a symptoms only benchmark. Multiple imputation addressed missing data, and nested cross-validation was used.
Results
The symptom only model performed best. Only BERT outperformed the dummy model, achieving a Root Mean Squared Error (RMSE) of 0.17 compared to RMSE of 0.18. Adding symptom variables to the BERT model significantly increased its accuracy, but not the RMSE metric. The best linear regression benchmark based on symptoms only had a BACC of 70 % (F1-score of 0.66) which outperformed the BERT model with 60 % (F1: 0.55) and the combined BERT plus symptoms model achieved 68 % (F1: 0.62).
Conclusion
These initial findings indicate a small predictive value from patient-therapist written message interaction but added no value beyond using only symptoms to predict post-treatment symptoms. Further research is needed to refine NLP-methods for use in psychological treatment, and more accurately assess the predictive potential of text-based interactions during ICBT.
期刊介绍:
Official Journal of the European Society for Research on Internet Interventions (ESRII) and the International Society for Research on Internet Interventions (ISRII).
The aim of Internet Interventions is to publish scientific, peer-reviewed, high-impact research on Internet interventions and related areas.
Internet Interventions welcomes papers on the following subjects:
• Intervention studies targeting the promotion of mental health and featuring the Internet and/or technologies using the Internet as an underlying technology, e.g. computers, smartphone devices, tablets, sensors
• Implementation and dissemination of Internet interventions
• Integration of Internet interventions into existing systems of care
• Descriptions of development and deployment infrastructures
• Internet intervention methodology and theory papers
• Internet-based epidemiology
• Descriptions of new Internet-based technologies and experiments with clinical applications
• Economics of internet interventions (cost-effectiveness)
• Health care policy and Internet interventions
• The role of culture in Internet intervention
• Internet psychometrics
• Ethical issues pertaining to Internet interventions and measurements
• Human-computer interaction and usability research with clinical implications
• Systematic reviews and meta-analysis on Internet interventions