Tobias Steinbrenner , Christopher Lalk , Kim Targan , Jana Schaffrath , Steffen Eberhardt , Anke Haberkamp , Wolfgang Lutz , Julian Rubel
{"title":"解释心理治疗记录中的焦虑预测:患者语言特征和理论结构的作用。","authors":"Tobias Steinbrenner , Christopher Lalk , Kim Targan , Jana Schaffrath , Steffen Eberhardt , Anke Haberkamp , Wolfgang Lutz , Julian Rubel","doi":"10.1016/j.brat.2025.104857","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Linguistics can be a helpful tool when researching psychological processes and symptoms. This study aimed to predict anxiety severity from patient language in psychotherapy transcripts. In contrast to prior work focusing on isolated feature types, we combine theory-driven psychological constructs with state-of-the-art NLP and machine learning techniques to enhance both performance and interpretability. Specifically, we asked (1) how well anxiety can be predicted, (2) which models perform best, (3) which features are most important.</div></div><div><h3>Method</h3><div>We extracted LIWC features, unigrams and bigrams, Transformer emotions, and topics from 529 psychotherapy transcripts from 118 patients. In addition, we constructed theory-driven features to measure negative self-focused attention, self-insight, future focus of perceived threat, and uncertainty avoidance. Each feature set was modeled using multiple machine learning algorithms. To gain insights into the most informative predictors of anxiety, eXplainable Artificial Intelligence was applied.</div></div><div><h3>Results</h3><div>The Unigram-Bigram Model achieved the best predictive performance (<em>r</em> = .77; <em>95 %-CI</em> = .75–0.80). However, the Anxiety Process Model achieved notable predictive accuracy despite having only four interpretable, theory-based features. Features related to leisure, social relationships, and insecurity were associated with lower anxiety severity, health-related features and “certainty words” (e.g., totally) with higher severity.</div></div><div><h3>Conclusions</h3><div>Our findings highlight a trade-off between performance and interpretability. Unigram-bigram models maximized predictive accuracy, whereas theory-driven constructs provided clinically meaningful insights into core psychological processes. Identifying predictive linguistic features, especially those linked to psychological theory, may guide future research on feedback systems and clinical applications by providing interpretable and theory-aligned insights.</div></div>","PeriodicalId":48457,"journal":{"name":"Behaviour Research and Therapy","volume":"194 ","pages":"Article 104857"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Explaining anxiety prediction in psychotherapy transcripts: The role of patient linguistic features and theoretical constructs\",\"authors\":\"Tobias Steinbrenner , Christopher Lalk , Kim Targan , Jana Schaffrath , Steffen Eberhardt , Anke Haberkamp , Wolfgang Lutz , Julian Rubel\",\"doi\":\"10.1016/j.brat.2025.104857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>Linguistics can be a helpful tool when researching psychological processes and symptoms. This study aimed to predict anxiety severity from patient language in psychotherapy transcripts. In contrast to prior work focusing on isolated feature types, we combine theory-driven psychological constructs with state-of-the-art NLP and machine learning techniques to enhance both performance and interpretability. Specifically, we asked (1) how well anxiety can be predicted, (2) which models perform best, (3) which features are most important.</div></div><div><h3>Method</h3><div>We extracted LIWC features, unigrams and bigrams, Transformer emotions, and topics from 529 psychotherapy transcripts from 118 patients. In addition, we constructed theory-driven features to measure negative self-focused attention, self-insight, future focus of perceived threat, and uncertainty avoidance. Each feature set was modeled using multiple machine learning algorithms. To gain insights into the most informative predictors of anxiety, eXplainable Artificial Intelligence was applied.</div></div><div><h3>Results</h3><div>The Unigram-Bigram Model achieved the best predictive performance (<em>r</em> = .77; <em>95 %-CI</em> = .75–0.80). However, the Anxiety Process Model achieved notable predictive accuracy despite having only four interpretable, theory-based features. Features related to leisure, social relationships, and insecurity were associated with lower anxiety severity, health-related features and “certainty words” (e.g., totally) with higher severity.</div></div><div><h3>Conclusions</h3><div>Our findings highlight a trade-off between performance and interpretability. Unigram-bigram models maximized predictive accuracy, whereas theory-driven constructs provided clinically meaningful insights into core psychological processes. Identifying predictive linguistic features, especially those linked to psychological theory, may guide future research on feedback systems and clinical applications by providing interpretable and theory-aligned insights.</div></div>\",\"PeriodicalId\":48457,\"journal\":{\"name\":\"Behaviour Research and Therapy\",\"volume\":\"194 \",\"pages\":\"Article 104857\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behaviour Research and Therapy\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0005796725001792\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, CLINICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behaviour Research and Therapy","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0005796725001792","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
Explaining anxiety prediction in psychotherapy transcripts: The role of patient linguistic features and theoretical constructs
Objective
Linguistics can be a helpful tool when researching psychological processes and symptoms. This study aimed to predict anxiety severity from patient language in psychotherapy transcripts. In contrast to prior work focusing on isolated feature types, we combine theory-driven psychological constructs with state-of-the-art NLP and machine learning techniques to enhance both performance and interpretability. Specifically, we asked (1) how well anxiety can be predicted, (2) which models perform best, (3) which features are most important.
Method
We extracted LIWC features, unigrams and bigrams, Transformer emotions, and topics from 529 psychotherapy transcripts from 118 patients. In addition, we constructed theory-driven features to measure negative self-focused attention, self-insight, future focus of perceived threat, and uncertainty avoidance. Each feature set was modeled using multiple machine learning algorithms. To gain insights into the most informative predictors of anxiety, eXplainable Artificial Intelligence was applied.
Results
The Unigram-Bigram Model achieved the best predictive performance (r = .77; 95 %-CI = .75–0.80). However, the Anxiety Process Model achieved notable predictive accuracy despite having only four interpretable, theory-based features. Features related to leisure, social relationships, and insecurity were associated with lower anxiety severity, health-related features and “certainty words” (e.g., totally) with higher severity.
Conclusions
Our findings highlight a trade-off between performance and interpretability. Unigram-bigram models maximized predictive accuracy, whereas theory-driven constructs provided clinically meaningful insights into core psychological processes. Identifying predictive linguistic features, especially those linked to psychological theory, may guide future research on feedback systems and clinical applications by providing interpretable and theory-aligned insights.
期刊介绍:
The major focus of Behaviour Research and Therapy is an experimental psychopathology approach to understanding emotional and behavioral disorders and their prevention and treatment, using cognitive, behavioral, and psychophysiological (including neural) methods and models. This includes laboratory-based experimental studies with healthy, at risk and subclinical individuals that inform clinical application as well as studies with clinically severe samples. The following types of submissions are encouraged: theoretical reviews of mechanisms that contribute to psychopathology and that offer new treatment targets; tests of novel, mechanistically focused psychological interventions, especially ones that include theory-driven or experimentally-derived predictors, moderators and mediators; and innovations in dissemination and implementation of evidence-based practices into clinical practice in psychology and associated fields, especially those that target underlying mechanisms or focus on novel approaches to treatment delivery. In addition to traditional psychological disorders, the scope of the journal includes behavioural medicine (e.g., chronic pain). The journal will not consider manuscripts dealing primarily with measurement, psychometric analyses, and personality assessment.