微调llm对数字化患者评估的自动化治疗质量的影响。

Npj mental health research Pub Date : 2025-09-13 DOI:10.1038/s44184-025-00159-1

Stav Yosef, Moreah Zisquit, Ben Cohen, Anat Brunstein Klomek, Kfir Bar, Doron Friedman

{"title":"微调llm对数字化患者评估的自动化治疗质量的影响。","authors":"Stav Yosef, Moreah Zisquit, Ben Cohen, Anat Brunstein Klomek, Kfir Bar, Doron Friedman","doi":"10.1038/s44184-025-00159-1","DOIUrl":null,"url":null,"abstract":"The use of generative large language models (LLMs) in mental health applications is gaining traction, with some proposals even suggesting LLM-based automated therapists. In this study, we assess the impact of fine-tuning therapist LLMs to improve the quality of therapy sessions, addressing a critical question in LLM-based mental health research. Specifically, we demonstrate that fine-tuning with datasets focused on specific therapeutic techniques significantly enhances the performance of LLM therapists. To facilitate this assessment, we introduce a novel evaluation system based on digital patients, powered by LLMs, which engage in text-based therapy sessions and provide session evaluations through questionnaires designed for human patients. This method addresses the inadequacies of traditional text-similarity metrics, which are insufficient for assessing the quality of therapeutic interactions. This study centers on motivational interviewing (MI), a structured and goal-oriented therapeutic approach. However, our digital therapists and patients can be adapted to work in other forms of therapy. We believe that our digital therapists offer a standardized method for assessing automated therapists and showcasing the potential of LLMs in mental health care.","PeriodicalId":74321,"journal":{"name":"Npj mental health research","volume":"4 1","pages":"43"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12433451/pdf/","citationCount":"0","resultStr":"{\"title\":\"The impact of fine-tuning LLMs on the quality of automated therapy assessed by digital patients.\",\"authors\":\"Stav Yosef, Moreah Zisquit, Ben Cohen, Anat Brunstein Klomek, Kfir Bar, Doron Friedman\",\"doi\":\"10.1038/s44184-025-00159-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of generative large language models (LLMs) in mental health applications is gaining traction, with some proposals even suggesting LLM-based automated therapists. In this study, we assess the impact of fine-tuning therapist LLMs to improve the quality of therapy sessions, addressing a critical question in LLM-based mental health research. Specifically, we demonstrate that fine-tuning with datasets focused on specific therapeutic techniques significantly enhances the performance of LLM therapists. To facilitate this assessment, we introduce a novel evaluation system based on digital patients, powered by LLMs, which engage in text-based therapy sessions and provide session evaluations through questionnaires designed for human patients. This method addresses the inadequacies of traditional text-similarity metrics, which are insufficient for assessing the quality of therapeutic interactions. This study centers on motivational interviewing (MI), a structured and goal-oriented therapeutic approach. However, our digital therapists and patients can be adapted to work in other forms of therapy. We believe that our digital therapists offer a standardized method for assessing automated therapists and showcasing the potential of LLMs in mental health care.\",\"PeriodicalId\":74321,\"journal\":{\"name\":\"Npj mental health research\",\"volume\":\"4 1\",\"pages\":\"43\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12433451/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Npj mental health research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1038/s44184-025-00159-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Npj mental health research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s44184-025-00159-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在心理健康应用中使用生成式大语言模型（llm）正获得越来越多的关注，有些人甚至建议使用基于llm的自动治疗师。在本研究中，我们评估了微调治疗师法学硕士对提高治疗质量的影响，解决了基于法学硕士的心理健康研究中的一个关键问题。具体来说，我们证明了针对特定治疗技术的数据集的微调显著提高了LLM治疗师的表现。为了促进这种评估，我们引入了一种基于数字患者的新型评估系统，该系统由法学硕士提供支持，参与基于文本的治疗课程，并通过为人类患者设计的问卷提供课程评估。该方法解决了传统文本相似度度量的不足，不足以评估治疗相互作用的质量。动机访谈是一种结构化的目标导向治疗方法。然而，我们的数字治疗师和患者可以适应其他形式的治疗。我们相信，我们的数字治疗师提供了一种标准化的方法来评估自动化治疗师，并展示了法学硕士在精神卫生保健方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

The impact of fine-tuning LLMs on the quality of automated therapy assessed by digital patients.

查看原文本刊更多论文

The impact of fine-tuning LLMs on the quality of automated therapy assessed by digital patients.

The use of generative large language models (LLMs) in mental health applications is gaining traction, with some proposals even suggesting LLM-based automated therapists. In this study, we assess the impact of fine-tuning therapist LLMs to improve the quality of therapy sessions, addressing a critical question in LLM-based mental health research. Specifically, we demonstrate that fine-tuning with datasets focused on specific therapeutic techniques significantly enhances the performance of LLM therapists. To facilitate this assessment, we introduce a novel evaluation system based on digital patients, powered by LLMs, which engage in text-based therapy sessions and provide session evaluations through questionnaires designed for human patients. This method addresses the inadequacies of traditional text-similarity metrics, which are insufficient for assessing the quality of therapeutic interactions. This study centers on motivational interviewing (MI), a structured and goal-oriented therapeutic approach. However, our digital therapists and patients can be adapted to work in other forms of therapy. We believe that our digital therapists offer a standardized method for assessing automated therapists and showcasing the potential of LLMs in mental health care.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Npj mental health research

自引率

0.00%

发文量