Synthetic patient and interview transcript creator: an essential tool for LLMs in mental health

Aleyna Warner, Jeffrey LeDue, Yutong Cao, Joseph Tham, Timothy H Murphy

Frontiers in Digital Health, vol. 7, article 1625444. Published 2025-09-11 (eCollection 2025). DOI: 10.3389/fdgth.2025.1625444
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460306/pdf/
Abstract
Developing high-quality training data is essential for tailoring large language models (LLMs) to specialized applications like mental health. To address the privacy and legal constraints associated with real patient data, we designed a synthetic patient and interview generation framework that can be tailored to regional patient demographics. The system employs two locally run instances of Llama 3.3:70B: one as the interviewer and the other as the patient. These models produce contextually rich interview transcripts, structured by a customizable question bank, with lexical diversity similar to normal human conversation. We calculate median Distinct-1 scores of 0.44 and 0.33 for the patient and interview assistant model outputs, respectively, compared with an average of 0.50 ± 0.11 across 10,000 episodes of a radio program dialog. Central to this approach is the patient generation process, which begins with a locally run Llama 3.3:70B model. Given the full question bank, the model generates a detailed profile template, combining predefined variables (e.g., demographic data or specific conditions) with LLM-generated content that fills in contextual details. This hybrid method ensures that each patient profile is both diverse and realistic, providing a strong foundation for generating dynamic interactions. Demographic distributions of generated patient profiles were not significantly different from real-world population data and exhibited the expected variability. Additionally, we assessed lexical diversity for the patient profiles themselves and found an average Distinct-1 score of 0.8 (maximum = 1), indicating diverse word usage. By integrating detailed patient generation with dynamic interviewing, the framework produces synthetic datasets that may aid the adoption and deployment of LLMs in mental health settings.
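The two-instance setup described above, one model playing the interviewer and one playing the patient, can be sketched as a simple turn loop. This is a hypothetical reconstruction, not the authors' code: `chat_fn` stands in for any call to a locally run Llama 3.3:70B instance (e.g., via a local inference server), and the interviewer role is simplified to iterating the question bank, whereas in the paper the interviewer is itself an LLM guided by the bank.

```python
from typing import Callable, Dict, List, Tuple

Message = Dict[str, str]

def run_interview(
    chat_fn: Callable[[List[Message]], str],  # wraps a local LLM call (assumption)
    question_bank: List[str],                 # the customizable question bank
    patient_profile: str,                     # generated patient profile text
) -> List[Tuple[str, str]]:
    """Alternate interviewer questions with patient-model answers."""
    transcript: List[Tuple[str, str]] = []
    # The patient model is conditioned on its synthetic profile.
    history: List[Message] = [{
        "role": "system",
        "content": "You are a patient in a mental-health intake interview.\n"
                   + patient_profile,
    }]
    for question in question_bank:
        transcript.append(("interviewer", question))
        history.append({"role": "user", "content": question})
        answer = chat_fn(history)             # patient model's turn
        history.append({"role": "assistant", "content": answer})
        transcript.append(("patient", answer))
    return transcript
```

With a stub in place of `chat_fn`, the orchestration can be exercised without running a model, which is also how the loop can be unit-tested before plugging in the real backend.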
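The Distinct-1 scores reported above are the standard diversity metric: the number of unique unigrams divided by the total number of tokens. A minimal sketch follows; whitespace tokenization and lowercasing are assumptions here, as the abstract does not specify the tokenizer used.

```python
def distinct_1(text: str) -> float:
    """Ratio of unique unigrams to total unigrams (0 to 1)."""
    tokens = text.lower().split()   # whitespace tokenization is an assumption
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)
```

A fully repetitive utterance scores near 0, while a text with no repeated words scores exactly 1, which is why the reported patient-profile average of 0.8 indicates diverse word usage.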
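The hybrid profile generation step, predefined variables combined with LLM-generated contextual detail, might look like the following sketch. The field names and the `llm_fill` callable are illustrative assumptions; the paper's actual template is derived by the model from the full question bank.

```python
import random
from typing import Callable, Dict, List

def build_profile(
    llm_fill: Callable[[str], str],        # wraps the local LLM (assumption)
    demographics: Dict[str, List[str]],    # predefined variable pools, e.g. regional
) -> Dict[str, str]:
    """Hybrid profile: sampled predefined variables plus LLM-written narrative."""
    # Structured fields are sampled from predefined, region-tailorable pools.
    profile = {field: random.choice(values) for field, values in demographics.items()}
    # Free-text context is delegated to the LLM, conditioned on the fixed fields.
    context = ", ".join(f"{k}={v}" for k, v in profile.items())
    profile["history"] = llm_fill(
        f"Write a brief psychosocial history for a patient with: {context}"
    )
    return profile
```

Keeping demographics as explicit sampled variables is what lets the generated population be checked against real-world distributions, as the authors report.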