New Doc on the Block: Scoping Review of AI Systems Delivering Motivational Interviewing for Health Behavior Change.

IF 6 2区医学 Q1 HEALTH CARE SCIENCES & SERVICES

Journal of Medical Internet Research Pub Date : 2025-09-16 DOI:10.2196/78417

Zev Karve, Jacob Calpey, Christopher Machado, Michelle Knecht, Maria Carmenza Mejia

{"title":"New Doc on the Block: Scoping Review of AI Systems Delivering Motivational Interviewing for Health Behavior Change.","authors":"Zev Karve, Jacob Calpey, Christopher Machado, Michelle Knecht, Maria Carmenza Mejia","doi":"10.2196/78417","DOIUrl":null,"url":null,"abstract":"Background: Artificial intelligence (AI) is increasingly used in digital health, particularly through large language models (LLMs), to support patient engagement and behavior change. One novel application is the delivery of motivational interviewing (MI), an evidence-based, patient-centered counseling technique designed to enhance motivation and resolve ambivalence around health behaviors. AI tools, including chatbots, mobile apps, and web-based agents, are being developed to simulate MI techniques at scale. While these innovations are promising, important questions remain about how faithfully AI systems can replicate MI principles or achieve meaningful behavioral impact.Objective: This scoping review aimed to summarize existing empirical studies evaluating AI-driven systems that apply MI techniques to support health behavior change. Specifically, we examined the feasibility of these systems; their fidelity to MI principles; and their reported behavioral, psychological, or engagement outcomes.Methods: We systematically searched PubMed, Embase, Scopus, Web of Science, and Cochrane Library for empirical studies published between January 1, 2018, and February 25, 2025. Eligible studies involved AI-driven systems using natural language generation, understanding, or computational logic to deliver MI techniques to users targeting a specific health behavior. We excluded studies using AI solely for training clinicians in MI. Three independent reviewers screened and extracted data on study design, AI modality and type, MI components, health behavior focus, MI fidelity assessment, and outcome domains.Results: Of the 1001 records identified, 15 (1.5%) met the inclusion criteria. Of these 15 studies, 6 (40%) were exploratory feasibility or pilot studies, and 3 (20%) were randomized controlled trials. AI modalities included rule-based chatbots (9/15, 60%), LLM-based systems (4/15, 27%), and virtual or mobile agents (2/15, 13%). Targeted behaviors included smoking cessation (6/15, 40%), substance use (3/15, 20%), COVID-19 vaccine hesitancy, type 2 diabetes self-management, stress, mental health service use, and opioid use during pregnancy. Of the 15 studies, 13 (87%) reported positive findings on feasibility or user acceptability, while 6 (40%) assessed MI fidelity using expert review or structured coding, with moderate to high alignment reported. Several studies found that users perceived the AI systems as judgment free, supportive, and easier to engage with than human counselors, particularly in stigmatized contexts. However, limitations in empathy, safety transparency, and emotional nuance were commonly noted. Only 3 (20%) of the 15 studies reported substantially significant behavioral changes.Conclusions: AI systems delivering MI show promise for enhancing patient engagement and scaling behavior change interventions. Early evidence supports their usability and partial fidelity to MI principles, especially in sensitive domains. However, most systems remain in early development, and few have been rigorously tested. Future research should prioritize randomized evaluations; standardized fidelity measures; and safeguards for LLM safety, empathy, and accuracy in health-related dialogue.Trial registration: OSF Registries 10.17605/OSF.IO/G9N7E; https://osf.io/g9n7e.","PeriodicalId":16337,"journal":{"name":"Journal of Medical Internet Research","volume":"27 ","pages":"e78417"},"PeriodicalIF":6.0000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12485255/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Internet Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/78417","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Artificial intelligence (AI) is increasingly used in digital health, particularly through large language models (LLMs), to support patient engagement and behavior change. One novel application is the delivery of motivational interviewing (MI), an evidence-based, patient-centered counseling technique designed to enhance motivation and resolve ambivalence around health behaviors. AI tools, including chatbots, mobile apps, and web-based agents, are being developed to simulate MI techniques at scale. While these innovations are promising, important questions remain about how faithfully AI systems can replicate MI principles or achieve meaningful behavioral impact.

Objective: This scoping review aimed to summarize existing empirical studies evaluating AI-driven systems that apply MI techniques to support health behavior change. Specifically, we examined the feasibility of these systems; their fidelity to MI principles; and their reported behavioral, psychological, or engagement outcomes.

Methods: We systematically searched PubMed, Embase, Scopus, Web of Science, and Cochrane Library for empirical studies published between January 1, 2018, and February 25, 2025. Eligible studies involved AI-driven systems using natural language generation, understanding, or computational logic to deliver MI techniques to users targeting a specific health behavior. We excluded studies using AI solely for training clinicians in MI. Three independent reviewers screened and extracted data on study design, AI modality and type, MI components, health behavior focus, MI fidelity assessment, and outcome domains.

Results: Of the 1001 records identified, 15 (1.5%) met the inclusion criteria. Of these 15 studies, 6 (40%) were exploratory feasibility or pilot studies, and 3 (20%) were randomized controlled trials. AI modalities included rule-based chatbots (9/15, 60%), LLM-based systems (4/15, 27%), and virtual or mobile agents (2/15, 13%). Targeted behaviors included smoking cessation (6/15, 40%), substance use (3/15, 20%), COVID-19 vaccine hesitancy, type 2 diabetes self-management, stress, mental health service use, and opioid use during pregnancy. Of the 15 studies, 13 (87%) reported positive findings on feasibility or user acceptability, while 6 (40%) assessed MI fidelity using expert review or structured coding, with moderate to high alignment reported. Several studies found that users perceived the AI systems as judgment free, supportive, and easier to engage with than human counselors, particularly in stigmatized contexts. However, limitations in empathy, safety transparency, and emotional nuance were commonly noted. Only 3 (20%) of the 15 studies reported substantially significant behavioral changes.

Conclusions: AI systems delivering MI show promise for enhancing patient engagement and scaling behavior change interventions. Early evidence supports their usability and partial fidelity to MI principles, especially in sensitive domains. However, most systems remain in early development, and few have been rigorously tested. Future research should prioritize randomized evaluations; standardized fidelity measures; and safeguards for LLM safety, empathy, and accuracy in health-related dialogue.

Trial registration: OSF Registries 10.17605/OSF.IO/G9N7E; https://osf.io/g9n7e.

查看原文本刊更多论文

新文件：为健康行为改变提供动机性访谈的人工智能系统的范围审查。

背景：人工智能（AI）越来越多地用于数字健康，特别是通过大型语言模型（llm）来支持患者参与和行为改变。一个新颖的应用是动机性访谈（MI）的传递，这是一种基于证据的、以患者为中心的咨询技术，旨在增强动机和解决围绕健康行为的矛盾心理。包括聊天机器人、移动应用程序和基于网络的代理在内的人工智能工具正在开发中，以大规模模拟人工智能技术。虽然这些创新很有希望，但重要的问题仍然存在，即人工智能系统如何忠实地复制人工智能原则或实现有意义的行为影响。目的：本综述旨在总结现有的实证研究，评估应用MI技术支持健康行为改变的人工智能驱动系统。具体来说，我们考察了这些系统的可行性；他们对MI原则的忠诚；以及他们报告的行为、心理或参与结果。方法：系统检索PubMed、Embase、Scopus、Web of Science和Cochrane Library，检索2018年1月1日至2025年2月25日期间发表的实证研究。符合条件的研究涉及使用自然语言生成、理解或计算逻辑的人工智能驱动系统，向针对特定健康行为的用户提供人工智能技术。我们排除了仅使用人工智能培训临床医生心肌梗死的研究。三位独立的审评者筛选并提取了研究设计、人工智能模式和类型、心肌梗死成分、健康行为焦点、心肌梗死保真度评估和结果域的数据。结果：1001条记录中，15条（1.5%）符合纳入标准。在这15项研究中，6项（40%）为探索性可行性研究或试点研究，3项（20%）为随机对照试验。人工智能模式包括基于规则的聊天机器人（9/ 15,60 %），基于法学硕士的系统（4/ 15,27 %）和虚拟或移动代理（2/ 15,13 %）。目标行为包括戒烟（6/ 15,40 %）、药物使用（3/ 15,20 %）、COVID-19疫苗犹豫、2型糖尿病自我管理、压力、心理卫生服务使用和怀孕期间阿片类药物使用。在15项研究中，13项（87%）报告了可行性或用户可接受性方面的积极发现，而6项（40%）使用专家评审或结构化编码评估了MI保真度，报告了中度至高度的一致性。几项研究发现，用户认为人工智能系统没有判断力，支持，而且比人类咨询师更容易接触，特别是在污名化的情况下。然而，人们普遍注意到移情、安全透明度和情感细微差别方面的局限性。15项研究中只有3项（20%）报告了显著的行为改变。结论：提供MI的人工智能系统有望提高患者参与度和扩大行为改变干预措施。早期的证据支持它们的可用性和部分忠于人工智能原则，特别是在敏感领域。然而，大多数系统仍处于早期开发阶段，很少经过严格的测试。未来的研究应优先考虑随机评价；标准化保真度测量；以及在与健康相关的对话中保障法学硕士的安全性、同理心和准确性。试验注册：OSF registres10.17605 /OSF. io /G9N7E；https://osf.io/g9n7e。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Internet Research 医学-卫生保健

CiteScore

14.40

自引率

5.40%

发文量

654

审稿时长

1 months

期刊介绍： The Journal of Medical Internet Research (JMIR) is a highly respected publication in the field of health informatics and health services. With a founding date in 1999, JMIR has been a pioneer in the field for over two decades. As a leader in the industry, the journal focuses on digital health, data science, health informatics, and emerging technologies for health, medicine, and biomedical research. It is recognized as a top publication in these disciplines, ranking in the first quartile (Q1) by Impact Factor. Notably, JMIR holds the prestigious position of being ranked #1 on Google Scholar within the "Medical Informatics" discipline.