Evaluation of an AI-Based Voice Biomarker Tool to Detect Signals Consistent With Moderate to Severe Depression.

IF 4.4 2区医学 Q1 MEDICINE, GENERAL & INTERNAL

Annals of Family Medicine Pub Date : 2025-01-27 DOI:10.1370/afm.240091

Alexa Mazur, Harrison Costantino, Prentice Tom, Michael P Wilson, Ronald G Thompson

{"title":"Evaluation of an AI-Based Voice Biomarker Tool to Detect Signals Consistent With Moderate to Severe Depression.","authors":"Alexa Mazur, Harrison Costantino, Prentice Tom, Michael P Wilson, Ronald G Thompson","doi":"10.1370/afm.240091","DOIUrl":null,"url":null,"abstract":"Purpose: Mental health screening is recommended by the US Preventive Services Task Force for all patients in areas where treatment options are available. Still, it is estimated that only 4% of primary care patients are screened for depression. The goal of this study was to evaluate the efficacy of machine learning technology (Kintsugi Voice, v1, Kintsugi Mindful Wellness, Inc) to detect and analyze voice biomarkers consistent with moderate to severe depression, potentially allowing for greater compliance with this critical primary care public health need.Methods: We performed a cross-sectional study from February 1, 2021 to July 31, 2022 to examine ≥25 seconds of free-form speech content from English-speaking samples captured from 14,898 unique adults in the United States and Canada. Participants were recruited via social media, provided informed consent, and their voice biomarker results were compared with a self-reported Patient Health Questionnaire-9 (PHQ-9) at a cut-off score of 10 (moderate to severe depression).Results: From as few as 25 seconds of free-form speech, machine learning technology was able to detect vocal characteristics consistent with an increased PHQ-9 ≥10, with a sensitivity of 71.3 (95% CI, 69.0-73.5) and a specificity of 73.5 (95% CI, 71.5-75.5).Conclusions: Machine learning has potential utility in helping clinicians screen patients for moderate to severe depression. Further research is needed to measure the effectiveness of machine learning vocal detection and analysis technology in clinical deployment.","PeriodicalId":50973,"journal":{"name":"Annals of Family Medicine","volume":" ","pages":"60-65"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11772039/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Family Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1370/afm.240091","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Mental health screening is recommended by the US Preventive Services Task Force for all patients in areas where treatment options are available. Still, it is estimated that only 4% of primary care patients are screened for depression. The goal of this study was to evaluate the efficacy of machine learning technology (Kintsugi Voice, v1, Kintsugi Mindful Wellness, Inc) to detect and analyze voice biomarkers consistent with moderate to severe depression, potentially allowing for greater compliance with this critical primary care public health need.

Methods: We performed a cross-sectional study from February 1, 2021 to July 31, 2022 to examine ≥25 seconds of free-form speech content from English-speaking samples captured from 14,898 unique adults in the United States and Canada. Participants were recruited via social media, provided informed consent, and their voice biomarker results were compared with a self-reported Patient Health Questionnaire-9 (PHQ-9) at a cut-off score of 10 (moderate to severe depression).

Results: From as few as 25 seconds of free-form speech, machine learning technology was able to detect vocal characteristics consistent with an increased PHQ-9 ≥10, with a sensitivity of 71.3 (95% CI, 69.0-73.5) and a specificity of 73.5 (95% CI, 71.5-75.5).

Conclusions: Machine learning has potential utility in helping clinicians screen patients for moderate to severe depression. Further research is needed to measure the effectiveness of machine learning vocal detection and analysis technology in clinical deployment.

查看原文本刊更多论文

评估一种基于人工智能的语音生物标记工具，以检测与中度至重度抑郁症一致的信号。

目的：美国预防服务工作组建议在有治疗方案的地区对所有患者进行心理健康筛查。尽管如此，据估计，只有4%的初级保健患者接受了抑郁症筛查。本研究的目的是评估机器学习技术（Kintsugi Voice, v1, Kintsugi Mindful Wellness, Inc）在检测和分析与中度至重度抑郁症一致的语音生物标志物方面的功效，从而有可能更好地满足这一关键的初级保健公共卫生需求。方法：我们从2021年2月1日至2022年7月31日进行了一项横断面研究，检查了从美国和加拿大的14,898名成年人中捕获的英语样本中≥25秒的自由形式言论内容。参与者通过社交媒体招募，提供知情同意，他们的语音生物标志物结果与自我报告的患者健康问卷-9 （PHQ-9）进行比较，分值为10分（中度至重度抑郁症）。结果：从25秒的自由语音中，机器学习技术能够检测到与PHQ-9≥10增加一致的声音特征，灵敏度为71.3 (95% CI, 69.0-73.5)，特异性为73.5 （95% CI, 71.5-75.5）。结论：机器学习在帮助临床医生筛选中度至重度抑郁症患者方面具有潜在的效用。机器学习语音检测分析技术在临床应用中的有效性有待进一步研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annals of Family Medicine 医学-医学：内科

CiteScore

3.70

自引率

4.50%

发文量

142

审稿时长

6-12 weeks

期刊介绍： The Annals of Family Medicine is a peer-reviewed research journal to meet the needs of scientists, practitioners, policymakers, and the patients and communities they serve.