Optimizing Voice Sample Quantity and Recording Settings for the Prediction of Type 2 Diabetes Mellitus: Retrospective Study.

JMIR biomedical engineering Pub Date : 2025-06-26 DOI:10.2196/64357

Atousa Assadi, Jessica Oreskovic, Jaycee Kaufman, Yan Fossat

{"title":"Optimizing Voice Sample Quantity and Recording Settings for the Prediction of Type 2 Diabetes Mellitus: Retrospective Study.","authors":"Atousa Assadi, Jessica Oreskovic, Jaycee Kaufman, Yan Fossat","doi":"10.2196/64357","DOIUrl":null,"url":null,"abstract":"Background: The use of acoustic biomarkers derived from speech signals is a promising non-invasive technique for diagnosing type 2 diabetes mellitus (T2DM). Despite its potential, there remains a critical gap in knowledge regarding the optimal number of voice recordings and recording schedule necessary to achieve effective diagnostic accuracy.Objective: This study aimed to determine the optimal number of voice samples and the ideal recording schedule (frequency and timing), required to maintain the T2DM diagnostic efficacy while reducing patient burden.Methods: We analyzed voice recordings from 78 adults (22 women), including 39 individuals diagnosed with T2DM. Participants had a mean (SD) age of 45.26 (10.63) years and mean (SD) BMI of 28.07 (4.59) kg/m². In total, 5035 voice recordings were collected, with a mean (SD) of 4.91 (1.45) recordings per day; higher adherence was observed among women (5.13 [1.38] vs 4.82 [1.46] in men). We evaluated the diagnostic accuracy of a previously developed voice-based model under different recording conditions. Segmented linear regression analysis was used to assess model accuracy across varying numbers of voice recordings, and the Kendall tau correlation was used to measure the relationship between recording settings and accuracy. A significance threshold of P<.05 was applied.Results: Our results showed that including up to 6 voice recordings notably improved the model accuracy for T2DM compared to using only one recording, with accuracy increasing from 59.61 to 65.02 for men and from 65.55 to 69.43 for women. Additionally, the day on which voice recordings were collected did not significantly affect model accuracy (P>.05). However, adhering to recording within a single day demonstrated higher accuracy, with accuracy of 73.95% for women and 85.48% for men when all recordings were from the first and second days.Conclusions: This study underscores the optimal voice recording settings to reduce patient burden while maintaining diagnostic efficacy.","PeriodicalId":87288,"journal":{"name":"JMIR biomedical engineering","volume":"10 ","pages":"e64357"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12226960/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR biomedical engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/64357","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The use of acoustic biomarkers derived from speech signals is a promising non-invasive technique for diagnosing type 2 diabetes mellitus (T2DM). Despite its potential, there remains a critical gap in knowledge regarding the optimal number of voice recordings and recording schedule necessary to achieve effective diagnostic accuracy.

Objective: This study aimed to determine the optimal number of voice samples and the ideal recording schedule (frequency and timing), required to maintain the T2DM diagnostic efficacy while reducing patient burden.

Methods: We analyzed voice recordings from 78 adults (22 women), including 39 individuals diagnosed with T2DM. Participants had a mean (SD) age of 45.26 (10.63) years and mean (SD) BMI of 28.07 (4.59) kg/m². In total, 5035 voice recordings were collected, with a mean (SD) of 4.91 (1.45) recordings per day; higher adherence was observed among women (5.13 [1.38] vs 4.82 [1.46] in men). We evaluated the diagnostic accuracy of a previously developed voice-based model under different recording conditions. Segmented linear regression analysis was used to assess model accuracy across varying numbers of voice recordings, and the Kendall tau correlation was used to measure the relationship between recording settings and accuracy. A significance threshold of P<.05 was applied.

Results: Our results showed that including up to 6 voice recordings notably improved the model accuracy for T2DM compared to using only one recording, with accuracy increasing from 59.61 to 65.02 for men and from 65.55 to 69.43 for women. Additionally, the day on which voice recordings were collected did not significantly affect model accuracy (P>.05). However, adhering to recording within a single day demonstrated higher accuracy, with accuracy of 73.95% for women and 85.48% for men when all recordings were from the first and second days.

Conclusions: This study underscores the optimal voice recording settings to reduce patient burden while maintaining diagnostic efficacy.

查看原文本刊更多论文

优化语音样本量和录音设置预测2型糖尿病：回顾性研究。

背景：使用来自语音信号的声学生物标志物是诊断2型糖尿病（T2DM）的一种很有前途的非侵入性技术。尽管有潜力，但在实现有效诊断准确性所需的最佳录音数量和录音时间表方面，知识仍然存在重大差距。目的：本研究旨在确定维持T2DM诊断疗效同时减轻患者负担所需的最佳语音样本数量和理想录音时间表（频率和时间）。方法：我们分析了78名成年人（22名女性）的录音，其中包括39名诊断为T2DM的人。参与者的平均（SD）年龄为45.26（10.63）岁，平均（SD） BMI为28.07 (4.59)kg/m²。共收集到5035份录音，平均（SD）为4.91（1.45）份/天；女性患者的依从性更高（5.13 [1.38]vs 4.82[1.46]）。我们评估了先前开发的基于语音的模型在不同录音条件下的诊断准确性。使用分段线性回归分析来评估不同数量录音的模型准确性，并使用肯德尔tau相关来衡量录音设置与准确性之间的关系。结果的显著性阈值：我们的结果表明，与只使用一个录音相比，包含多达6个录音的T2DM模型的准确性显着提高，男性的准确性从59.61提高到65.02，女性的准确性从65.55提高到69.43。此外，收集录音的日期对模型的准确性没有显著影响（P < 0.05）。然而，坚持在一天内记录的准确率更高，当所有记录都在第一天和第二天时，女性的准确率为73.95%，男性的准确率为85.48%。结论：本研究强调了最佳的录音设置，以减轻患者负担，同时保持诊断疗效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JMIR biomedical engineering

自引率

0.00%

发文量

审稿时长

20 weeks