Digital Vocal Biomarker of Smoking Status Using Ecological Audio Recordings: Results from the Colive Voice Study

Q1 Computer Science
Digital Biomarkers. Pub Date: 2024-08-28. eCollection Date: 2024-01-01. DOI: 10.1159/000540327
Hanin Ayadi, Abir Elbéji, Vladimir Despotovic, Guy Fagherazzi
Digital Biomarkers, volume 8 (1), pages 159-170. Publication type: Journal Article. Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11521430/pdf/
Citation count: 0

Abstract
Digital Vocal Biomarker of Smoking Status Using Ecological Audio Recordings: Results from the Colive Voice Study.

Introduction: The complex health, social, and economic consequences of tobacco smoking underscore the importance of incorporating reliable and scalable data collection on smoking status and habits into research across various disciplines. Given that smoking impacts voice production, we aimed to develop a gender- and language-specific vocal biomarker of smoking status.

Methods: Leveraging data from the Colive Voice study, we used statistical analysis methods to quantify the effects of smoking on voice characteristics. Various voice feature extraction methods combined with machine learning algorithms were then used to produce a gender- and language-specific (English and French) digital vocal biomarker to differentiate smokers from never-smokers.
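As an illustration of what a voice feature extraction step can look like, the sketch below estimates the fundamental frequency (F0) of a short signal by picking the strongest autocorrelation peak. This is a minimal example under stated assumptions: the abstract does not say which extraction method the authors used, and the function name `estimate_f0`, the search range of 75 to 400 Hz, and the synthetic 200 Hz test tone are all hypothetical choices made for this example.

```python
import math


def estimate_f0(samples, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate F0 in Hz from the lag with the largest autocorrelation.

    Hypothetical helper for illustration; not the method used in the study.
    """
    lag_min = int(sample_rate / f_max)  # shortest period considered
    lag_max = int(sample_rate / f_min)  # longest period considered
    if len(samples) <= lag_max:
        raise ValueError("signal is too short for the requested F0 range")

    # Remove the DC offset so the autocorrelation reflects periodicity only.
    mean = sum(samples) / len(samples)
    centered = [s - mean for s in samples]

    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        score = sum(
            centered[i] * centered[i + lag]
            for i in range(len(centered) - lag)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic 200 Hz tone, 0.1 s at 8 kHz (assumed values, not study data).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200.0 * i / sample_rate) for i in range(800)]
print(f"estimated F0: {estimate_f0(tone, sample_rate):.1f} Hz")
```

Real speech is noisier than a pure tone, so practical pipelines usually analyse short overlapping frames and discard unvoiced segments before summarising F0 per recording.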

Results: A total of 1,332 participants were included after propensity score matching (mean age = 43.6 [13.65]; 64.41% were female, 56.68% were English speakers, 50% were smokers, and 50% were never-smokers). We observed differences in the distribution of voice features: for women, the fundamental frequency (F0), the frequencies of formants F1, F2, and F3, and the harmonics-to-noise ratio were lower in smokers than in never-smokers (p < 0.05), whereas for men no significant differences were noted between the two groups. The accuracy and AUC of smoking status prediction reached 0.71 and 0.76, respectively, for female participants, and 0.65 and 0.68, respectively, for male participants.
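For readers less familiar with the two reported metrics, the following sketch shows how accuracy and AUC can be computed from binary labels and classifier scores. It is an illustrative example only: the labels and scores are made-up toy values, not study data, and the 0.5 decision threshold is an assumption. AUC is computed here as the fraction of smoker/never-smoker pairs in which the smoker receives the higher score, with ties counted as one half.

```python
def accuracy(labels, scores, threshold=0.5):
    """Share of examples whose thresholded score matches the label."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def roc_auc(labels, scores):
    """AUC as the probability that a positive outranks a negative."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one example of each class")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(positives) * len(negatives))


# Toy data (hypothetical): 1 = smoker, 0 = never-smoker.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]
print(f"accuracy = {accuracy(labels, scores):.2f}")
print(f"AUC      = {roc_auc(labels, scores):.2f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarises ranking quality across all thresholds, which is why the two values reported for each group differ.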

Conclusion: We have shown that voice features are impacted by smoking. We have developed a novel digital vocal biomarker that can be used in clinical and epidemiological research to assess smoking status in a rapid, scalable, and accurate manner using ecological audio recordings.

Source journal: Digital Biomarkers (Medicine: Medicine (miscellaneous)). CiteScore: 10.60. Self-citation rate: 0.00%. Articles published: 12. Review time: 23 weeks.