Leif Simmatis, Saeid Alavi Naeini, Deniz Jafari, Michael Kai Yue Xie, Chelsea Tanchip, Niyousha Taati, Scotia McKinlay, Rupinder Sran, Justin Truong, Diego L Guarin, Babak Taati, Yana Yunusova
{"title":"基于网络摄像头的语音运动学评估的分析验证:遵循 V3 框架的数字生物标记评估。","authors":"Leif Simmatis, Saeid Alavi Naeini, Deniz Jafari, Michael Kai Yue Xie, Chelsea Tanchip, Niyousha Taati, Scotia McKinlay, Rupinder Sran, Justin Truong, Diego L Guarin, Babak Taati, Yana Yunusova","doi":"10.1159/000529685","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Kinematic analyses have recently revealed a strong potential to contribute to the assessment of neurological diseases. However, the validation of home-based kinematic assessments using consumer-grade video technology has yet to be performed. In line with best practices for digital biomarker development, we sought to validate webcam-based kinematic assessment against established, laboratory-based recording gold standards. We hypothesized that webcam-based kinematics would possess psychometric properties comparable to those obtained using the laboratory-based gold standards.</p><p><strong>Methods: </strong>We collected data from 21 healthy participants who repeated the phrase \"buy Bobby a puppy\" (BBP) at four different combinations of speaking rate and volume: Slow, Normal, Loud, and Fast. We recorded these samples twice back-to-back, simultaneously using (1) an electromagnetic articulography (\"EMA\"; NDI Wave) system, (2) a 3D camera (Intel RealSense), and (3) a 2D webcam for video recording via an in-house developed app. We focused on the extraction of kinematic features in this study, given their demonstrated value in detecting neurological impairments. We specifically extracted measures of speed/acceleration, range of motion (ROM), variability, and symmetry using the movements of the center of the lower lip during these tasks. Using these kinematic features, we derived measures of (1) agreement between recording methods, (2) test-retest reliability of each method, and (3) the validity of webcam recordings to capture expected changes in kinematics as a result of different speech conditions.</p><p><strong>Results: </strong>Kinematics measured using the webcam demonstrated good agreement with both the RealSense and EMA (ICC-A values often ≥0.70). Test-retest reliability, measured using the absolute agreement (2,1) formulation of the intraclass correlation coefficient (i.e., ICC-A), was often \"moderate\" to \"strong\" (i.e., ≥0.70) and similar between the webcam and EMA-based kinematic features. Finally, the webcam kinematics were typically as sensitive to differences in speech tasks as EMA and the 3D camera gold standards.</p><p><strong>Discussion and conclusions: </strong>Our results suggested that webcam recordings display good psychometric properties, comparable to laboratory-based gold standards. This work paves the way for a large-scale clinical validation to continue the development of these promising technologies for the assessment of neurological diseases via home-based methods.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"7-17"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10187538/pdf/","citationCount":"1","resultStr":"{\"title\":\"Analytical Validation of a Webcam-Based Assessment of Speech Kinematics: Digital Biomarker Evaluation following the V3 Framework.\",\"authors\":\"Leif Simmatis, Saeid Alavi Naeini, Deniz Jafari, Michael Kai Yue Xie, Chelsea Tanchip, Niyousha Taati, Scotia McKinlay, Rupinder Sran, Justin Truong, Diego L Guarin, Babak Taati, Yana Yunusova\",\"doi\":\"10.1159/000529685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Kinematic analyses have recently revealed a strong potential to contribute to the assessment of neurological diseases. However, the validation of home-based kinematic assessments using consumer-grade video technology has yet to be performed. In line with best practices for digital biomarker development, we sought to validate webcam-based kinematic assessment against established, laboratory-based recording gold standards. We hypothesized that webcam-based kinematics would possess psychometric properties comparable to those obtained using the laboratory-based gold standards.</p><p><strong>Methods: </strong>We collected data from 21 healthy participants who repeated the phrase \\\"buy Bobby a puppy\\\" (BBP) at four different combinations of speaking rate and volume: Slow, Normal, Loud, and Fast. We recorded these samples twice back-to-back, simultaneously using (1) an electromagnetic articulography (\\\"EMA\\\"; NDI Wave) system, (2) a 3D camera (Intel RealSense), and (3) a 2D webcam for video recording via an in-house developed app. We focused on the extraction of kinematic features in this study, given their demonstrated value in detecting neurological impairments. We specifically extracted measures of speed/acceleration, range of motion (ROM), variability, and symmetry using the movements of the center of the lower lip during these tasks. Using these kinematic features, we derived measures of (1) agreement between recording methods, (2) test-retest reliability of each method, and (3) the validity of webcam recordings to capture expected changes in kinematics as a result of different speech conditions.</p><p><strong>Results: </strong>Kinematics measured using the webcam demonstrated good agreement with both the RealSense and EMA (ICC-A values often ≥0.70). Test-retest reliability, measured using the absolute agreement (2,1) formulation of the intraclass correlation coefficient (i.e., ICC-A), was often \\\"moderate\\\" to \\\"strong\\\" (i.e., ≥0.70) and similar between the webcam and EMA-based kinematic features. Finally, the webcam kinematics were typically as sensitive to differences in speech tasks as EMA and the 3D camera gold standards.</p><p><strong>Discussion and conclusions: </strong>Our results suggested that webcam recordings display good psychometric properties, comparable to laboratory-based gold standards. This work paves the way for a large-scale clinical validation to continue the development of these promising technologies for the assessment of neurological diseases via home-based methods.</p>\",\"PeriodicalId\":11242,\"journal\":{\"name\":\"Digital Biomarkers\",\"volume\":\"7 1\",\"pages\":\"7-17\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10187538/pdf/\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Biomarkers\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1159/000529685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Biomarkers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000529685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1
摘要
简介运动学分析最近显示出其在评估神经系统疾病方面的巨大潜力。然而,使用消费级视频技术对基于家庭的运动学评估进行验证的工作尚未开展。根据数字生物标志物开发的最佳实践,我们试图根据已确立的实验室记录黄金标准来验证基于网络摄像头的运动学评估。我们假设,基于网络摄像头的运动学评估将具有与实验室黄金标准相当的心理测量特性:我们收集了 21 名健康参与者的数据,他们以四种不同的语速和音量组合重复了 "给 Bobby 买只小狗"(BBP)的短语:慢速、正常、大声和快速。我们使用(1)电磁发音成像("EMA";NDI Wave)系统、(2)3D 摄像头(英特尔 RealSense)和(3)2D 网络摄像头对这些样本进行了两次背靠背记录,并通过内部开发的应用程序进行视频记录。在本研究中,我们重点提取了运动学特征,因为这些特征在检测神经系统损伤方面具有显著价值。我们特别提取了这些任务中下唇中心运动的速度/加速度、运动范围 (ROM)、可变性和对称性。利用这些运动学特征,我们得出了以下指标:(1) 记录方法之间的一致性;(2) 每种方法的重复测试可靠性;(3) 网络摄像头记录的有效性,以捕捉不同语言条件下运动学的预期变化:结果:使用网络摄像头测量的运动学数据与 RealSense 和 EMA 的数据具有良好的一致性(ICC-A 值通常≥0.70)。使用类内相关系数的绝对一致(2,1)公式(即 ICC-A)测量的测试-再测可靠性通常为 "中等 "到 "较强"(即≥0.70),网络摄像头和 EMA 运动特征之间的可靠性相似。最后,网络摄像头运动学对语音任务差异的敏感度通常与 EMA 和 3D 摄像头黄金标准相当:我们的研究结果表明,网络摄像头录音具有良好的心理测量特性,可与基于实验室的黄金标准相媲美。这项工作为大规模临床验证铺平了道路,以便继续开发这些前景广阔的技术,通过基于家庭的方法评估神经系统疾病。
Analytical Validation of a Webcam-Based Assessment of Speech Kinematics: Digital Biomarker Evaluation following the V3 Framework.
Introduction: Kinematic analyses have recently revealed a strong potential to contribute to the assessment of neurological diseases. However, the validation of home-based kinematic assessments using consumer-grade video technology has yet to be performed. In line with best practices for digital biomarker development, we sought to validate webcam-based kinematic assessment against established, laboratory-based recording gold standards. We hypothesized that webcam-based kinematics would possess psychometric properties comparable to those obtained using the laboratory-based gold standards.
Methods: We collected data from 21 healthy participants who repeated the phrase "buy Bobby a puppy" (BBP) at four different combinations of speaking rate and volume: Slow, Normal, Loud, and Fast. We recorded these samples twice back-to-back, simultaneously using (1) an electromagnetic articulography ("EMA"; NDI Wave) system, (2) a 3D camera (Intel RealSense), and (3) a 2D webcam for video recording via an in-house developed app. We focused on the extraction of kinematic features in this study, given their demonstrated value in detecting neurological impairments. We specifically extracted measures of speed/acceleration, range of motion (ROM), variability, and symmetry using the movements of the center of the lower lip during these tasks. Using these kinematic features, we derived measures of (1) agreement between recording methods, (2) test-retest reliability of each method, and (3) the validity of webcam recordings to capture expected changes in kinematics as a result of different speech conditions.
Results: Kinematics measured using the webcam demonstrated good agreement with both the RealSense and EMA (ICC-A values often ≥0.70). Test-retest reliability, measured using the absolute agreement (2,1) formulation of the intraclass correlation coefficient (i.e., ICC-A), was often "moderate" to "strong" (i.e., ≥0.70) and similar between the webcam and EMA-based kinematic features. Finally, the webcam kinematics were typically as sensitive to differences in speech tasks as EMA and the 3D camera gold standards.
Discussion and conclusions: Our results suggested that webcam recordings display good psychometric properties, comparable to laboratory-based gold standards. This work paves the way for a large-scale clinical validation to continue the development of these promising technologies for the assessment of neurological diseases via home-based methods.