Cross-Validation of the Acoustic Roughness Index in German.

IF 2.4 4区医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY

Journal of Voice Pub Date : 2025-10-06 DOI:10.1016/j.jvoice.2025.09.030

Itsuki Kitayama, Kiyohito Hosokawa, Bernhard Lehnert, Kenji Aruga, Hidenori Inohara, Ben Barsties V Latoszek

{"title":"Cross-Validation of the Acoustic Roughness Index in German.","authors":"Itsuki Kitayama, Kiyohito Hosokawa, Bernhard Lehnert, Kenji Aruga, Hidenori Inohara, Ben Barsties V Latoszek","doi":"10.1016/j.jvoice.2025.09.030","DOIUrl":null,"url":null,"abstract":"Objective: The aim of this study was to validate the Acoustic Roughness Index (ARI) for German-speaking participants by examining its correlation with perceived vocal roughness and its diagnostic accuracy in distinguishing rough from non-rough voices.Methods: Voice samples from 218 adult speakers (175 with dysphonia and 43 vocally healthy controls) were recorded using a sustained vowel /a:/ and a standardized 27-syllable passage of continuous speech (approximately 3 seconds) concatenated into a single sample per participant. Three experienced raters judged the roughness severity of each sample using the R-parameter from the Grade, Roughness, Breathiness, Asthenia, Strain scale (ranging from normal to severe). Intra- and inter-rater reliability were assessed with Cohen's kappa and Fleiss' kappa, respectively. Acoustic analysis was performed using the ARI algorithm implemented in the software Praat. Concurrent validity was evaluated by Spearman rank correlation (rs) between ARI scores and perceptual roughness. Diagnostic validity was assessed via receiver operating characteristic (ROC) analysis determining the optimal ARI threshold for identifying rough voices.Results: Intra-rater reliability for roughness was moderate (mean Cohen's κ = 0.45) and inter-rater agreement was fair (Fleiss' κ = 0.35) indicating the inherent variability of perceptual roughness judgments. ARI scores demonstrated a sufficiently high correlation with perceived roughness (rs = 0.726, P < 0.001, 95% confidence interval [CI] = 0.654-0.785). The area under the ROC curve was 0.824 reflecting good diagnostic accuracy. The (Youden-) optimal ARI threshold was 2.00 yielding 71.8% sensitivity and 79.3% specificity.Conclusion: ARI appears to offer a potentially useful acoustic measure for assessing vocal roughness, though its robustness may be limited. Further research is necessary to improve the accuracy and reliability of the voice quality evaluation of roughness.","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.09.030","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: The aim of this study was to validate the Acoustic Roughness Index (ARI) for German-speaking participants by examining its correlation with perceived vocal roughness and its diagnostic accuracy in distinguishing rough from non-rough voices.

Methods: Voice samples from 218 adult speakers (175 with dysphonia and 43 vocally healthy controls) were recorded using a sustained vowel /a:/ and a standardized 27-syllable passage of continuous speech (approximately 3 seconds) concatenated into a single sample per participant. Three experienced raters judged the roughness severity of each sample using the R-parameter from the Grade, Roughness, Breathiness, Asthenia, Strain scale (ranging from normal to severe). Intra- and inter-rater reliability were assessed with Cohen's kappa and Fleiss' kappa, respectively. Acoustic analysis was performed using the ARI algorithm implemented in the software Praat. Concurrent validity was evaluated by Spearman rank correlation (r_s) between ARI scores and perceptual roughness. Diagnostic validity was assessed via receiver operating characteristic (ROC) analysis determining the optimal ARI threshold for identifying rough voices.

Results: Intra-rater reliability for roughness was moderate (mean Cohen's κ = 0.45) and inter-rater agreement was fair (Fleiss' κ = 0.35) indicating the inherent variability of perceptual roughness judgments. ARI scores demonstrated a sufficiently high correlation with perceived roughness (r_s = 0.726, P < 0.001, 95% confidence interval [CI] = 0.654-0.785). The area under the ROC curve was 0.824 reflecting good diagnostic accuracy. The (Youden-) optimal ARI threshold was 2.00 yielding 71.8% sensitivity and 79.3% specificity.

Conclusion: ARI appears to offer a potentially useful acoustic measure for assessing vocal roughness, though its robustness may be limited. Further research is necessary to improve the accuracy and reliability of the voice quality evaluation of roughness.

查看原文本刊更多论文

目的：本研究的目的是通过考察声粗糙度指数（ARI）与感知声音粗糙度的相关性及其在区分粗糙和非粗糙声音方面的诊断准确性来验证德语参与者的声粗糙度指数（ARI）。方法：采用连续元音/a:/和标准化的27个音节的连续语音片段（约3秒）将218名成年说话者的语音样本（175名有发音障碍的人，43名声带健康的人）拼接成一个样本进行录音。三名经验丰富的评分员使用等级、粗糙度、呼吸、虚弱、应变量表（从正常到严重）中的r参数来判断每个样本的粗糙度严重程度。量表内信度和量表间信度分别采用Cohen's kappa和Fleiss' s kappa。声学分析采用Praat软件中实现的ARI算法进行。并发效度采用ARI评分与感知粗糙度的Spearman秩相关（rs）评价。通过受试者工作特征（ROC）分析评估诊断有效性，确定识别粗糙声音的最佳ARI阈值。结果：粗糙度的评分者内部信度为中等（平均Cohen’s κ = 0.45），评分者之间的一致性为一般（Fleiss’s κ = 0.35），表明感知粗糙度判断具有内在的可变性。ARI评分显示与感知粗糙度有足够高的相关性（rs = 0.726, P < 0.001, 95%可信区间[CI] = 0.654-0.785）。ROC曲线下面积为0.824，诊断准确率较高。（Youden-）最佳ARI阈值为2.00，敏感性为71.8%，特异性为79.3%。结论：ARI似乎提供了一种潜在有用的声学测量来评估声音粗糙度，尽管其稳健性可能有限。为了提高粗糙度语音质量评价的准确性和可靠性，还需要进一步的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Voice 医学-耳鼻喉科学

CiteScore

4.00

自引率

13.60%

发文量

395

审稿时长

59 days

期刊介绍： The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.