使用自动语音分析预测口腔癌或口咽癌患者的语言障碍

IF 2.1 3区 医学 Q2 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Mathieu Balaguer, Julien Pinquier, Jérôme Farinas, Virginie Woisard
{"title":"使用自动语音分析预测口腔癌或口咽癌患者的语言障碍","authors":"Mathieu Balaguer,&nbsp;Julien Pinquier,&nbsp;Jérôme Farinas,&nbsp;Virginie Woisard","doi":"10.1111/1460-6984.70103","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Perceptual evaluation of speech disorders produces scores that poorly predict the consequences of speech impairment on the communication abilities of patients treated for oral/oropharyngeal cancer. This may be mitigated by automatic speech analysis.</p>\n </section>\n \n <section>\n \n <h3> Aim</h3>\n \n <p>To measure communication and speech impairment using automatic analyses of spontaneous speech and self-administered questionnaires in patients treated for oral cavity or oropharyngeal cancer.</p>\n </section>\n \n <section>\n \n <h3> Methods and Procedures</h3>\n \n <p>The spontaneous speech of 25 patients was recorded during a semistructured interview. Various acoustic and automatic tools were applied to the speech signal to obtain scores relating to the different linguistic levels. Reduction of dimensionality was applied to retain only relevant and nonredundant parameters. Self-administered questionnaires assessing communication and associated factors (associated deficits, anxiety/depression, cognitive status, communication needs relating to social circles, self-perception of speech impairment and quality of life) were conducted. A predictive modelling of communication and speech impairment by LASSO regression was performed using the scores from the automatic tools alone, which were then combined with the scores arising from the questionnaires.</p>\n </section>\n \n <section>\n \n <h3> Outcomes and Results</h3>\n \n <p>A total of 149 automatic parameters were extracted from the speech signal, of which 75 were retained after dimensional reduction. Predictive modelling of communication and speech impairment [Holistic Communication Score (HoCoS)] using the selected automatic parameters (number of sonants and occlusives recognised per second) provides a correlation of 0.83 between the predicted and actual score. This modelling is reliable (<i>r</i><sub>S</sub> = 0.82 between five-fold cross-validation and HoCoS). The correlation reaches 0.89 when including associated factors in the modelling, while maintaining a high reliability (<i>r</i><sub>S</sub> = 0.70 between five-fold cross-validation and HoCoS).</p>\n </section>\n \n <section>\n \n <h3> Conclusions and Implications</h3>\n \n <p>The use of automatic speech analysis allows a reliable prediction of the communication and speech impairment experienced by the patients. This study opens up new perspectives for the use of automatic speech recognition systems in clinical evaluation and for the consideration of functional and psychosocial needs expressed by the patients during their follow-up.</p>\n </section>\n \n <section>\n \n <h3> WHAT THIS PAPER ADDS</h3>\n \n <div><i>What is already known on the subject</i>\n \n <ul>\n \n <li>Automatic and acoustic analyses compensate for the biases of perceptual speech assessment in clinical practice. Although they are used to assess speech severity, few studies have examined their contribution to the measurement of communication and speech impairment, which is yet an essential objective of clinical intervention.</li>\n </ul>\n </div>\n \n <div><i>What this paper adds to the existing knowledge</i>\n \n <ul>\n \n <li>Acoustic and automatic analyses (and in particular the use of automatic speech recognition systems) enable good prediction of communication and speech impairment reported by patients, with a limited number of tools and parameters. This prediction is even improved by adding to the models scores from self-questionnaires measuring functional and psychosocial dimensions that may be impacted by the speech disorder. This study provides reliable new tools for measuring impaired communication.</li>\n </ul>\n </div>\n \n <div><i>What are the potential or actual clinical implications of this work?</i>\n \n <ul>\n \n <li>Acoustic and automatic tools can be used in routine clinical care to obtain a valid and reliable measure of communication and speech impairment. This prediction leads to a follow-up score to quantify the level of communication and speech impairment at a given time and the evolution of patients from a speech sample.</li>\n </ul>\n </div>\n </section>\n </div>","PeriodicalId":49182,"journal":{"name":"International Journal of Language & Communication Disorders","volume":"60 5","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1460-6984.70103","citationCount":"0","resultStr":"{\"title\":\"Prediction of Speech Impairment in Patients Treated for Oral or Oropharyngeal Cancer Using Automatic Speech Analysis\",\"authors\":\"Mathieu Balaguer,&nbsp;Julien Pinquier,&nbsp;Jérôme Farinas,&nbsp;Virginie Woisard\",\"doi\":\"10.1111/1460-6984.70103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>Perceptual evaluation of speech disorders produces scores that poorly predict the consequences of speech impairment on the communication abilities of patients treated for oral/oropharyngeal cancer. This may be mitigated by automatic speech analysis.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Aim</h3>\\n \\n <p>To measure communication and speech impairment using automatic analyses of spontaneous speech and self-administered questionnaires in patients treated for oral cavity or oropharyngeal cancer.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods and Procedures</h3>\\n \\n <p>The spontaneous speech of 25 patients was recorded during a semistructured interview. Various acoustic and automatic tools were applied to the speech signal to obtain scores relating to the different linguistic levels. Reduction of dimensionality was applied to retain only relevant and nonredundant parameters. Self-administered questionnaires assessing communication and associated factors (associated deficits, anxiety/depression, cognitive status, communication needs relating to social circles, self-perception of speech impairment and quality of life) were conducted. A predictive modelling of communication and speech impairment by LASSO regression was performed using the scores from the automatic tools alone, which were then combined with the scores arising from the questionnaires.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Outcomes and Results</h3>\\n \\n <p>A total of 149 automatic parameters were extracted from the speech signal, of which 75 were retained after dimensional reduction. Predictive modelling of communication and speech impairment [Holistic Communication Score (HoCoS)] using the selected automatic parameters (number of sonants and occlusives recognised per second) provides a correlation of 0.83 between the predicted and actual score. This modelling is reliable (<i>r</i><sub>S</sub> = 0.82 between five-fold cross-validation and HoCoS). The correlation reaches 0.89 when including associated factors in the modelling, while maintaining a high reliability (<i>r</i><sub>S</sub> = 0.70 between five-fold cross-validation and HoCoS).</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions and Implications</h3>\\n \\n <p>The use of automatic speech analysis allows a reliable prediction of the communication and speech impairment experienced by the patients. This study opens up new perspectives for the use of automatic speech recognition systems in clinical evaluation and for the consideration of functional and psychosocial needs expressed by the patients during their follow-up.</p>\\n </section>\\n \\n <section>\\n \\n <h3> WHAT THIS PAPER ADDS</h3>\\n \\n <div><i>What is already known on the subject</i>\\n \\n <ul>\\n \\n <li>Automatic and acoustic analyses compensate for the biases of perceptual speech assessment in clinical practice. Although they are used to assess speech severity, few studies have examined their contribution to the measurement of communication and speech impairment, which is yet an essential objective of clinical intervention.</li>\\n </ul>\\n </div>\\n \\n <div><i>What this paper adds to the existing knowledge</i>\\n \\n <ul>\\n \\n <li>Acoustic and automatic analyses (and in particular the use of automatic speech recognition systems) enable good prediction of communication and speech impairment reported by patients, with a limited number of tools and parameters. This prediction is even improved by adding to the models scores from self-questionnaires measuring functional and psychosocial dimensions that may be impacted by the speech disorder. This study provides reliable new tools for measuring impaired communication.</li>\\n </ul>\\n </div>\\n \\n <div><i>What are the potential or actual clinical implications of this work?</i>\\n \\n <ul>\\n \\n <li>Acoustic and automatic tools can be used in routine clinical care to obtain a valid and reliable measure of communication and speech impairment. This prediction leads to a follow-up score to quantify the level of communication and speech impairment at a given time and the evolution of patients from a speech sample.</li>\\n </ul>\\n </div>\\n </section>\\n </div>\",\"PeriodicalId\":49182,\"journal\":{\"name\":\"International Journal of Language & Communication Disorders\",\"volume\":\"60 5\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-08-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1460-6984.70103\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Language & Communication Disorders\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/1460-6984.70103\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Language & Communication Disorders","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1460-6984.70103","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

语言障碍的感知评价产生的分数不能很好地预测言语障碍对口腔/口咽癌治疗患者沟通能力的影响。自动语音分析可以减轻这种情况。目的通过自动分析口腔或口咽癌患者的自发语言和自我填写的问卷,评估患者的沟通和语言障碍。方法与步骤采用半结构化访谈法对25例患者的自发言语进行记录。将各种声学和自动工具应用于语音信号以获得与不同语言水平相关的分数。降维应用仅保留相关和非冗余参数。采用自填问卷评估沟通及相关因素(相关缺陷、焦虑/抑郁、认知状况、社交圈沟通需求、言语障碍自我认知及生活质量)。通过LASSO回归对交流和语言障碍进行预测建模,使用自动工具的得分,然后将其与问卷调查的得分相结合。结果与结果从语音信号中提取了149个自动参数,其中降维后保留了75个。使用选定的自动参数(每秒识别的辅音和闭塞音的数量)进行沟通和语言障碍预测建模[整体沟通评分(HoCoS)],预测得分与实际得分之间的相关性为0.83。该模型是可靠的(五倍交叉验证和HoCoS之间的rS = 0.82)。当模型中包含相关因素时,相关系数达到0.89,同时保持较高的信度(五重交叉验证与HoCoS之间的rS = 0.70)。结论和意义使用自动语音分析可以可靠地预测患者的沟通和语言障碍。本研究为在临床评估中使用自动语音识别系统以及在随访中考虑患者表达的功能和心理社会需求开辟了新的视角。自动和声学分析弥补了临床实践中感知语音评估的偏差。虽然它们被用来评估语言的严重程度,但很少有研究检验它们对测量交流和语言障碍的贡献,而这是临床干预的一个重要目标。声学和自动分析(特别是自动语音识别系统的使用)可以很好地预测患者报告的沟通和语言障碍,而工具和参数的数量有限。通过在模型中加入可能受言语障碍影响的功能和社会心理维度的自我调查问卷得分,这种预测甚至得到了改进。这项研究为衡量沟通障碍提供了可靠的新工具。这项工作的潜在或实际临床意义是什么?声学和自动工具可用于常规临床护理,以获得有效和可靠的衡量沟通和语言障碍。这一预测导致了一个后续评分,以量化在给定时间内沟通和语言障碍的水平以及患者从语音样本的演变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Prediction of Speech Impairment in Patients Treated for Oral or Oropharyngeal Cancer Using Automatic Speech Analysis

Prediction of Speech Impairment in Patients Treated for Oral or Oropharyngeal Cancer Using Automatic Speech Analysis

Background

Perceptual evaluation of speech disorders produces scores that poorly predict the consequences of speech impairment on the communication abilities of patients treated for oral/oropharyngeal cancer. This may be mitigated by automatic speech analysis.

Aim

To measure communication and speech impairment using automatic analyses of spontaneous speech and self-administered questionnaires in patients treated for oral cavity or oropharyngeal cancer.

Methods and Procedures

The spontaneous speech of 25 patients was recorded during a semistructured interview. Various acoustic and automatic tools were applied to the speech signal to obtain scores relating to the different linguistic levels. Reduction of dimensionality was applied to retain only relevant and nonredundant parameters. Self-administered questionnaires assessing communication and associated factors (associated deficits, anxiety/depression, cognitive status, communication needs relating to social circles, self-perception of speech impairment and quality of life) were conducted. A predictive modelling of communication and speech impairment by LASSO regression was performed using the scores from the automatic tools alone, which were then combined with the scores arising from the questionnaires.

Outcomes and Results

A total of 149 automatic parameters were extracted from the speech signal, of which 75 were retained after dimensional reduction. Predictive modelling of communication and speech impairment [Holistic Communication Score (HoCoS)] using the selected automatic parameters (number of sonants and occlusives recognised per second) provides a correlation of 0.83 between the predicted and actual score. This modelling is reliable (rS = 0.82 between five-fold cross-validation and HoCoS). The correlation reaches 0.89 when including associated factors in the modelling, while maintaining a high reliability (rS = 0.70 between five-fold cross-validation and HoCoS).

Conclusions and Implications

The use of automatic speech analysis allows a reliable prediction of the communication and speech impairment experienced by the patients. This study opens up new perspectives for the use of automatic speech recognition systems in clinical evaluation and for the consideration of functional and psychosocial needs expressed by the patients during their follow-up.

WHAT THIS PAPER ADDS

What is already known on the subject
  • Automatic and acoustic analyses compensate for the biases of perceptual speech assessment in clinical practice. Although they are used to assess speech severity, few studies have examined their contribution to the measurement of communication and speech impairment, which is yet an essential objective of clinical intervention.
What this paper adds to the existing knowledge
  • Acoustic and automatic analyses (and in particular the use of automatic speech recognition systems) enable good prediction of communication and speech impairment reported by patients, with a limited number of tools and parameters. This prediction is even improved by adding to the models scores from self-questionnaires measuring functional and psychosocial dimensions that may be impacted by the speech disorder. This study provides reliable new tools for measuring impaired communication.
What are the potential or actual clinical implications of this work?
  • Acoustic and automatic tools can be used in routine clinical care to obtain a valid and reliable measure of communication and speech impairment. This prediction leads to a follow-up score to quantify the level of communication and speech impairment at a given time and the evolution of patients from a speech sample.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Language & Communication Disorders
International Journal of Language & Communication Disorders AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY-REHABILITATION
CiteScore
3.30
自引率
12.50%
发文量
116
审稿时长
6-12 weeks
期刊介绍: The International Journal of Language & Communication Disorders (IJLCD) is the official journal of the Royal College of Speech & Language Therapists. The Journal welcomes submissions on all aspects of speech, language, communication disorders and speech and language therapy. It provides a forum for the exchange of information and discussion of issues of clinical or theoretical relevance in the above areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信