{"title":"Prediction of Speech Impairment in Patients Treated for Oral or Oropharyngeal Cancer Using Automatic Speech Analysis","authors":"Mathieu Balaguer, Julien Pinquier, Jérôme Farinas, Virginie Woisard","doi":"10.1111/1460-6984.70103","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Perceptual evaluation of speech disorders produces scores that poorly predict the consequences of speech impairment on the communication abilities of patients treated for oral/oropharyngeal cancer. This may be mitigated by automatic speech analysis.</p>\n </section>\n \n <section>\n \n <h3> Aim</h3>\n \n <p>To measure communication and speech impairment using automatic analyses of spontaneous speech and self-administered questionnaires in patients treated for oral cavity or oropharyngeal cancer.</p>\n </section>\n \n <section>\n \n <h3> Methods and Procedures</h3>\n \n <p>The spontaneous speech of 25 patients was recorded during a semistructured interview. Various acoustic and automatic tools were applied to the speech signal to obtain scores relating to the different linguistic levels. Reduction of dimensionality was applied to retain only relevant and nonredundant parameters. Self-administered questionnaires assessing communication and associated factors (associated deficits, anxiety/depression, cognitive status, communication needs relating to social circles, self-perception of speech impairment and quality of life) were conducted. A predictive modelling of communication and speech impairment by LASSO regression was performed using the scores from the automatic tools alone, which were then combined with the scores arising from the questionnaires.</p>\n </section>\n \n <section>\n \n <h3> Outcomes and Results</h3>\n \n <p>A total of 149 automatic parameters were extracted from the speech signal, of which 75 were retained after dimensional reduction. Predictive modelling of communication and speech impairment [Holistic Communication Score (HoCoS)] using the selected automatic parameters (number of sonants and occlusives recognised per second) provides a correlation of 0.83 between the predicted and actual score. This modelling is reliable (<i>r</i><sub>S</sub> = 0.82 between five-fold cross-validation and HoCoS). The correlation reaches 0.89 when including associated factors in the modelling, while maintaining a high reliability (<i>r</i><sub>S</sub> = 0.70 between five-fold cross-validation and HoCoS).</p>\n </section>\n \n <section>\n \n <h3> Conclusions and Implications</h3>\n \n <p>The use of automatic speech analysis allows a reliable prediction of the communication and speech impairment experienced by the patients. This study opens up new perspectives for the use of automatic speech recognition systems in clinical evaluation and for the consideration of functional and psychosocial needs expressed by the patients during their follow-up.</p>\n </section>\n \n <section>\n \n <h3> WHAT THIS PAPER ADDS</h3>\n \n <div><i>What is already known on the subject</i>\n \n <ul>\n \n <li>Automatic and acoustic analyses compensate for the biases of perceptual speech assessment in clinical practice. Although they are used to assess speech severity, few studies have examined their contribution to the measurement of communication and speech impairment, which is yet an essential objective of clinical intervention.</li>\n </ul>\n </div>\n \n <div><i>What this paper adds to the existing knowledge</i>\n \n <ul>\n \n <li>Acoustic and automatic analyses (and in particular the use of automatic speech recognition systems) enable good prediction of communication and speech impairment reported by patients, with a limited number of tools and parameters. This prediction is even improved by adding to the models scores from self-questionnaires measuring functional and psychosocial dimensions that may be impacted by the speech disorder. This study provides reliable new tools for measuring impaired communication.</li>\n </ul>\n </div>\n \n <div><i>What are the potential or actual clinical implications of this work?</i>\n \n <ul>\n \n <li>Acoustic and automatic tools can be used in routine clinical care to obtain a valid and reliable measure of communication and speech impairment. This prediction leads to a follow-up score to quantify the level of communication and speech impairment at a given time and the evolution of patients from a speech sample.</li>\n </ul>\n </div>\n </section>\n </div>","PeriodicalId":49182,"journal":{"name":"International Journal of Language & Communication Disorders","volume":"60 5","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1460-6984.70103","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Language & Communication Disorders","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1460-6984.70103","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Perceptual evaluation of speech disorders produces scores that poorly predict the consequences of speech impairment on the communication abilities of patients treated for oral/oropharyngeal cancer. This may be mitigated by automatic speech analysis.
Aim
To measure communication and speech impairment using automatic analyses of spontaneous speech and self-administered questionnaires in patients treated for oral cavity or oropharyngeal cancer.
Methods and Procedures
The spontaneous speech of 25 patients was recorded during a semistructured interview. Various acoustic and automatic tools were applied to the speech signal to obtain scores relating to the different linguistic levels. Reduction of dimensionality was applied to retain only relevant and nonredundant parameters. Self-administered questionnaires assessing communication and associated factors (associated deficits, anxiety/depression, cognitive status, communication needs relating to social circles, self-perception of speech impairment and quality of life) were conducted. A predictive modelling of communication and speech impairment by LASSO regression was performed using the scores from the automatic tools alone, which were then combined with the scores arising from the questionnaires.
Outcomes and Results
A total of 149 automatic parameters were extracted from the speech signal, of which 75 were retained after dimensional reduction. Predictive modelling of communication and speech impairment [Holistic Communication Score (HoCoS)] using the selected automatic parameters (number of sonants and occlusives recognised per second) provides a correlation of 0.83 between the predicted and actual score. This modelling is reliable (rS = 0.82 between five-fold cross-validation and HoCoS). The correlation reaches 0.89 when including associated factors in the modelling, while maintaining a high reliability (rS = 0.70 between five-fold cross-validation and HoCoS).
Conclusions and Implications
The use of automatic speech analysis allows a reliable prediction of the communication and speech impairment experienced by the patients. This study opens up new perspectives for the use of automatic speech recognition systems in clinical evaluation and for the consideration of functional and psychosocial needs expressed by the patients during their follow-up.
WHAT THIS PAPER ADDS
What is already known on the subject
Automatic and acoustic analyses compensate for the biases of perceptual speech assessment in clinical practice. Although they are used to assess speech severity, few studies have examined their contribution to the measurement of communication and speech impairment, which is yet an essential objective of clinical intervention.
What this paper adds to the existing knowledge
Acoustic and automatic analyses (and in particular the use of automatic speech recognition systems) enable good prediction of communication and speech impairment reported by patients, with a limited number of tools and parameters. This prediction is even improved by adding to the models scores from self-questionnaires measuring functional and psychosocial dimensions that may be impacted by the speech disorder. This study provides reliable new tools for measuring impaired communication.
What are the potential or actual clinical implications of this work?
Acoustic and automatic tools can be used in routine clinical care to obtain a valid and reliable measure of communication and speech impairment. This prediction leads to a follow-up score to quantify the level of communication and speech impairment at a given time and the evolution of patients from a speech sample.
期刊介绍:
The International Journal of Language & Communication Disorders (IJLCD) is the official journal of the Royal College of Speech & Language Therapists. The Journal welcomes submissions on all aspects of speech, language, communication disorders and speech and language therapy. It provides a forum for the exchange of information and discussion of issues of clinical or theoretical relevance in the above areas.