A NLP-based Approach to Improve Speech Recognition Services for People with Speech Disorders

2022 IEEE Symposium on Computers and Communications (ISCC) Pub Date : 2022-06-30 DOI:10.1109/ISCC55528.2022.9912940

A. Celesti, M. Fazio, Lorenzo Carnevale, M. Villari

引用次数: 0

Abstract

Current speech recognition services are not suitable for people with speech disorders, which present difficulties in coordinating muscles and articulating words and sentences. In this case, a speaker-dependent approach is strongly required in order to address the specific vocal disarticulation. Several Deep learning approaches have been proposed in the literature to address this problem. However, they require many voice samples of people to properly work, and this is not practical. In this paper, we present an innovative Automatic Speech Recognition (ASR) system which is able to correct failures of deep learning based solution adopting Natural Language Processing (NLP) techniques. The proposed solution can perform both single word and whole sentence corrections by analyzing the speech context. We evaluated the solution in a home automation case study and proved the good accuracy of our model.

查看原文本刊更多论文

基于nlp的语言障碍患者语音识别服务改进方法

目前的语音识别服务并不适合有语言障碍的人，他们在协调肌肉和清晰表达单词和句子方面存在困难。在这种情况下，一个说话人依赖的方法是强烈需要的，以解决具体的发音脱节。文献中提出了几种深度学习方法来解决这个问题。然而，它们需要大量的人的声音样本才能正常工作，这是不实际的。在本文中，我们提出了一种创新的自动语音识别(ASR)系统，该系统能够采用自然语言处理(NLP)技术来纠正基于深度学习的解决方案的失败。该解决方案可以通过分析语音上下文进行单字和整句的纠错。我们在一个家庭自动化案例研究中评估了该解决方案，并证明了我们的模型具有良好的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Symposium on Computers and Communications (ISCC)

自引率

0.00%

发文量