A high-accuracy ASR technique based on correlational weight analysis for elderly users
Chih-Hung Chou, Ta-Wen Kuan, Po-Chuan Lin, Jhing-Fa Wang, Yi-Jhong Wu
2014 International Conference on Orange Technologies, 2014-09-01
DOI: 10.1109/ICOT.2014.6956631 (https://doi.org/10.1109/ICOT.2014.6956631)
Abstract
This paper proposes CWCWRT, a robust template for template-based ASR that builds on the previously proposed enhanced cross word reference template (ECWRT) and uses a correlational weight adjusting method to improve robustness against variation in elderly speech. The work addresses two issues: rejecting outliers in the training set and eliminating the unwanted utterances that elderly speakers often produce. Accordingly, the method consists of two main steps: correlational analysis followed by weight adjustment. For the experiments, a corpus of 30 commands in Mandarin and English was collected from three elderly speakers (age 62±3 years) and three adults (age 22±2 years), with a total of 30 utterances for each of them. Experiments were run on two platforms, a PC and the GPCE063A embedded platform, using both inside and outside tests. The average recognition rate for the inside test is 97% in PC simulation and 90% on the embedded platform; for the outside test, the rates are 93% and 87%, respectively. The proposed method is also compared with the earlier cross word reference template (CWRT) and ECWRT; the comparison shows that CWCWRT gives higher robustness and accuracy than these two baselines.
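The abstract names the two steps (correlational analysis, then weight adjustment) but does not give the formulas. As a rough illustration of the general idea only, the sketch below scores each training utterance by its mean Pearson correlation with the others, rejects low-scoring utterances as outliers, and averages the rest with correlation-proportional weights. Everything here is an assumption made for illustration and not the authors' CWCWRT procedure: the function names, the `reject_below` threshold, the scoring rule, and the premise that utterances are already time-aligned feature vectors of equal length.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    # A constant sequence has no defined correlation; treat it as 0.
    return cov / (sx * sy) if sx and sy else 0.0


def weighted_template(utterances, reject_below=0.3):
    """Build one reference template from several utterances of a command.

    Hypothetical scheme (not taken from the paper):
      1. score each utterance by its mean correlation with the others,
      2. reject utterances whose score is under `reject_below`,
      3. average the remaining ones, weighted by normalized score.
    Returns (template, weights); rejected utterances get weight 0.0.
    """
    n = len(utterances)
    if n < 2:
        raise ValueError("need at least two utterances")
    if reject_below <= 0:
        raise ValueError("reject_below must be positive")

    # Step 1: correlational analysis.
    scores = []
    for i in range(n):
        others = [pearson(utterances[i], utterances[j])
                  for j in range(n) if j != i]
        scores.append(sum(others) / len(others))

    # Step 2: outlier rejection and weight adjustment.
    kept = [s if s >= reject_below else 0.0 for s in scores]
    total = sum(kept)
    if total == 0:
        raise ValueError("all utterances were rejected")
    weights = [s / total for s in kept]

    # Step 3: weighted average, one feature dimension at a time.
    dim = len(utterances[0])
    template = [sum(w * u[d] for u, w in zip(utterances, weights))
                for d in range(dim)]
    return template, weights


if __name__ == "__main__":
    # Toy data: three similar utterances and one that does not fit.
    utts = [
        [1.0, 2.0, 3.0, 4.0],
        [1.1, 2.0, 3.2, 3.9],
        [0.9, 2.1, 2.9, 4.2],
        [4.0, 1.0, 3.0, 0.0],
    ]
    template, weights = weighted_template(utts)
    print("weights:", weights)
    print("template:", template)
```

Because each score is averaged over all other utterances, a single strong outlier also pulls down the scores of the good utterances, so the threshold has to be chosen with the training-set size in mind; a real system would likely also align utterances (for example with dynamic time warping) before correlating them.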