{"title":"Speech enhancement and features compensation algorithms for continuous speech recognition","authors":"Christian Arcos, M. Grivet, A. Alcaim","doi":"10.1109/ChinaSIP.2014.6889195","DOIUrl":null,"url":null,"abstract":"The degradation of the speech signal due to adverse conditions generates low accuracy rates in speech recognition systems. The authors propose mixing two methods: pre-extraction of features for speech enhancement and post-extraction of features for features compensation. According to their main focus, they are fundamentally oriented to minimize the misfit caused by noise insertion in the speech signal. These methods will be applied before and after the extraction of features, respectively, therefore allowing the best possible estimation of the clear signal from its degraded version.","PeriodicalId":248977,"journal":{"name":"2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP)","volume":"1989 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ChinaSIP.2014.6889195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The degradation of the speech signal due to adverse conditions generates low accuracy rates in speech recognition systems. The authors propose mixing two methods: pre-extraction of features for speech enhancement and post-extraction of features for features compensation. According to their main focus, they are fundamentally oriented to minimize the misfit caused by noise insertion in the speech signal. These methods will be applied before and after the extraction of features, respectively, therefore allowing the best possible estimation of the clear signal from its degraded version.