{"title":"利用喉部传声器和普通传声器语音的组合特征提高印地语音节单位的识别","authors":"N. Radha, A. Shahina, G. Vinoth, A. N. Khan","doi":"10.1109/ICCICCT.2014.6993171","DOIUrl":null,"url":null,"abstract":"The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. AS R built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of AS R. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an AS R in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined AS R system shows a significant improvement in performance when compared with individual AS R systems built using NM and TM features, respectively.","PeriodicalId":6615,"journal":{"name":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","volume":"9 1","pages":"1343-1348"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Improving recognition of syallabic units of Hindi languagae using combined features of Throat Microphone and Normal Microphone speech\",\"authors\":\"N. Radha, A. Shahina, G. Vinoth, A. N. Khan\",\"doi\":\"10.1109/ICCICCT.2014.6993171\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. AS R built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of AS R. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an AS R in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined AS R system shows a significant improvement in performance when compared with individual AS R systems built using NM and TM features, respectively.\",\"PeriodicalId\":6615,\"journal\":{\"name\":\"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)\",\"volume\":\"9 1\",\"pages\":\"1343-1348\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCICCT.2014.6993171\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICCT.2014.6993171","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving recognition of syallabic units of Hindi languagae using combined features of Throat Microphone and Normal Microphone speech
The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. AS R built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of AS R. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an AS R in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined AS R system shows a significant improvement in performance when compared with individual AS R systems built using NM and TM features, respectively.