{"title":"新的连续语音特征调整噪声鲁棒CSR系统","authors":"Yiming Y. Sun, Y. Miyanaga","doi":"10.1109/ISCIT.2011.6089754","DOIUrl":null,"url":null,"abstract":"We propose a noise-robust continuous speech recognition (CSR) method for recognition. In model building, we extract the novel feature vector by using running spectrum analysis (RSA) and dynamic range adjustment (DRA) methods. DRA adjusts the dynamic range on MFCC modulation spectrum domain (MSD). In recognition, the algorithm automatically divides the continuous speech into short sentences and blocks, then we use DRA based on the blocks. The proposed algorithm efficiency is studied for clean and noisy environment. In our experiments, all HMMs have been trained by using the Japanese newspaper article sentence (JNAS) database. The average recognition rate improves under various types of noise and SNR conditions.","PeriodicalId":226552,"journal":{"name":"2011 11th International Symposium on Communications & Information Technologies (ISCIT)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"New continuous speech feature adjustment for a noise-robust CSR system\",\"authors\":\"Yiming Y. Sun, Y. Miyanaga\",\"doi\":\"10.1109/ISCIT.2011.6089754\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a noise-robust continuous speech recognition (CSR) method for recognition. In model building, we extract the novel feature vector by using running spectrum analysis (RSA) and dynamic range adjustment (DRA) methods. DRA adjusts the dynamic range on MFCC modulation spectrum domain (MSD). In recognition, the algorithm automatically divides the continuous speech into short sentences and blocks, then we use DRA based on the blocks. The proposed algorithm efficiency is studied for clean and noisy environment. In our experiments, all HMMs have been trained by using the Japanese newspaper article sentence (JNAS) database. The average recognition rate improves under various types of noise and SNR conditions.\",\"PeriodicalId\":226552,\"journal\":{\"name\":\"2011 11th International Symposium on Communications & Information Technologies (ISCIT)\",\"volume\":\"108 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 11th International Symposium on Communications & Information Technologies (ISCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCIT.2011.6089754\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 11th International Symposium on Communications & Information Technologies (ISCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCIT.2011.6089754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
New continuous speech feature adjustment for a noise-robust CSR system
We propose a noise-robust continuous speech recognition (CSR) method for recognition. In model building, we extract the novel feature vector by using running spectrum analysis (RSA) and dynamic range adjustment (DRA) methods. DRA adjusts the dynamic range on MFCC modulation spectrum domain (MSD). In recognition, the algorithm automatically divides the continuous speech into short sentences and blocks, then we use DRA based on the blocks. The proposed algorithm efficiency is studied for clean and noisy environment. In our experiments, all HMMs have been trained by using the Japanese newspaper article sentence (JNAS) database. The average recognition rate improves under various types of noise and SNR conditions.