Vicky Zilvan, Iftitahu Ni'mah, A. R. Yuliani, H. Pardede
{"title":"远距离语音识别中基于q对数的实时特征归一化研究","authors":"Vicky Zilvan, Iftitahu Ni'mah, A. R. Yuliani, H. Pardede","doi":"10.1109/ICITSI.2016.7858234","DOIUrl":null,"url":null,"abstract":"The computation of long term mean in feature normalization methods requires information on future frames, and thus makes them inapplicable for real-time implementations. Previously, q-log spectral mean normalization (q-LSMN) as feature normalization method is proposed, and it shows more effective result than conventional normalization methods. However, q-LSMN has not yet been implemented on real time. In this paper, we propose a real time implementation of q-LSMN. In this method, the mean is updated recursively based on only previous frames, hence no future frame information is needed. Experiments on Aurora-5 databases showed that while real time q-LSMN achieved slightly worse performance than non real time q-LSMN as expected, it improved the recognition accuracy up to 54.22% compared to that of non-real time conventional normalization methods such as cepstral mean normalization (CMN) and Log spectral mean normalization (LSMN).","PeriodicalId":172314,"journal":{"name":"2016 International Conference on Information Technology Systems and Innovation (ICITSI)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On real time Q-log-based feature normalization for distant speech recognition\",\"authors\":\"Vicky Zilvan, Iftitahu Ni'mah, A. R. Yuliani, H. Pardede\",\"doi\":\"10.1109/ICITSI.2016.7858234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The computation of long term mean in feature normalization methods requires information on future frames, and thus makes them inapplicable for real-time implementations. Previously, q-log spectral mean normalization (q-LSMN) as feature normalization method is proposed, and it shows more effective result than conventional normalization methods. However, q-LSMN has not yet been implemented on real time. In this paper, we propose a real time implementation of q-LSMN. In this method, the mean is updated recursively based on only previous frames, hence no future frame information is needed. Experiments on Aurora-5 databases showed that while real time q-LSMN achieved slightly worse performance than non real time q-LSMN as expected, it improved the recognition accuracy up to 54.22% compared to that of non-real time conventional normalization methods such as cepstral mean normalization (CMN) and Log spectral mean normalization (LSMN).\",\"PeriodicalId\":172314,\"journal\":{\"name\":\"2016 International Conference on Information Technology Systems and Innovation (ICITSI)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Information Technology Systems and Innovation (ICITSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITSI.2016.7858234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information Technology Systems and Innovation (ICITSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITSI.2016.7858234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On real time Q-log-based feature normalization for distant speech recognition
The computation of long term mean in feature normalization methods requires information on future frames, and thus makes them inapplicable for real-time implementations. Previously, q-log spectral mean normalization (q-LSMN) as feature normalization method is proposed, and it shows more effective result than conventional normalization methods. However, q-LSMN has not yet been implemented on real time. In this paper, we propose a real time implementation of q-LSMN. In this method, the mean is updated recursively based on only previous frames, hence no future frame information is needed. Experiments on Aurora-5 databases showed that while real time q-LSMN achieved slightly worse performance than non real time q-LSMN as expected, it improved the recognition accuracy up to 54.22% compared to that of non-real time conventional normalization methods such as cepstral mean normalization (CMN) and Log spectral mean normalization (LSMN).