Chao Xu, Yi Y. Liu, Yongsheng Yang, Pascale Fung, Z. Cao
{"title":"基于便携式设备的汉语短短语识别系统","authors":"Chao Xu, Yi Y. Liu, Yongsheng Yang, Pascale Fung, Z. Cao","doi":"10.1109/CHINSL.2004.1409628","DOIUrl":null,"url":null,"abstract":"With the proliferation of portable devices, speech recognition, especially name, address and command recognition, on these devices is a topic of growing relevance. A Mandarin short phrase recognition system is introduced in consideration of the limited resources and calculation ability of portable devices. A fixed-point front-end is developed, a discrete hidden Markov model is employed for acoustic modeling, and an SNR based likelihood weighting method is proposed to improve the noise robustness of the system. The memory size of the model set is 269 kB, the decoding time is 0.89 times of the speech duration, and the method for robustness gives a relative 15.2% word error rate reduction in a complex practical environment with both channel distortion and non-stationary noise present.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A system for Mandarin short phrase recognition on portable devices\",\"authors\":\"Chao Xu, Yi Y. Liu, Yongsheng Yang, Pascale Fung, Z. Cao\",\"doi\":\"10.1109/CHINSL.2004.1409628\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the proliferation of portable devices, speech recognition, especially name, address and command recognition, on these devices is a topic of growing relevance. A Mandarin short phrase recognition system is introduced in consideration of the limited resources and calculation ability of portable devices. A fixed-point front-end is developed, a discrete hidden Markov model is employed for acoustic modeling, and an SNR based likelihood weighting method is proposed to improve the noise robustness of the system. The memory size of the model set is 269 kB, the decoding time is 0.89 times of the speech duration, and the method for robustness gives a relative 15.2% word error rate reduction in a complex practical environment with both channel distortion and non-stationary noise present.\",\"PeriodicalId\":212562,\"journal\":{\"name\":\"2004 International Symposium on Chinese Spoken Language Processing\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2004 International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CHINSL.2004.1409628\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2004 International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2004.1409628","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A system for Mandarin short phrase recognition on portable devices
With the proliferation of portable devices, speech recognition, especially name, address and command recognition, on these devices is a topic of growing relevance. A Mandarin short phrase recognition system is introduced in consideration of the limited resources and calculation ability of portable devices. A fixed-point front-end is developed, a discrete hidden Markov model is employed for acoustic modeling, and an SNR based likelihood weighting method is proposed to improve the noise robustness of the system. The memory size of the model set is 269 kB, the decoding time is 0.89 times of the speech duration, and the method for robustness gives a relative 15.2% word error rate reduction in a complex practical environment with both channel distortion and non-stationary noise present.