Shaohui Chang, Xiaohui Wang, Tingzhang Fang, Lin Qian
{"title":"语音唤醒与命令识别算法的设计与实现","authors":"Shaohui Chang, Xiaohui Wang, Tingzhang Fang, Lin Qian","doi":"10.1109/ihmsc55436.2022.00009","DOIUrl":null,"url":null,"abstract":"This paper analyzes methods of speech recognition and lays an emphasize on acoustic model, language model and decoding algorithm based on the output of these two models. Hidden Markov Model is applied to acoustic model to build a state graph of acoustic features of input speech sample. After testing, the arousing rate is above 95%, the mis-arousing-rate is below 5% and response time of the wake-on-voice model is about 0.2s.","PeriodicalId":447862,"journal":{"name":"2022 14th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design and Implementation of Wake-on-Voice and Command Recognition Algorithm\",\"authors\":\"Shaohui Chang, Xiaohui Wang, Tingzhang Fang, Lin Qian\",\"doi\":\"10.1109/ihmsc55436.2022.00009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper analyzes methods of speech recognition and lays an emphasize on acoustic model, language model and decoding algorithm based on the output of these two models. Hidden Markov Model is applied to acoustic model to build a state graph of acoustic features of input speech sample. After testing, the arousing rate is above 95%, the mis-arousing-rate is below 5% and response time of the wake-on-voice model is about 0.2s.\",\"PeriodicalId\":447862,\"journal\":{\"name\":\"2022 14th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)\",\"volume\":\"87 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ihmsc55436.2022.00009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ihmsc55436.2022.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design and Implementation of Wake-on-Voice and Command Recognition Algorithm
This paper analyzes methods of speech recognition and lays an emphasize on acoustic model, language model and decoding algorithm based on the output of these two models. Hidden Markov Model is applied to acoustic model to build a state graph of acoustic features of input speech sample. After testing, the arousing rate is above 95%, the mis-arousing-rate is below 5% and response time of the wake-on-voice model is about 0.2s.