{"title":"基于语音的机器人交互中微小解码图的鲁棒性研究","authors":"A. Abdelhamid, W. Abdulla, B. MacDonald","doi":"10.1109/RAM.2013.6758581","DOIUrl":null,"url":null,"abstract":"In this paper we study the robustness of a command decoding approach based on tiny decoding graphs for voice-based robotic interaction. This approach comprises the fusion of the grammar rules and the statistical n-gram language models to produce an elegant and quite efficient tiny decoding graph. The resulting tiny graph has several advantages such as high speed and improved robustness of command decoding even in adverse noisy conditions. To validate the robustness of the proposed approach, we employed a set of spoken commands from the Resource Management (RM1) command and control corpus. These commands are artificially corrupted by 10 types of noise at different signal-to-noise ratios (SNRs). Experimental results show that the proposed approach achieved word error rates of 1.9% and 29% for the commands at 20dB and 5dB respectively, whereas the word error rates of the same task using the traditional grammar rules were 43% and 75% for the commands at 20dB and 5dB SNRs, respectively.","PeriodicalId":287085,"journal":{"name":"2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the robustness of tiny decoding graphs for voice-based robotic interaction\",\"authors\":\"A. Abdelhamid, W. Abdulla, B. MacDonald\",\"doi\":\"10.1109/RAM.2013.6758581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we study the robustness of a command decoding approach based on tiny decoding graphs for voice-based robotic interaction. This approach comprises the fusion of the grammar rules and the statistical n-gram language models to produce an elegant and quite efficient tiny decoding graph. The resulting tiny graph has several advantages such as high speed and improved robustness of command decoding even in adverse noisy conditions. To validate the robustness of the proposed approach, we employed a set of spoken commands from the Resource Management (RM1) command and control corpus. These commands are artificially corrupted by 10 types of noise at different signal-to-noise ratios (SNRs). Experimental results show that the proposed approach achieved word error rates of 1.9% and 29% for the commands at 20dB and 5dB respectively, whereas the word error rates of the same task using the traditional grammar rules were 43% and 75% for the commands at 20dB and 5dB SNRs, respectively.\",\"PeriodicalId\":287085,\"journal\":{\"name\":\"2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAM.2013.6758581\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAM.2013.6758581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the robustness of tiny decoding graphs for voice-based robotic interaction
In this paper we study the robustness of a command decoding approach based on tiny decoding graphs for voice-based robotic interaction. This approach comprises the fusion of the grammar rules and the statistical n-gram language models to produce an elegant and quite efficient tiny decoding graph. The resulting tiny graph has several advantages such as high speed and improved robustness of command decoding even in adverse noisy conditions. To validate the robustness of the proposed approach, we employed a set of spoken commands from the Resource Management (RM1) command and control corpus. These commands are artificially corrupted by 10 types of noise at different signal-to-noise ratios (SNRs). Experimental results show that the proposed approach achieved word error rates of 1.9% and 29% for the commands at 20dB and 5dB respectively, whereas the word error rates of the same task using the traditional grammar rules were 43% and 75% for the commands at 20dB and 5dB SNRs, respectively.