{"title":"机器人语音意图识别","authors":"Borui Shen, D. Inkpen","doi":"10.1109/MCSI.2016.042","DOIUrl":null,"url":null,"abstract":"We present the design of a dialog manager that performs speech intent recognition, based on a finite state machine which enables simultaneous processing of multiple sub-modules and maintains ordered transitions of system states. The dialog manager is integrated into a spoken dialog system. The application area of this system is targeted on robots, and the core problem that we address is to recognize user's speech intents, which could be either asking questions or giving commands to a robot. Our dialog manager is a sequence classifier based on hidden Markov models, and it uses part-of-speech tags as output symbols. The classifier take sentences of variable lengths as input. It is trained on a small data set and achieves and accuracy of 83%.","PeriodicalId":421998,"journal":{"name":"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Speech Intent Recognition for Robots\",\"authors\":\"Borui Shen, D. Inkpen\",\"doi\":\"10.1109/MCSI.2016.042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present the design of a dialog manager that performs speech intent recognition, based on a finite state machine which enables simultaneous processing of multiple sub-modules and maintains ordered transitions of system states. The dialog manager is integrated into a spoken dialog system. The application area of this system is targeted on robots, and the core problem that we address is to recognize user's speech intents, which could be either asking questions or giving commands to a robot. Our dialog manager is a sequence classifier based on hidden Markov models, and it uses part-of-speech tags as output symbols. The classifier take sentences of variable lengths as input. It is trained on a small data set and achieves and accuracy of 83%.\",\"PeriodicalId\":421998,\"journal\":{\"name\":\"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MCSI.2016.042\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MCSI.2016.042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
We present the design of a dialog manager that performs speech intent recognition, based on a finite state machine which enables simultaneous processing of multiple sub-modules and maintains ordered transitions of system states. The dialog manager is integrated into a spoken dialog system. The application area of this system is targeted on robots, and the core problem that we address is to recognize user's speech intents, which could be either asking questions or giving commands to a robot. Our dialog manager is a sequence classifier based on hidden Markov models, and it uses part-of-speech tags as output symbols. The classifier take sentences of variable lengths as input. It is trained on a small data set and achieves and accuracy of 83%.