{"title":"在马尔可夫决策过程框架内学习对话策略","authors":"E. Levin, R. Pieraccini, W. Eckert","doi":"10.1109/ASRU.1997.658989","DOIUrl":null,"url":null,"abstract":"We introduce a stochastic model for dialogue systems based on the Markov decision process. Within this framework we show that the problem of dialogue strategy design can be stated as an optimization problem, and solved by a variety of methods, including the reinforcement learning approach. The advantages of this new paradigm include objective evaluation of dialogue systems and their automatic design and adaptation. We show some preliminary results on learning a dialogue strategy for an air travel information system.","PeriodicalId":253278,"journal":{"name":"1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"155","resultStr":"{\"title\":\"Learning dialogue strategies within the Markov decision process framework\",\"authors\":\"E. Levin, R. Pieraccini, W. Eckert\",\"doi\":\"10.1109/ASRU.1997.658989\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We introduce a stochastic model for dialogue systems based on the Markov decision process. Within this framework we show that the problem of dialogue strategy design can be stated as an optimization problem, and solved by a variety of methods, including the reinforcement learning approach. The advantages of this new paradigm include objective evaluation of dialogue systems and their automatic design and adaptation. We show some preliminary results on learning a dialogue strategy for an air travel information system.\",\"PeriodicalId\":253278,\"journal\":{\"name\":\"1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"155\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASRU.1997.658989\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.1997.658989","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning dialogue strategies within the Markov decision process framework
We introduce a stochastic model for dialogue systems based on the Markov decision process. Within this framework we show that the problem of dialogue strategy design can be stated as an optimization problem, and solved by a variety of methods, including the reinforcement learning approach. The advantages of this new paradigm include objective evaluation of dialogue systems and their automatic design and adaptation. We show some preliminary results on learning a dialogue strategy for an air travel information system.