自适应对话系统的在线复杂动作学习和用户状态估计

2012 IEEE 24th International Conference on Tools with Artificial Intelligence Pub Date : 2012-11-07 DOI:10.1109/ICTAI.2012.92

A. Papangelis, V. Karkaletsis, F. Makedon

{"title":"自适应对话系统的在线复杂动作学习和用户状态估计","authors":"A. Papangelis, V. Karkaletsis, F. Makedon","doi":"10.1109/ICTAI.2012.92","DOIUrl":null,"url":null,"abstract":"Dialogue Systems (DS) have been rapidly evolving during the last few years. In order for them to be able to adapt to their surroundings and to individual users, researchers have focused on adaptation techniques, giving rise to the field of Adaptive Dialogue Systems (ADS). One important sub field of ADS is learning what the system should say or do next. Most of the work done in this direction assumes the system performs simple actions, rather than behaviours described by sub-dialogues, i.e. complex actions. To this end we propose a novel online algorithm that ranks complex actions according to their performance and selects the top-k performing ones. To the best of our knowledge there is currently no work describing methods to learn policies for complex actions in an online fashion. We also propose an online algorithm able to estimate the effects of system actions on the user's state, in order to make more informed decisions about which action to take and guide the learning algorithm towards a desired final user state. Our results show that the proposed complex action learning algorithm outperforms simple Hierarchical Reinforcement Learning algorithms and that we are able to successfully estimate the effect an action will have on the user's state, using emotional states as a case study.","PeriodicalId":155588,"journal":{"name":"2012 IEEE 24th International Conference on Tools with Artificial Intelligence","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Online Complex Action Learning and User State Estimation for Adaptive Dialogue Systems\",\"authors\":\"A. Papangelis, V. Karkaletsis, F. Makedon\",\"doi\":\"10.1109/ICTAI.2012.92\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dialogue Systems (DS) have been rapidly evolving during the last few years. In order for them to be able to adapt to their surroundings and to individual users, researchers have focused on adaptation techniques, giving rise to the field of Adaptive Dialogue Systems (ADS). One important sub field of ADS is learning what the system should say or do next. Most of the work done in this direction assumes the system performs simple actions, rather than behaviours described by sub-dialogues, i.e. complex actions. To this end we propose a novel online algorithm that ranks complex actions according to their performance and selects the top-k performing ones. To the best of our knowledge there is currently no work describing methods to learn policies for complex actions in an online fashion. We also propose an online algorithm able to estimate the effects of system actions on the user's state, in order to make more informed decisions about which action to take and guide the learning algorithm towards a desired final user state. Our results show that the proposed complex action learning algorithm outperforms simple Hierarchical Reinforcement Learning algorithms and that we are able to successfully estimate the effect an action will have on the user's state, using emotional states as a case study.\",\"PeriodicalId\":155588,\"journal\":{\"name\":\"2012 IEEE 24th International Conference on Tools with Artificial Intelligence\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 24th International Conference on Tools with Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTAI.2012.92\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 24th International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2012.92","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

对话系统(DS)在过去几年中发展迅速。为了使它们能够适应环境和个体用户，研究人员将重点放在了适应技术上，从而产生了自适应对话系统(ADS)领域。ADS的一个重要子领域是学习系统下一步应该说什么或做什么。在这个方向上完成的大多数工作都假设系统执行简单的操作，而不是由子对话描述的行为，即复杂的操作。为此，我们提出了一种新的在线算法，该算法根据复杂动作的性能对其进行排名，并选择表现最好的k个动作。据我们所知，目前还没有工作描述以在线方式学习复杂动作策略的方法。我们还提出了一种在线算法，能够估计系统动作对用户状态的影响，以便对采取哪种动作做出更明智的决定，并引导学习算法走向期望的最终用户状态。我们的研究结果表明，所提出的复杂动作学习算法优于简单的分层强化学习算法，并且我们能够成功地估计动作对用户状态的影响，使用情绪状态作为案例研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Online Complex Action Learning and User State Estimation for Adaptive Dialogue Systems

Dialogue Systems (DS) have been rapidly evolving during the last few years. In order for them to be able to adapt to their surroundings and to individual users, researchers have focused on adaptation techniques, giving rise to the field of Adaptive Dialogue Systems (ADS). One important sub field of ADS is learning what the system should say or do next. Most of the work done in this direction assumes the system performs simple actions, rather than behaviours described by sub-dialogues, i.e. complex actions. To this end we propose a novel online algorithm that ranks complex actions according to their performance and selects the top-k performing ones. To the best of our knowledge there is currently no work describing methods to learn policies for complex actions in an online fashion. We also propose an online algorithm able to estimate the effects of system actions on the user's state, in order to make more informed decisions about which action to take and guide the learning algorithm towards a desired final user state. Our results show that the proposed complex action learning algorithm outperforms simple Hierarchical Reinforcement Learning algorithms and that we are able to successfully estimate the effect an action will have on the user's state, using emotional states as a case study.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 IEEE 24th International Conference on Tools with Artificial Intelligence

自引率

0.00%

发文量