Reinforcement Learning for User Intent Prediction in Customer Service Bots

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval Pub Date : 2019-07-18 DOI:10.1145/3331184.3331370

Cen Chen, Chilin Fu, Xujun Hu, Xiaolu Zhang, Jun Zhou, Xiaolong Li, F. S. Bao

引用次数: 15

Abstract

A customer service bot is now a necessary component of an e-commerce platform. As a core module of the customer service bot, user intent prediction can help predict user questions before they ask. A typical solution is to find top candidate questions that a user will be interested in. Such solution ignores the inter-relationship between questions and often aims to maximize the immediate reward such as clicks, which may not be ideal in practice. Hence, we propose to view the problem as a sequential decision making process to better capture the long-term effects of each recommendation in the list. Intuitively, we formulate the problem as a Markov decision process and consider using reinforcement learning for the problem. With this approach, questions presented to users are both relevant and diverse. Experiments on offline real-world dataset and online system demonstrate the effectiveness of our proposed approach.

查看原文本刊更多论文

客服机器人中用户意图预测的强化学习

客户服务机器人现在是电子商务平台的必要组成部分。用户意图预测是客服机器人的核心模块，可以在用户提问之前预测用户的问题。一个典型的解决方案是找到用户可能感兴趣的最佳候选问题。这样的解决方案忽略了问题之间的相互关系，通常旨在最大化点击等即时奖励，这在实践中可能并不理想。因此，我们建议将问题视为一个连续的决策制定过程，以更好地捕获列表中每个建议的长期影响。直观地，我们将问题表述为马尔可夫决策过程，并考虑使用强化学习来解决问题。通过这种方法，呈现给用户的问题既相关又多样。在离线真实数据集和在线系统上的实验证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

自引率

0.00%

发文量