Agent-mediated application emergence through reinforcement learning from user feedback

2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) Pub Date : 2020-09-01 DOI:10.1109/WETICE49692.2020.00009

Walid Younes, F. Adreit, Sylvie Trouilhet, J. Arcangeli

引用次数: 5

Abstract

Cyber-physical and ambient systems surround the human user with applications that should be tailored as possible to her/his preferences and the current situation. We propose to build them automatically and on the fly by composition of software components present at the time in the environment, but without prior expression of the user’s needs or process specification or composition model. In order to produce knowledge useful for automatic composition in the absence of an initial guideline, we have developed a generic solution based on lifelong online reinforcement learning. It is decentralized within a multi-agent system where agents learn incrementally from user feedback to satisfy her/him. Different use cases have been experimented in which applications, adapted to the user and the situation, are composed and emerge automatically and continuously.

查看原文本刊更多论文

通过强化学习从用户反馈agent介导的应用涌现

网络物理和环境系统围绕着人类用户，应用程序应该尽可能地根据他/她的偏好和当前情况进行定制。我们建议通过组合环境中当时存在的软件组件来自动地、动态地构建它们，但不需要事先表达用户需求、过程规范或组合模型。为了在没有初始指导的情况下产生对自动作文有用的知识，我们开发了一个基于终身在线强化学习的通用解决方案。它在多代理系统中是分散的，代理从用户反馈中逐渐学习以满足她/他。已经试验了不同的用例，在这些用例中，适应用户和情况的应用程序是自动和连续地组成和出现的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)

自引率

0.00%

发文量