Reduction of Complexity in Q-Learning a Robot Control for an Assembly Cell by using Multiple Agents

Georg Kunert, T. Pawletta, Sven Hartmann
{"title":"Reduction of Complexity in Q-Learning a Robot Control for an Assembly Cell by using Multiple Agents","authors":"Georg Kunert, T. Pawletta, Sven Hartmann","doi":"10.11128/sne.30.tn.10524","DOIUrl":null,"url":null,"abstract":"Production systems in Industry 4.0 are characterized by a high degree of system networking and adaptability. They are often characterized by jointed-arm robots, which have a high degree of adaptation. Networking and adaptivity increase the flexibility of a system, but also the complexity of the control, which requires the use of new development methods. In this context, the Simulation-Based Control approach, a model-based design method, and the concept of Reinforcement Learning (RL) are introduced and it is shown how a task-based robot control can be learned and executed. Afterwards, the time complexity of the Q-learning method will be examined using the application example of a robot-based assembly cell with two differently flexible system configurations. It is shown that, depending on the system configuration, the time complexity of learning can be significantly reduced when using several agents. In the studied case, the complexity decreases from exponential to linear. The modified RL structure is discussed in detail.","PeriodicalId":262785,"journal":{"name":"Simul. Notes Eur.","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simul. Notes Eur.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11128/sne.30.tn.10524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Production systems in Industry 4.0 are characterized by a high degree of system networking and adaptability. They often employ jointed-arm robots, which offer a high degree of adaptability. Networking and adaptivity increase the flexibility of a system, but also the complexity of its control, which calls for new development methods. In this context, the Simulation-Based Control approach, a model-based design method, and the concept of Reinforcement Learning (RL) are introduced, and it is shown how a task-based robot control can be learned and executed. Subsequently, the time complexity of the Q-learning method is examined using the application example of a robot-based assembly cell with two system configurations of differing flexibility. It is shown that, depending on the system configuration, the time complexity of learning can be significantly reduced when several agents are used. In the studied case, the complexity decreases from exponential to linear. The modified RL structure is discussed in detail.
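
To make the abstract's complexity argument concrete, the sketch below is a minimal illustration (not taken from the paper; the names n_stations, n_states_each, n_actions_each, q_update and all numeric values are hypothetical) of why decomposing the control task across several agents shrinks the learned state-action space: a single agent over the joint system needs a Q-table whose size grows exponentially with the number of sub-systems, whereas one agent per sub-system yields a total table size that grows only linearly. The update rule shown is the standard tabular Q-learning update.

```python
# Minimal sketch, assuming a cell decomposed into independently controllable
# sub-systems; all sizes are hypothetical and chosen only for illustration.
import numpy as np

n_stations = 4        # hypothetical number of sub-systems in the cell
n_states_each = 10    # hypothetical discrete states per sub-system
n_actions_each = 5    # hypothetical discrete actions per sub-system

# Single agent over the joint system: table size grows exponentially
# with the number of sub-systems.
joint_table_size = (n_states_each ** n_stations) * (n_actions_each ** n_stations)

# One agent per sub-system: total table size grows only linearly.
multi_agent_table_size = n_stations * n_states_each * n_actions_each

print(joint_table_size)        # 6250000
print(multi_agent_table_size)  # 200

# Standard tabular Q-learning update applied by each agent:
# Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])

# Example: one agent's table and a single update step.
Q = np.zeros((n_states_each, n_actions_each))
q_update(Q, s=0, a=2, r=1.0, s_next=3)
```

The sketch only illustrates the table-size argument behind the reported reduction from exponential to linear learning complexity; the paper's actual agent structure and reward design are discussed in the full text.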