学习提高机器人执行非重复性任务时的表现

Volume 2: 41st Computers and Information in Engineering Conference (CIE) Pub Date : 2021-08-17 DOI:10.1115/detc2021-67627

Yeo Jung Yoon, Satyandra K. Gupta

{"title":"学习提高机器人执行非重复性任务时的表现","authors":"Yeo Jung Yoon, Satyandra K. Gupta","doi":"10.1115/detc2021-67627","DOIUrl":null,"url":null,"abstract":"\n To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.","PeriodicalId":23602,"journal":{"name":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning to Improve Performance During Non-Repetitive Tasks Performed by Robots\",\"authors\":\"Yeo Jung Yoon, Satyandra K. Gupta\",\"doi\":\"10.1115/detc2021-67627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.\",\"PeriodicalId\":23602,\"journal\":{\"name\":\"Volume 2: 41st Computers and Information in Engineering Conference (CIE)\",\"volume\":\"34 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 2: 41st Computers and Information in Engineering Conference (CIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2021-67627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-67627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了执行非重复性任务，机器人需要对任务进行学习以提高任务性能。无法为此类非重复性任务预先构建性能模型。机器人可以执行具有一定工艺参数的任务的一小部分，并根据对任务执行情况的观察有吸引力地更新工艺参数。为了使学习过程高效，应该明确考虑探索和利用之间的权衡。过多的探索可能会导致时间的浪费，而对任务绩效没有明显的改善。另一方面，过早停止探索可能会导致任务性能不理想。本文描述了一种序列决策方法来选择参数集以提高任务性能。整体学习方法采用可行性偏抽样、代理模型构建和贪心优化。我们在机器人打磨的模拟中实现了我们的方法。并与其它实验设计方法进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning to Improve Performance During Non-Repetitive Tasks Performed by Robots

To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Volume 2: 41st Computers and Information in Engineering Conference (CIE)

自引率

0.00%

发文量