学习提高机器人执行非重复性任务时的表现

Yeo Jung Yoon, Satyandra K. Gupta
{"title":"学习提高机器人执行非重复性任务时的表现","authors":"Yeo Jung Yoon, Satyandra K. Gupta","doi":"10.1115/detc2021-67627","DOIUrl":null,"url":null,"abstract":"\n To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.","PeriodicalId":23602,"journal":{"name":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning to Improve Performance During Non-Repetitive Tasks Performed by Robots\",\"authors\":\"Yeo Jung Yoon, Satyandra K. Gupta\",\"doi\":\"10.1115/detc2021-67627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.\",\"PeriodicalId\":23602,\"journal\":{\"name\":\"Volume 2: 41st Computers and Information in Engineering Conference (CIE)\",\"volume\":\"34 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 2: 41st Computers and Information in Engineering Conference (CIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2021-67627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-67627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

为了执行非重复性任务,机器人需要对任务进行学习以提高任务性能。无法为此类非重复性任务预先构建性能模型。机器人可以执行具有一定工艺参数的任务的一小部分,并根据对任务执行情况的观察有吸引力地更新工艺参数。为了使学习过程高效,应该明确考虑探索和利用之间的权衡。过多的探索可能会导致时间的浪费,而对任务绩效没有明显的改善。另一方面,过早停止探索可能会导致任务性能不理想。本文描述了一种序列决策方法来选择参数集以提高任务性能。整体学习方法采用可行性偏抽样、代理模型构建和贪心优化。我们在机器人打磨的模拟中实现了我们的方法。并与其它实验设计方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Learning to Improve Performance During Non-Repetitive Tasks Performed by Robots
To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信