有效地学习单臂投掷运动平滑服装

Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg
{"title":"有效地学习单臂投掷运动平滑服装","authors":"Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg","doi":"10.48550/arXiv.2206.08921","DOIUrl":null,"url":null,"abstract":"Recent work has shown that 2-arm\"fling\"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.","PeriodicalId":136210,"journal":{"name":"International Symposium of Robotics Research","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Efficiently Learning Single-Arm Fling Motions to Smooth Garments\",\"authors\":\"Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg\",\"doi\":\"10.48550/arXiv.2206.08921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent work has shown that 2-arm\\\"fling\\\"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.\",\"PeriodicalId\":136210,\"journal\":{\"name\":\"International Symposium of Robotics Research\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Symposium of Robotics Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2206.08921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Symposium of Robotics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2206.08921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

最近的研究表明,两臂“抛”运动可以有效地使衣服光滑。我们考虑单臂投掷动作。双臂抛掷运动对机器人轨迹参数的调整较少,而单臂抛掷运动对轨迹参数非常敏感。我们考虑一个单6自由度机器人手臂,它学习投掷轨迹以实现高服装覆盖率。给定一个服装抓点,机器人在物理实验中探索不同的参数化抛动轨迹。为了提高学习效率,我们提出了一种从粗到精的学习方法,该方法首先使用多臂强盗(MAB)框架有效地找到候选投掷动作,然后通过连续优化方法对其进行细化。此外,我们提出了基于投掷结果不确定性的新的训练和执行时间停止标准;训练时间停止标准提高了数据效率,而执行时间停止标准利用重复的投掷动作来提高性能。与基线相比,该方法显著加快了学习速度。此外,通过自我监督收集类似服装的经验,新服装的MAB学习时间最多减少了87%。我们对36件真实的服装进行了评估:毛巾、t恤、长袖衬衫、连衣裙、运动裤和牛仔裤。结果表明,利用先前的经验,机器人需要在30分钟内学习一件新衣服的投掷动作,覆盖率达到60-94%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficiently Learning Single-Arm Fling Motions to Smooth Garments
Recent work has shown that 2-arm"fling"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信