有效地学习单臂投掷运动平滑服装

International Symposium of Robotics Research Pub Date : 2022-06-17 DOI:10.48550/arXiv.2206.08921

Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg

{"title":"有效地学习单臂投掷运动平滑服装","authors":"Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg","doi":"10.48550/arXiv.2206.08921","DOIUrl":null,"url":null,"abstract":"Recent work has shown that 2-arm\"fling\"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.","PeriodicalId":136210,"journal":{"name":"International Symposium of Robotics Research","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Efficiently Learning Single-Arm Fling Motions to Smooth Garments\",\"authors\":\"Lawrence Yunliang Chen, Huang Huang, Ellen R. Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, T. Kollar, Ken Goldberg\",\"doi\":\"10.48550/arXiv.2206.08921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent work has shown that 2-arm\\\"fling\\\"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.\",\"PeriodicalId\":136210,\"journal\":{\"name\":\"International Symposium of Robotics Research\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Symposium of Robotics Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2206.08921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Symposium of Robotics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2206.08921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

最近的研究表明，两臂“抛”运动可以有效地使衣服光滑。我们考虑单臂投掷动作。双臂抛掷运动对机器人轨迹参数的调整较少，而单臂抛掷运动对轨迹参数非常敏感。我们考虑一个单6自由度机器人手臂，它学习投掷轨迹以实现高服装覆盖率。给定一个服装抓点，机器人在物理实验中探索不同的参数化抛动轨迹。为了提高学习效率，我们提出了一种从粗到精的学习方法，该方法首先使用多臂强盗(MAB)框架有效地找到候选投掷动作，然后通过连续优化方法对其进行细化。此外，我们提出了基于投掷结果不确定性的新的训练和执行时间停止标准;训练时间停止标准提高了数据效率，而执行时间停止标准利用重复的投掷动作来提高性能。与基线相比，该方法显著加快了学习速度。此外，通过自我监督收集类似服装的经验，新服装的MAB学习时间最多减少了87%。我们对36件真实的服装进行了评估:毛巾、t恤、长袖衬衫、连衣裙、运动裤和牛仔裤。结果表明，利用先前的经验，机器人需要在30分钟内学习一件新衣服的投掷动作，覆盖率达到60-94%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Efficiently Learning Single-Arm Fling Motions to Smooth Garments

Recent work has shown that 2-arm"fling"motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment grasp point, the robot explores different parameterized fling trajectories in physical experiments. To improve learning efficiency, we propose a coarse-to-fine learning method that first uses a multi-armed bandit (MAB) framework to efficiently find a candidate fling action, which it then refines via a continuous optimization method. Further, we propose novel training and execution-time stopping criteria based on fling outcome uncertainty; the training-time stopping criterion increases data efficiency while the execution-time stopping criteria leverage repeated fling actions to increase performance. Compared to baselines, the proposed method significantly accelerates learning. Moreover, with prior experience on similar garments collected through self-supervision, the MAB learning time for a new garment is reduced by up to 87%. We evaluate on 36 real garments: towels, T-shirts, long-sleeve shirts, dresses, sweat pants, and jeans. Results suggest that using prior experience, a robot requires under 30 minutes to learn a fling action for a novel garment that achieves 60-94% coverage.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Symposium of Robotics Research

自引率

0.00%

发文量