基于语义关键点的不同尺寸、形状和柔软度包装鞋的机器人操作框架

IF 5.2 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Yi Dong , Yangjun Liu , Jinjun Duan , Yang Li , Zhendong Dai
{"title":"基于语义关键点的不同尺寸、形状和柔软度包装鞋的机器人操作框架","authors":"Yi Dong ,&nbsp;Yangjun Liu ,&nbsp;Jinjun Duan ,&nbsp;Yang Li ,&nbsp;Zhendong Dai","doi":"10.1016/j.robot.2025.105174","DOIUrl":null,"url":null,"abstract":"<div><div>With the rapid development of the warehousing and logistics industries, the packing of goods has gradually attracted the attention of academia and industry. The packing of footwear products is a typical representative paired-item packing task involving irregular shapes and deformable objects. Although studies on shoe packing have been conducted, different initial states due to the irregular shapes of shoes and standard packing placement poses have not been considered. This study proposes a robotic manipulation framework, including a perception module, reorientation planners, and a packing planner, that can complete the packing of pairs of shoes in any initial state. First, to adapt to the large intraclass variations due to the states, shapes, and deformation of shoes, we propose a vision module based on semantic keypoints, which can also infer additional information such as sizes, states, poses, and manipulation points by combining geometric features. Subsequently, we not only propose primitive-based reorientation methods for different states of a single deformable shoe but also propose a fast reorientation method for the top state using box edge contact and gravity, which further improve the efficiency of reorientation. Finally, based on the perception module and reorientation methods, we propose a task planner for packing paired shoes in any initial state to provide an optimal packing strategy. Real-world experiments were conducted to verify the robustness of the reorientation methods and the effectiveness of the packing strategy for various types of shoes. In this study, we highlight the potential of semantic keypoint representation, introduce new perspectives on the reorientation of 3D deformable objects and multi-object manipulation, and provide a reference for paired object packing.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"194 ","pages":"Article 105174"},"PeriodicalIF":5.2000,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robotic manipulation framework based on semantic keypoints for packing shoes of different sizes, shapes, and softness\",\"authors\":\"Yi Dong ,&nbsp;Yangjun Liu ,&nbsp;Jinjun Duan ,&nbsp;Yang Li ,&nbsp;Zhendong Dai\",\"doi\":\"10.1016/j.robot.2025.105174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the rapid development of the warehousing and logistics industries, the packing of goods has gradually attracted the attention of academia and industry. The packing of footwear products is a typical representative paired-item packing task involving irregular shapes and deformable objects. Although studies on shoe packing have been conducted, different initial states due to the irregular shapes of shoes and standard packing placement poses have not been considered. This study proposes a robotic manipulation framework, including a perception module, reorientation planners, and a packing planner, that can complete the packing of pairs of shoes in any initial state. First, to adapt to the large intraclass variations due to the states, shapes, and deformation of shoes, we propose a vision module based on semantic keypoints, which can also infer additional information such as sizes, states, poses, and manipulation points by combining geometric features. Subsequently, we not only propose primitive-based reorientation methods for different states of a single deformable shoe but also propose a fast reorientation method for the top state using box edge contact and gravity, which further improve the efficiency of reorientation. Finally, based on the perception module and reorientation methods, we propose a task planner for packing paired shoes in any initial state to provide an optimal packing strategy. Real-world experiments were conducted to verify the robustness of the reorientation methods and the effectiveness of the packing strategy for various types of shoes. In this study, we highlight the potential of semantic keypoint representation, introduce new perspectives on the reorientation of 3D deformable objects and multi-object manipulation, and provide a reference for paired object packing.</div></div>\",\"PeriodicalId\":49592,\"journal\":{\"name\":\"Robotics and Autonomous Systems\",\"volume\":\"194 \",\"pages\":\"Article 105174\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Autonomous Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0921889025002714\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025002714","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

随着仓储物流业的快速发展,货物的包装问题逐渐引起学术界和产业界的重视。鞋类产品的包装是一种典型的、具有代表性的、涉及不规则形状和可变形物体的成对包装任务。虽然对鞋的包装进行了研究,但没有考虑到由于鞋的不规则形状和标准包装放置姿势而产生的不同初始状态。本研究提出了一个机器人操作框架,包括感知模块、重新定位规划器和打包规划器,可以在任何初始状态下完成双鞋的打包。首先,为了适应由于鞋子的状态、形状和变形引起的类内大变化,我们提出了一个基于语义关键点的视觉模块,该模块还可以通过结合几何特征来推断尺寸、状态、姿势和操作点等附加信息。在此基础上,提出了基于原语的单变形鞋不同状态的重定向方法,并提出了基于盒边接触和重力的顶部状态快速重定向方法,进一步提高了重定向效率。最后,基于感知模块和重新定位方法,我们提出了一个任务规划器,用于在任何初始状态下打包成对的鞋子,以提供最优的打包策略。通过实际实验验证了该方法的鲁棒性和不同类型鞋子包装策略的有效性。在这项研究中,我们强调了语义关键点表示的潜力,介绍了三维可变形物体的重新定向和多物体操作的新视角,并为成对物体打包提供了参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Robotic manipulation framework based on semantic keypoints for packing shoes of different sizes, shapes, and softness
With the rapid development of the warehousing and logistics industries, the packing of goods has gradually attracted the attention of academia and industry. The packing of footwear products is a typical representative paired-item packing task involving irregular shapes and deformable objects. Although studies on shoe packing have been conducted, different initial states due to the irregular shapes of shoes and standard packing placement poses have not been considered. This study proposes a robotic manipulation framework, including a perception module, reorientation planners, and a packing planner, that can complete the packing of pairs of shoes in any initial state. First, to adapt to the large intraclass variations due to the states, shapes, and deformation of shoes, we propose a vision module based on semantic keypoints, which can also infer additional information such as sizes, states, poses, and manipulation points by combining geometric features. Subsequently, we not only propose primitive-based reorientation methods for different states of a single deformable shoe but also propose a fast reorientation method for the top state using box edge contact and gravity, which further improve the efficiency of reorientation. Finally, based on the perception module and reorientation methods, we propose a task planner for packing paired shoes in any initial state to provide an optimal packing strategy. Real-world experiments were conducted to verify the robustness of the reorientation methods and the effectiveness of the packing strategy for various types of shoes. In this study, we highlight the potential of semantic keypoint representation, introduce new perspectives on the reorientation of 3D deformable objects and multi-object manipulation, and provide a reference for paired object packing.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Robotics and Autonomous Systems
Robotics and Autonomous Systems 工程技术-机器人学
CiteScore
9.00
自引率
7.00%
发文量
164
审稿时长
4.5 months
期刊介绍: Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信