基于关键点的机器人抓取检测方法

IF 5.2 2区计算机科学 Q2 ROBOTICS

Journal of Field Robotics Pub Date : 2024-10-11 DOI:10.1002/rob.22447

Song Yan, Lei Zhang

{"title":"基于关键点的机器人抓取检测方法","authors":"Song Yan, Lei Zhang","doi":"10.1002/rob.22447","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.</p>\n </div>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"42 4","pages":"1271-1286"},"PeriodicalIF":5.2000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robot Grasping Detection Method Based on Keypoints\",\"authors\":\"Song Yan, Lei Zhang\",\"doi\":\"10.1002/rob.22447\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.</p>\\n </div>\",\"PeriodicalId\":192,\"journal\":{\"name\":\"Journal of Field Robotics\",\"volume\":\"42 4\",\"pages\":\"1271-1286\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2024-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Field Robotics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rob.22447\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22447","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

摘要

本文介绍了一种新的基于关键点的抓取检测网络，称为gkscconvn - net，该网络可在n通道输入图像上运行。网络架构包括三个SCConv2D层和三个SCConvT2D层。SCConvT2D层促进上采样，以保持输出和输入图像之间的一致尺寸。结果输出由表示左抓地点、右抓地点和抓地点中心关键点的映射组成。结合关键点细化模块和特征融合模块，提高了预测的准确性。为了验证模型的泛化和适用性，我们在不同的数据集上进行了全面的训练、测试和评估，包括Cornell数据集、Jacquard数据集以及其他代表真实场景的数据集。此外，利用消融实验验证了SCConv内空间重构单元（SRU）和通道重构单元（CRU）的有效性，探讨了它们对抓取关键点检测结果的影响。真实的机器人抓取实验最终肯定了该模型在实际环境中的出色表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Robot Grasping Detection Method Based on Keypoints

This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Field Robotics 工程技术-机器人学

CiteScore

15.00

自引率

3.60%

发文量

审稿时长

6 months

期刊介绍： The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.