{"title":"基于关键点的机器人抓取检测方法","authors":"Song Yan, Lei Zhang","doi":"10.1002/rob.22447","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.</p>\n </div>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"42 4","pages":"1271-1286"},"PeriodicalIF":4.2000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robot Grasping Detection Method Based on Keypoints\",\"authors\":\"Song Yan, Lei Zhang\",\"doi\":\"10.1002/rob.22447\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.</p>\\n </div>\",\"PeriodicalId\":192,\"journal\":{\"name\":\"Journal of Field Robotics\",\"volume\":\"42 4\",\"pages\":\"1271-1286\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Field Robotics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rob.22447\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22447","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
Robot Grasping Detection Method Based on Keypoints
This study introduces a novel keypoint-based grasp detection network, denoted as GKSCConv-Net, which operates on n-channel input images. The network architecture comprises three SCConv2D layers and three SCConvT2D layers. The SCConvT2D layers facilitate upsampling to maintain consistent dimensions between the output and input images. The resultant output consists of maps indicating left grasp points, right grasp points, and grasp center keypoints. The accuracy of predictions is enhanced through the incorporation of the keypoint refinement module and feature fusion module. To validate the model's generalization and applicability, comprehensive training, testing, and evaluation are conducted on diverse data sets, including the Cornell data set, Jacquard data set, and others representing real-world scenarios. Furthermore, ablation experiments are employed to substantiate the efficacy of the spatial reconstruction unit (SRU) and channel reconstruction unit (CRU) within the SCConv, exploring their impact on grasp keypoint detection outcomes. Real robotic grasping experiments ultimately affirm the model's outstanding performance in practical settings.
期刊介绍:
The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments.
The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.