{"title":"基于扩展残差卷积神经网络的机器人抓取","authors":"Jing Zhao, Liang Ye, Yufeng Wang, Huasong Min","doi":"10.1109/WRCSARA57040.2022.9903934","DOIUrl":null,"url":null,"abstract":"Grasping detection for multiple objects is an important task in the application of robots. Despite the achievement obtained by several encoding-decoding deep learning models, grasping detection for objects with much different sizes is still not well solved. In this paper, a Dilated Residual Connection Neural Network (DR-ConvNet) is proposed to achieve the multi-object grasping detection in RGB-D images. On one hand, the detection and location for small objects can addressed well due to the encoding-decoding structure. On the other hand, the proposed dilated residual module can capture more complete features for large objects by its larger receptive field than ordinary convolutions. To verify and highlight the advantages of the proposed DR-ConvNet, an expanded new dataset called S-Cornell is built based on the Cornell grasping dataset. On this dataset, the proposed method can achieve 9S.7% and 95.4% accuracy in single-object and multi-object grasping detection respectively. Several experiments prove that the proposed model can outperform some current popular grasping detection methods in RGB-D images.","PeriodicalId":106730,"journal":{"name":"2022 WRC Symposium on Advanced Robotics and Automation (WRC SARA)","volume":"263 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robot Grasping using Dilated Residual Convolutional Neural Network\",\"authors\":\"Jing Zhao, Liang Ye, Yufeng Wang, Huasong Min\",\"doi\":\"10.1109/WRCSARA57040.2022.9903934\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Grasping detection for multiple objects is an important task in the application of robots. Despite the achievement obtained by several encoding-decoding deep learning models, grasping detection for objects with much different sizes is still not well solved. In this paper, a Dilated Residual Connection Neural Network (DR-ConvNet) is proposed to achieve the multi-object grasping detection in RGB-D images. On one hand, the detection and location for small objects can addressed well due to the encoding-decoding structure. On the other hand, the proposed dilated residual module can capture more complete features for large objects by its larger receptive field than ordinary convolutions. To verify and highlight the advantages of the proposed DR-ConvNet, an expanded new dataset called S-Cornell is built based on the Cornell grasping dataset. On this dataset, the proposed method can achieve 9S.7% and 95.4% accuracy in single-object and multi-object grasping detection respectively. Several experiments prove that the proposed model can outperform some current popular grasping detection methods in RGB-D images.\",\"PeriodicalId\":106730,\"journal\":{\"name\":\"2022 WRC Symposium on Advanced Robotics and Automation (WRC SARA)\",\"volume\":\"263 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 WRC Symposium on Advanced Robotics and Automation (WRC SARA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WRCSARA57040.2022.9903934\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 WRC Symposium on Advanced Robotics and Automation (WRC SARA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WRCSARA57040.2022.9903934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robot Grasping using Dilated Residual Convolutional Neural Network
Grasping detection for multiple objects is an important task in the application of robots. Despite the achievement obtained by several encoding-decoding deep learning models, grasping detection for objects with much different sizes is still not well solved. In this paper, a Dilated Residual Connection Neural Network (DR-ConvNet) is proposed to achieve the multi-object grasping detection in RGB-D images. On one hand, the detection and location for small objects can addressed well due to the encoding-decoding structure. On the other hand, the proposed dilated residual module can capture more complete features for large objects by its larger receptive field than ordinary convolutions. To verify and highlight the advantages of the proposed DR-ConvNet, an expanded new dataset called S-Cornell is built based on the Cornell grasping dataset. On this dataset, the proposed method can achieve 9S.7% and 95.4% accuracy in single-object and multi-object grasping detection respectively. Several experiments prove that the proposed model can outperform some current popular grasping detection methods in RGB-D images.