{"title":"SUGrasping: a semantic grasping framework based on multi-head 3D U-Net","authors":"He Cao, Yunzhou Zhang, Zhexue Ge, Xin Chen, Xiaozheng Liu, Jiaqi Zhao","doi":"10.1007/s11042-024-20037-w","DOIUrl":null,"url":null,"abstract":"<p>Object grasping is an important skill for robots to interact with the real world, especially in unstructured environments where occlusions and different shapes of target objects are present. In this work, we introduce a robot grasping pipeline called SUGrasping, which can obtain the grasping poses more precisely for target objects. The grasping pipeline treats the Truncated Signed Distance Function (TSDF) and point clouds of the grasping scene as input simultaneously. The proposed multi-head 3D U-Net accepts reconstructed TSDF representation and outputs the grasping configurations, including predicted grasp quality, orientation and width of the gripper. The point cloud is fed into PointNet to obtain the semantic segmentation results for all objects in the grasping workspace. With the help of point cloud inside the gripper, the relationship between the gripper and semantic information can be established. It makes robots know which object they are grasping, rather than just removing objects in the workspace like previous works. Experimental results show that the proposed method has an improvement in grasping success rate and percent cleared of target objects, which outperforms state-of-the-art methods compared in this paper.</p>","PeriodicalId":18770,"journal":{"name":"Multimedia Tools and Applications","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimedia Tools and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11042-024-20037-w","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Object grasping is an important skill for robots to interact with the real world, especially in unstructured environments where occlusions and different shapes of target objects are present. In this work, we introduce a robot grasping pipeline called SUGrasping, which can obtain the grasping poses more precisely for target objects. The grasping pipeline treats the Truncated Signed Distance Function (TSDF) and point clouds of the grasping scene as input simultaneously. The proposed multi-head 3D U-Net accepts reconstructed TSDF representation and outputs the grasping configurations, including predicted grasp quality, orientation and width of the gripper. The point cloud is fed into PointNet to obtain the semantic segmentation results for all objects in the grasping workspace. With the help of point cloud inside the gripper, the relationship between the gripper and semantic information can be established. It makes robots know which object they are grasping, rather than just removing objects in the workspace like previous works. Experimental results show that the proposed method has an improvement in grasping success rate and percent cleared of target objects, which outperforms state-of-the-art methods compared in this paper.
期刊介绍:
Multimedia Tools and Applications publishes original research articles on multimedia development and system support tools as well as case studies of multimedia applications. It also features experimental and survey articles. The journal is intended for academics, practitioners, scientists and engineers who are involved in multimedia system research, design and applications. All papers are peer reviewed.
Specific areas of interest include:
- Multimedia Tools:
- Multimedia Applications:
- Prototype multimedia systems and platforms