Xinghua Wang, Yuting Tang, Xiaolong Liu, Jie Wang, Jiawen Cao, Ruijin Sun
{"title":"Research on Robot Target Classification and Localization Based on Improved Mask R-CNN","authors":"Xinghua Wang, Yuting Tang, Xiaolong Liu, Jie Wang, Jiawen Cao, Ruijin Sun","doi":"10.1002/cpe.70247","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The small workpieces are easily missed during detection, and the irregular workpieces are difficult to recognize and segment effectively by traditional detection algorithms in the industrial field. The traditional target detection algorithms have problems such as low accuracy and poor generalization performance. This paper proposes a robot target recognition and positioning method based on the improved Mask R-CNN. First, the network structure is designed to add a Convolutional Block Attention Module (CBAM) in the backbone, replace the Feature Pyramid Network (FPN) structure used in the original model of Mask R-CNN with a Path Aggregation Network (PAN) structure, and increase the receptive field to enhance the recognition of small target objects and the segmentation of multi-objects. Second, after classification is completed, according to the segmentation information, the output is augmented with center coordinates and rotation angle information. Finally, comparative experiments are conducted in the COCO dataset and the industrial part dataset to verify the effectiveness and practicality of the proposed algorithm. The experimental results show that the improved model achieves an AP<sub>50</sub> of 60.6 in the COCO dataset and 99.4 in the industrial parts dataset. Additionally, in single-object and multi-object grasping experiments, the grasping accuracy is 91.5% and 85.3%, respectively.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 21-22","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70247","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
The small workpieces are easily missed during detection, and the irregular workpieces are difficult to recognize and segment effectively by traditional detection algorithms in the industrial field. The traditional target detection algorithms have problems such as low accuracy and poor generalization performance. This paper proposes a robot target recognition and positioning method based on the improved Mask R-CNN. First, the network structure is designed to add a Convolutional Block Attention Module (CBAM) in the backbone, replace the Feature Pyramid Network (FPN) structure used in the original model of Mask R-CNN with a Path Aggregation Network (PAN) structure, and increase the receptive field to enhance the recognition of small target objects and the segmentation of multi-objects. Second, after classification is completed, according to the segmentation information, the output is augmented with center coordinates and rotation angle information. Finally, comparative experiments are conducted in the COCO dataset and the industrial part dataset to verify the effectiveness and practicality of the proposed algorithm. The experimental results show that the improved model achieves an AP50 of 60.6 in the COCO dataset and 99.4 in the industrial parts dataset. Additionally, in single-object and multi-object grasping experiments, the grasping accuracy is 91.5% and 85.3%, respectively.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.