{"title":"MBB-YOLO:全面改进的拥挤物体检测轻量级算法","authors":"Junguo Liao, Haonan Tian","doi":"10.1002/cpe.8219","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Object detection in crowded scenes involves various difficulties, such as small objects, occluded objects, and insufficient features. Existing models for crowded object detection often focus on only one detection difficulty, and they are too large to be applied in practice. To address the diverse challenges of object detection in crowded scenes, we construct a lightweight crowded object detector called MBB-YOLO, which contains several modules for comprehensive improvement. To improve the network's ability to extract fine-grained features, we use SPD-Conv and the proposed MS-Conv to replace the strided convolution in the network. An bi-branch multi-scale convolution attention (BMCA) module is proposed to aggregate multi-scale contextual information. We also propose boundary-NMS to better identify proposal boxes from different objects, which reduces suppression errors caused by object occlusion. MBB-YOLO achieves 87.6% AP and an inference speed of 78.8 FPS on the CrowdHuman dataset, which surpasses other mainstream lightweight object detectors.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 22","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MBB-YOLO: A comprehensively improved lightweight algorithm for crowded object detection\",\"authors\":\"Junguo Liao, Haonan Tian\",\"doi\":\"10.1002/cpe.8219\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Object detection in crowded scenes involves various difficulties, such as small objects, occluded objects, and insufficient features. Existing models for crowded object detection often focus on only one detection difficulty, and they are too large to be applied in practice. To address the diverse challenges of object detection in crowded scenes, we construct a lightweight crowded object detector called MBB-YOLO, which contains several modules for comprehensive improvement. To improve the network's ability to extract fine-grained features, we use SPD-Conv and the proposed MS-Conv to replace the strided convolution in the network. An bi-branch multi-scale convolution attention (BMCA) module is proposed to aggregate multi-scale contextual information. We also propose boundary-NMS to better identify proposal boxes from different objects, which reduces suppression errors caused by object occlusion. MBB-YOLO achieves 87.6% AP and an inference speed of 78.8 FPS on the CrowdHuman dataset, which surpasses other mainstream lightweight object detectors.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"36 22\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8219\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8219","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
MBB-YOLO: A comprehensively improved lightweight algorithm for crowded object detection
Object detection in crowded scenes involves various difficulties, such as small objects, occluded objects, and insufficient features. Existing models for crowded object detection often focus on only one detection difficulty, and they are too large to be applied in practice. To address the diverse challenges of object detection in crowded scenes, we construct a lightweight crowded object detector called MBB-YOLO, which contains several modules for comprehensive improvement. To improve the network's ability to extract fine-grained features, we use SPD-Conv and the proposed MS-Conv to replace the strided convolution in the network. An bi-branch multi-scale convolution attention (BMCA) module is proposed to aggregate multi-scale contextual information. We also propose boundary-NMS to better identify proposal boxes from different objects, which reduces suppression errors caused by object occlusion. MBB-YOLO achieves 87.6% AP and an inference speed of 78.8 FPS on the CrowdHuman dataset, which surpasses other mainstream lightweight object detectors.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.