Xudong Gao , Xiaoguang Gao , Jia Rong , Xiaowei Chen , Xiang Liao , Jun Chen
{"title":"HB-net: Holistic bursting cell cluster integrated network for occluded multi-objects recognition","authors":"Xudong Gao , Xiaoguang Gao , Jia Rong , Xiaowei Chen , Xiang Liao , Jun Chen","doi":"10.1016/j.neucom.2025.130071","DOIUrl":null,"url":null,"abstract":"<div><div>Within the realm of image recognition, a specific category of multi-label classification (MLC) challenges arises when objects within the visual field may occlude one another, demanding simultaneous identification of both occluded and occluding objects. While traditional convolutional neural networks (CNNs) address these tasks, they are often bulky and achieve only moderate accuracy. To overcome this limitation, this paper introduces HB-net, a novel integrated network framework inspired by the Holistic Bursting (HB) cell from cutting-edge neural science research. Built upon the foundation of HB cell clusters, HB-net is designed to address the intricate task of simultaneously recognizing multiple occluded objects within images. The framework incorporates various Bursting cell cluster structures along with an evidence accumulation mechanism to enhance performance. Testing on multiple datasets, including digits and letters, shows that models incorporating the HB framework achieve a significant 2.98% improvement in recognition accuracy compared to models without the HB framework (1.0298 times, p=0.0499). Although in high-noise settings, standard CNNs exhibit slightly greater robustness when compared to HB-net models, the models that combine the HB framework and EA mechanism achieve a comparable level of accuracy and resilience to ResNet50, despite having only three convolutional layers and approximately <span><math><mrow><mn>1</mn><mo>/</mo><mn>30</mn></mrow></math></span> of the parameters. These findings of this study offer valuable insights for improving computer vision algorithms. The essential code is provided at <span><span>https://github.com/d-lab438/hb-net.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"637 ","pages":"Article 130071"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092523122500743X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Within the realm of image recognition, a specific category of multi-label classification (MLC) challenges arises when objects within the visual field may occlude one another, demanding simultaneous identification of both occluded and occluding objects. While traditional convolutional neural networks (CNNs) address these tasks, they are often bulky and achieve only moderate accuracy. To overcome this limitation, this paper introduces HB-net, a novel integrated network framework inspired by the Holistic Bursting (HB) cell from cutting-edge neural science research. Built upon the foundation of HB cell clusters, HB-net is designed to address the intricate task of simultaneously recognizing multiple occluded objects within images. The framework incorporates various Bursting cell cluster structures along with an evidence accumulation mechanism to enhance performance. Testing on multiple datasets, including digits and letters, shows that models incorporating the HB framework achieve a significant 2.98% improvement in recognition accuracy compared to models without the HB framework (1.0298 times, p=0.0499). Although in high-noise settings, standard CNNs exhibit slightly greater robustness when compared to HB-net models, the models that combine the HB framework and EA mechanism achieve a comparable level of accuracy and resilience to ResNet50, despite having only three convolutional layers and approximately of the parameters. These findings of this study offer valuable insights for improving computer vision algorithms. The essential code is provided at https://github.com/d-lab438/hb-net.git.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.