Shuang Zeng , Hai Liu , Tingting Liu , Qiuxia Liu , Minhong Wang , Bing Yang , Zhaoli Zhang
{"title":"Exploiting module evolution correlation relationship for fine-grained bird image classification with structural functional representation","authors":"Shuang Zeng , Hai Liu , Tingting Liu , Qiuxia Liu , Minhong Wang , Bing Yang , Zhaoli Zhang","doi":"10.1016/j.neucom.2025.130609","DOIUrl":null,"url":null,"abstract":"<div><div>Fine-grained bird image classification (FBIC) targeting the classification of avian subspecies is facing challenges due to confusing background, partly occlusions and varied posture. In this paper, we proposed a novel module evolution correlation relationship modeling for FBIC task, which can learn structural functional representations among different functional feathers in a bird. Specially, the proposed EMECR model includes two modules, such as periodic topology mining (PTM), and multi-scale semantics alignment strategy (MSAS). The PTM module is proposed to reveal local periodic organizations with implicit functional expressions, and the MSAS is leveraged for better semantic modeling. In addition, a joint loss is designed to suppress the outliers and enforce semantic consistency. Furthermore, to better model the module evolution relationships between different functional representations and hierarchical context correlation, Mamba architecture is employed as the decoder with its linear computational complexity. Experiments on CUB-200–2011 and NABirds verify that our method can obtain robust results and significantly outperform the existing state-of-the-art FBIC methods. Extended experiments have been conducted on the Stanford Cars dataset to suggest the potential of generalizing our method on other fine-grained visual classification tasks.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"647 ","pages":"Article 130609"},"PeriodicalIF":5.5000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225012810","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Fine-grained bird image classification (FBIC) targeting the classification of avian subspecies is facing challenges due to confusing background, partly occlusions and varied posture. In this paper, we proposed a novel module evolution correlation relationship modeling for FBIC task, which can learn structural functional representations among different functional feathers in a bird. Specially, the proposed EMECR model includes two modules, such as periodic topology mining (PTM), and multi-scale semantics alignment strategy (MSAS). The PTM module is proposed to reveal local periodic organizations with implicit functional expressions, and the MSAS is leveraged for better semantic modeling. In addition, a joint loss is designed to suppress the outliers and enforce semantic consistency. Furthermore, to better model the module evolution relationships between different functional representations and hierarchical context correlation, Mamba architecture is employed as the decoder with its linear computational complexity. Experiments on CUB-200–2011 and NABirds verify that our method can obtain robust results and significantly outperform the existing state-of-the-art FBIC methods. Extended experiments have been conducted on the Stanford Cars dataset to suggest the potential of generalizing our method on other fine-grained visual classification tasks.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.