{"title":"FMFI: Transformer based four branches multi-granularity feature integration for person Re-ID","authors":"Xiaohan Zheng, Jian Lu, Jiahui Xing, Kaibing Zhang","doi":"10.1016/j.eswa.2025.128150","DOIUrl":null,"url":null,"abstract":"<div><div>Extracting multi-granularity features is a critical challenge in person re-identification (Re-ID). While convolutional neural networks (CNNs) are effectively capture local salient features through convolutional kernels, they struggle to construct globally discriminative representations. In contrast, Transformer networks can model long-range dependencies and establish global contextual relationships among features, making them a powerful tool for multi-granularity feature learning in Re-ID. To comprehensively extract multi-granularity features from various aspects such as clothing attributes, walking postures, and social interactions, we propose FMFI, a four-branch multi-granularity feature extraction and integration method based on Transformer. FMFI employs a four-branch architecture to capture diverse feature representations, enhancing the model’s expressiveness and robustness. Specifically, we introduce the Quadra-Net (QN) structure, which extends from the final layer of a replicated Transformer. By appropriately scaling the branch-wise feature weights and aggregating global tokens from all four branches, FMFI constructs enhanced global feature representations. Furthermore, we design the Refined Global Feature (RGF) module, which refines the initial global features and establishes connections with the newly integrated features, leading to more distinctive and discriminative global representations. Extensive experiments on the Market1501, CUHK03, and MSMT17 Re-ID datasets demonstrate that the proposed FMFI method outperforms most existing Re-ID approaches. Our model significantly enriches feature representations and improves the extraction of multi-granularity features, thereby enhancing person re-identification performance.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"288 ","pages":"Article 128150"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425017701","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Extracting multi-granularity features is a critical challenge in person re-identification (Re-ID). While convolutional neural networks (CNNs) are effectively capture local salient features through convolutional kernels, they struggle to construct globally discriminative representations. In contrast, Transformer networks can model long-range dependencies and establish global contextual relationships among features, making them a powerful tool for multi-granularity feature learning in Re-ID. To comprehensively extract multi-granularity features from various aspects such as clothing attributes, walking postures, and social interactions, we propose FMFI, a four-branch multi-granularity feature extraction and integration method based on Transformer. FMFI employs a four-branch architecture to capture diverse feature representations, enhancing the model’s expressiveness and robustness. Specifically, we introduce the Quadra-Net (QN) structure, which extends from the final layer of a replicated Transformer. By appropriately scaling the branch-wise feature weights and aggregating global tokens from all four branches, FMFI constructs enhanced global feature representations. Furthermore, we design the Refined Global Feature (RGF) module, which refines the initial global features and establishes connections with the newly integrated features, leading to more distinctive and discriminative global representations. Extensive experiments on the Market1501, CUHK03, and MSMT17 Re-ID datasets demonstrate that the proposed FMFI method outperforms most existing Re-ID approaches. Our model significantly enriches feature representations and improves the extraction of multi-granularity features, thereby enhancing person re-identification performance.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.