TROPE: Triplet-Guided Feature Refinement for Person Re-Identification

IF 5.3 3区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Emerging Topics in Computational Intelligence Pub Date : 2024-06-04 DOI:10.1109/TETCI.2024.3406411

Divya Singh;Jimson Mathew;Mayank Agarwal;Mahesh Govind

{"title":"TROPE: Triplet-Guided Feature Refinement for Person Re-Identification","authors":"Divya Singh;Jimson Mathew;Mayank Agarwal;Mahesh Govind","doi":"10.1109/TETCI.2024.3406411","DOIUrl":null,"url":null,"abstract":"Person Re-Identification (PRid) has garnered research attention with the rising popularity of intelligent video surveillance. Deep learning methods like Convolutional Neural Networks (CNN) are vital in PRid. The CNNs acquire image characteristics that are distinctive, referred to as image features, by analyzing the entire image. This process enables them to recognize and differentiate between various images. However, in the case of PRid, this search across the entire image may lead the model to emphasize distinctive image features of the background while neglecting subtle but essential distinguishing regions of the person. This tendency can be observed in the heat maps generated from trained models. Therefore, it is crucial to direct the model's attention towards the vital regions of a person. Relying solely on global features might be limited in effectiveness, as it does not sufficiently capture essential finer details. Therefore, it becomes necessary to pinpoint significant features at a local level and guide the model to prioritize these features for improved results. Inspired to identify and prioritize vital regions based on local features, we propose TROPE (Triplet-Guided Feature Refinement for Person Re-Identification) in this paper. The technique involves analyzing the intermediate features of hard positive and hard negative images and generating weight vectors. These vectors are utilized to shift the attention of the network to specific regions of interest. The proposed method achieves 87.5% mAP and 95.6% Rank1 accuracy on Market1501. 77.6% mAP and 88.8% Rank1 accuracy on DukeMTMC and 78.5% mAP, 80.1% Rank1 accuracy on CUHK03 dataset.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 1","pages":"706-716"},"PeriodicalIF":5.3000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10547534/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Person Re-Identification (PRid) has garnered research attention with the rising popularity of intelligent video surveillance. Deep learning methods like Convolutional Neural Networks (CNN) are vital in PRid. The CNNs acquire image characteristics that are distinctive, referred to as image features, by analyzing the entire image. This process enables them to recognize and differentiate between various images. However, in the case of PRid, this search across the entire image may lead the model to emphasize distinctive image features of the background while neglecting subtle but essential distinguishing regions of the person. This tendency can be observed in the heat maps generated from trained models. Therefore, it is crucial to direct the model's attention towards the vital regions of a person. Relying solely on global features might be limited in effectiveness, as it does not sufficiently capture essential finer details. Therefore, it becomes necessary to pinpoint significant features at a local level and guide the model to prioritize these features for improved results. Inspired to identify and prioritize vital regions based on local features, we propose TROPE (Triplet-Guided Feature Refinement for Person Re-Identification) in this paper. The technique involves analyzing the intermediate features of hard positive and hard negative images and generating weight vectors. These vectors are utilized to shift the attention of the network to specific regions of interest. The proposed method achieves 87.5% mAP and 95.6% Rank1 accuracy on Market1501. 77.6% mAP and 88.8% Rank1 accuracy on DukeMTMC and 78.5% mAP, 80.1% Rank1 accuracy on CUHK03 dataset.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Emerging Topics in Computational Intelligence Mathematics-Control and Optimization

CiteScore

10.30

自引率

7.50%

发文量

147

期刊介绍： The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys. TETCI is an electronics only publication. TETCI publishes six issues per year. Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.