{"title":"用于人员重新识别的反向金字塔注意力引导网络","authors":"Jiang Liu, Wei Bai, Yun Hui","doi":"10.4018/ijcini.349982","DOIUrl":null,"url":null,"abstract":"Person re-identification aims to retrieve pedestrians with the same identity across different cameras. However, current methods increase attention to interfering regions when dealing with complex backgrounds and occlusion, especially in the presence of similar interfering features. To enhance the robustness of the model, we propose the Reverse Pyramid Attention Guidance (RPAG) network, using a reverse pyramid structure to learn features at multiple granularities. To mitigate the impact of occlusion, we introduce the Similar Feature Filtering (SFF) attention module at the pixel level, using graph convolution to adaptively select occluded regions, thereby enhancing retrieval accuracy by filtering out irrelevant parts. Combining the reverse pyramid structure with the pixel-level attention module strengthens adaptability to complex scenes, guides multi-granularity feature learning, and effectively handles various occlusion scenarios. RPAG achieved Rank-1 accuracies of 96.2%, 93.2%, 88.7%, and 73.2% on the Market1501, DukeMTMC-ReID, MSMT17, and Occluded-Duke datasets, respectively.","PeriodicalId":509295,"journal":{"name":"International Journal of Cognitive Informatics and Natural Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reverse Pyramid Attention Guidance Network for Person Re-Identification\",\"authors\":\"Jiang Liu, Wei Bai, Yun Hui\",\"doi\":\"10.4018/ijcini.349982\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Person re-identification aims to retrieve pedestrians with the same identity across different cameras. However, current methods increase attention to interfering regions when dealing with complex backgrounds and occlusion, especially in the presence of similar interfering features. To enhance the robustness of the model, we propose the Reverse Pyramid Attention Guidance (RPAG) network, using a reverse pyramid structure to learn features at multiple granularities. To mitigate the impact of occlusion, we introduce the Similar Feature Filtering (SFF) attention module at the pixel level, using graph convolution to adaptively select occluded regions, thereby enhancing retrieval accuracy by filtering out irrelevant parts. Combining the reverse pyramid structure with the pixel-level attention module strengthens adaptability to complex scenes, guides multi-granularity feature learning, and effectively handles various occlusion scenarios. RPAG achieved Rank-1 accuracies of 96.2%, 93.2%, 88.7%, and 73.2% on the Market1501, DukeMTMC-ReID, MSMT17, and Occluded-Duke datasets, respectively.\",\"PeriodicalId\":509295,\"journal\":{\"name\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/ijcini.349982\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Informatics and Natural Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijcini.349982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reverse Pyramid Attention Guidance Network for Person Re-Identification
Person re-identification aims to retrieve pedestrians with the same identity across different cameras. However, current methods increase attention to interfering regions when dealing with complex backgrounds and occlusion, especially in the presence of similar interfering features. To enhance the robustness of the model, we propose the Reverse Pyramid Attention Guidance (RPAG) network, using a reverse pyramid structure to learn features at multiple granularities. To mitigate the impact of occlusion, we introduce the Similar Feature Filtering (SFF) attention module at the pixel level, using graph convolution to adaptively select occluded regions, thereby enhancing retrieval accuracy by filtering out irrelevant parts. Combining the reverse pyramid structure with the pixel-level attention module strengthens adaptability to complex scenes, guides multi-granularity feature learning, and effectively handles various occlusion scenarios. RPAG achieved Rank-1 accuracies of 96.2%, 93.2%, 88.7%, and 73.2% on the Market1501, DukeMTMC-ReID, MSMT17, and Occluded-Duke datasets, respectively.