Hongkai Hu , Qiang Liu , Jing Peng , Fei Li , Yuyan Huang , Lupin Liu
{"title":"随机灰度补丁和自适应图通道关注的可见红外人再识别","authors":"Hongkai Hu , Qiang Liu , Jing Peng , Fei Li , Yuyan Huang , Lupin Liu","doi":"10.1016/j.jvcir.2025.104579","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread deployment of video surveillance systems has made person recognition technology essential across various fields, including smart surveillance, law enforcement, and criminal investigation. However, person re-identification (Re-ID) using single-modal images struggles in low-light or nighttime conditions. To address the cross-modal matching challenge between visible and infrared images, we propose a novel method. We introduce a random gray patch (RGP) module that simulates infrared images by converting visible image regions to grayscale, reducing modality discrepancy. Additionally, a non_local adaptive graph channel (NAGC) attention module captures long-range dependencies and adjusts feature channel importance. Finally, we introduce a cross-modal contrast loss, which optimizes feature distances between samples of the same identity across different modalities, further improving cross-modal matching performance. Experimental results on SYSU-MM01 and RegDB datasets show that our method significantly outperforms existing approaches.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"112 ","pages":"Article 104579"},"PeriodicalIF":3.1000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Random gray patch and adaptive graph channel attention for visible-infrared person re-identification\",\"authors\":\"Hongkai Hu , Qiang Liu , Jing Peng , Fei Li , Yuyan Huang , Lupin Liu\",\"doi\":\"10.1016/j.jvcir.2025.104579\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The widespread deployment of video surveillance systems has made person recognition technology essential across various fields, including smart surveillance, law enforcement, and criminal investigation. However, person re-identification (Re-ID) using single-modal images struggles in low-light or nighttime conditions. To address the cross-modal matching challenge between visible and infrared images, we propose a novel method. We introduce a random gray patch (RGP) module that simulates infrared images by converting visible image regions to grayscale, reducing modality discrepancy. Additionally, a non_local adaptive graph channel (NAGC) attention module captures long-range dependencies and adjusts feature channel importance. Finally, we introduce a cross-modal contrast loss, which optimizes feature distances between samples of the same identity across different modalities, further improving cross-modal matching performance. Experimental results on SYSU-MM01 and RegDB datasets show that our method significantly outperforms existing approaches.</div></div>\",\"PeriodicalId\":54755,\"journal\":{\"name\":\"Journal of Visual Communication and Image Representation\",\"volume\":\"112 \",\"pages\":\"Article 104579\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Visual Communication and Image Representation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1047320325001932\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001932","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Random gray patch and adaptive graph channel attention for visible-infrared person re-identification
The widespread deployment of video surveillance systems has made person recognition technology essential across various fields, including smart surveillance, law enforcement, and criminal investigation. However, person re-identification (Re-ID) using single-modal images struggles in low-light or nighttime conditions. To address the cross-modal matching challenge between visible and infrared images, we propose a novel method. We introduce a random gray patch (RGP) module that simulates infrared images by converting visible image regions to grayscale, reducing modality discrepancy. Additionally, a non_local adaptive graph channel (NAGC) attention module captures long-range dependencies and adjusts feature channel importance. Finally, we introduce a cross-modal contrast loss, which optimizes feature distances between samples of the same identity across different modalities, further improving cross-modal matching performance. Experimental results on SYSU-MM01 and RegDB datasets show that our method significantly outperforms existing approaches.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.