基于多模态训练的室内人再识别

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-04-16 DOI:10.1109/JIOT.2025.3561213

Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian

{"title":"基于多模态训练的室内人再识别","authors":"Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian","doi":"10.1109/JIOT.2025.3561213","DOIUrl":null,"url":null,"abstract":"Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 14","pages":"26289-26302"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust Indoor Person Re-Identification With Multimodal Training\",\"authors\":\"Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian\",\"doi\":\"10.1109/JIOT.2025.3561213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 14\",\"pages\":\"26289-26302\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10966040/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10966040/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

现有的人员再识别（ReID）方法主要依靠图像和视频来匹配跨摄像机的人员，但摄像机捕获的视觉数据容易受到环境干扰（例如光照和遮挡）或个人外观变化的影响，导致在此类场景下的性能下降。同时，Wi-Fi网络的普及使得探测请求可以被捕获用于移动传感应用，如人群计数和轨迹估计。然而，现代设备采用的MAC地址随机化技术打破了探测请求的关联，并对这些应用程序的功能产生了不利影响。在本文中，我们提出了MaRPA，这是第一个结合视频和Wi-Fi探测请求的多模式训练方法，同时促进探测请求关联和人员ReID任务。MaRPA首先通过对比学习模型区分两两探测请求帧。然后通过从位置和视觉方面探索其相似性来匹配视频和探针请求序列。匹配的视频和探测请求提供了互补的信息，并为这两个任务生成了更健壮的特性。为了评估MaRPA，我们提供了一个包含同步视频和探测请求数据的新数据集，用于探测请求关联和人员ReID。实验结果证明了该方法的有效性。对于探测请求关联，识别准确率达到85%，V-measure得分达到0.90；对于人的ReID，它达到了75.8%的平均精度和90.6%的Rank-1，比目前最先进的基于视频的ReID方法提高了40%以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Robust Indoor Person Re-Identification With Multimodal Training

Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.