Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian
{"title":"基于多模态训练的室内人再识别","authors":"Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian","doi":"10.1109/JIOT.2025.3561213","DOIUrl":null,"url":null,"abstract":"Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 14","pages":"26289-26302"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust Indoor Person Re-Identification With Multimodal Training\",\"authors\":\"Can Su;Xinlei Xue;Lei Ma;Xiaolong Zhang;Wei Yan;Kaigui Bian\",\"doi\":\"10.1109/JIOT.2025.3561213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 14\",\"pages\":\"26289-26302\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10966040/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10966040/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Robust Indoor Person Re-Identification With Multimodal Training
Existing person re-identification (ReID) methods mainly rely on images and videos to match persons across cameras, yet visual data captured by cameras are vulnerable to environmental interferences (e.g., illumination and occlusion) or personal appearance changes, leading to performance degradation under such scenes. Meanwhile, the popularization of Wi-Fi networks has allowed probe requests to be captured for mobile sensing applications, such as crowd counting and trajectory estimation. However, the MAC address randomization technique adopted by modern devices breaks the association of probe requests and adversely affects the functionality of these applications. In this article, we propose MaRPA, the first multimodal training approach that incorporates both videos and Wi-Fi probe requests to simultaneously promote tasks of probe requests association and person ReID. MaRPA first distinguishes among pairwise probe request frames through a contrastive learning model. It then matches video and probe request sequences by exploring their similarities from the position and the vision aspects. Matched videos and probe requests provide complementary information and generate more robust features for both tasks. To evaluate MaRPA, we contribute a new dataset containing synchronous videos and probe requests data for probe requests association and person ReID. Experimental results demonstrate the effectiveness of our approach. For probe requests association, it achieves >85% discrimination accuracy and >0.90 V-measure score; for person ReID, it achieves 75.8% mean average precision and 90.6% Rank-1, improving state-of-the-art video-based ReID methods by over 40%.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.