{"title":"基于emd的被遮挡人再识别局部匹配","authors":"Hoang-Anh Nguyen , Thuy-Binh Nguyen , Hong-Quan Nguyen , Thi-Lan Le","doi":"10.1016/j.mlwa.2025.100663","DOIUrl":null,"url":null,"abstract":"<div><div>Person re-identification (Re-ID) is a vital computer vision task focused on matching images of a person of interest as they move across multiple non-overlapping cameras. Thanks to advancements in deep learning models, numerous important milestones have been achieved in the field of person Re-ID. Recent efforts have concentrated on addressing a more realistic scenario where pedestrians are partially occluded. This trend indicates a promising future for the practical implementation of person Re-ID systems. This paper builds upon our previous work, which successfully addressed single-shot person Re-ID using local matching information. For this task, Earth Mover’s Distance (EMD) is employed as a metric to measure similarity between two distributions. To handle multi-shot Re-ID, the proposed framework integrates a feature block, adapting the single-shot methodology to a multi-shot setting. Unlike conventional person Re-ID methods that employ a manually determined images of person, the proposed framework takes a query tracklet as input, which is automatically generated through human detection and tracking steps. To evaluate the proposed method, FAPR dataset (Fully Automated Person ReID) is used. This dataset is one of the few publicly available datasets built specifically for an end-to-end person Re-ID system. Various scenarios are rigorously examined to demonstrate the effectiveness of the proposed framework, especially in challenging conditions with strong occlusion. Across eight experimental scenarios, the proposed method achieves matching rates at rank-1 ranging from 76.3% to 100%. These results underscore the robustness and efficacy of our approach. Our source code is made available at: <span><span>https://github.com/anhnhust/emd-person-reid</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"20 ","pages":"Article 100663"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EMD-based local matching for occluded person re-identification\",\"authors\":\"Hoang-Anh Nguyen , Thuy-Binh Nguyen , Hong-Quan Nguyen , Thi-Lan Le\",\"doi\":\"10.1016/j.mlwa.2025.100663\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Person re-identification (Re-ID) is a vital computer vision task focused on matching images of a person of interest as they move across multiple non-overlapping cameras. Thanks to advancements in deep learning models, numerous important milestones have been achieved in the field of person Re-ID. Recent efforts have concentrated on addressing a more realistic scenario where pedestrians are partially occluded. This trend indicates a promising future for the practical implementation of person Re-ID systems. This paper builds upon our previous work, which successfully addressed single-shot person Re-ID using local matching information. For this task, Earth Mover’s Distance (EMD) is employed as a metric to measure similarity between two distributions. To handle multi-shot Re-ID, the proposed framework integrates a feature block, adapting the single-shot methodology to a multi-shot setting. Unlike conventional person Re-ID methods that employ a manually determined images of person, the proposed framework takes a query tracklet as input, which is automatically generated through human detection and tracking steps. To evaluate the proposed method, FAPR dataset (Fully Automated Person ReID) is used. This dataset is one of the few publicly available datasets built specifically for an end-to-end person Re-ID system. Various scenarios are rigorously examined to demonstrate the effectiveness of the proposed framework, especially in challenging conditions with strong occlusion. Across eight experimental scenarios, the proposed method achieves matching rates at rank-1 ranging from 76.3% to 100%. These results underscore the robustness and efficacy of our approach. Our source code is made available at: <span><span>https://github.com/anhnhust/emd-person-reid</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":74093,\"journal\":{\"name\":\"Machine learning with applications\",\"volume\":\"20 \",\"pages\":\"Article 100663\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Machine learning with applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666827025000465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
人物再识别(Re-ID)是一项重要的计算机视觉任务,重点是匹配感兴趣的人在多个不重叠的摄像机中移动时的图像。由于深度学习模型的进步,在人身份识别领域取得了许多重要的里程碑。最近的努力集中在解决行人部分被遮挡的更现实的情况。这一趋势预示着个人身份再识别系统的实际实施前景光明。本文建立在我们之前的工作基础上,该工作成功地解决了使用本地匹配信息的单枪人Re-ID问题。在此任务中,采用了土动器距离(EMD)作为度量两个分布之间相似性的度量。为了处理多镜头Re-ID,提出的框架集成了一个特征块,使单镜头方法适应多镜头设置。与使用手动确定的人员图像的传统人员Re-ID方法不同,所提出的框架采用查询tracklet作为输入,该查询tracklet通过人工检测和跟踪步骤自动生成。为了评估所提出的方法,使用了FAPR数据集(Fully Automated Person ReID)。该数据集是专门为端到端人员Re-ID系统构建的少数公开可用的数据集之一。各种场景都经过严格检查,以证明所提出的框架的有效性,特别是在具有强遮挡的挑战性条件下。在8个实验场景中,该方法在rank-1上的匹配率为76.3% ~ 100%。这些结果强调了我们方法的稳健性和有效性。我们的源代码可以在https://github.com/anhnhust/emd-person-reid上获得。
EMD-based local matching for occluded person re-identification
Person re-identification (Re-ID) is a vital computer vision task focused on matching images of a person of interest as they move across multiple non-overlapping cameras. Thanks to advancements in deep learning models, numerous important milestones have been achieved in the field of person Re-ID. Recent efforts have concentrated on addressing a more realistic scenario where pedestrians are partially occluded. This trend indicates a promising future for the practical implementation of person Re-ID systems. This paper builds upon our previous work, which successfully addressed single-shot person Re-ID using local matching information. For this task, Earth Mover’s Distance (EMD) is employed as a metric to measure similarity between two distributions. To handle multi-shot Re-ID, the proposed framework integrates a feature block, adapting the single-shot methodology to a multi-shot setting. Unlike conventional person Re-ID methods that employ a manually determined images of person, the proposed framework takes a query tracklet as input, which is automatically generated through human detection and tracking steps. To evaluate the proposed method, FAPR dataset (Fully Automated Person ReID) is used. This dataset is one of the few publicly available datasets built specifically for an end-to-end person Re-ID system. Various scenarios are rigorously examined to demonstrate the effectiveness of the proposed framework, especially in challenging conditions with strong occlusion. Across eight experimental scenarios, the proposed method achieves matching rates at rank-1 ranging from 76.3% to 100%. These results underscore the robustness and efficacy of our approach. Our source code is made available at: https://github.com/anhnhust/emd-person-reid.