{"title":"EMD-based local matching for occluded person re-identification","authors":"Hoang-Anh Nguyen , Thuy-Binh Nguyen , Hong-Quan Nguyen , Thi-Lan Le","doi":"10.1016/j.mlwa.2025.100663","DOIUrl":null,"url":null,"abstract":"<div><div>Person re-identification (Re-ID) is a vital computer vision task focused on matching images of a person of interest as they move across multiple non-overlapping cameras. Thanks to advancements in deep learning models, numerous important milestones have been achieved in the field of person Re-ID. Recent efforts have concentrated on addressing a more realistic scenario where pedestrians are partially occluded. This trend indicates a promising future for the practical implementation of person Re-ID systems. This paper builds upon our previous work, which successfully addressed single-shot person Re-ID using local matching information. For this task, Earth Mover’s Distance (EMD) is employed as a metric to measure similarity between two distributions. To handle multi-shot Re-ID, the proposed framework integrates a feature block, adapting the single-shot methodology to a multi-shot setting. Unlike conventional person Re-ID methods that employ a manually determined images of person, the proposed framework takes a query tracklet as input, which is automatically generated through human detection and tracking steps. To evaluate the proposed method, FAPR dataset (Fully Automated Person ReID) is used. This dataset is one of the few publicly available datasets built specifically for an end-to-end person Re-ID system. Various scenarios are rigorously examined to demonstrate the effectiveness of the proposed framework, especially in challenging conditions with strong occlusion. Across eight experimental scenarios, the proposed method achieves matching rates at rank-1 ranging from 76.3% to 100%. These results underscore the robustness and efficacy of our approach. Our source code is made available at: <span><span>https://github.com/anhnhust/emd-person-reid</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"20 ","pages":"Article 100663"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Person re-identification (Re-ID) is a vital computer vision task focused on matching images of a person of interest as they move across multiple non-overlapping cameras. Thanks to advancements in deep learning models, numerous important milestones have been achieved in the field of person Re-ID. Recent efforts have concentrated on addressing a more realistic scenario where pedestrians are partially occluded. This trend indicates a promising future for the practical implementation of person Re-ID systems. This paper builds upon our previous work, which successfully addressed single-shot person Re-ID using local matching information. For this task, Earth Mover’s Distance (EMD) is employed as a metric to measure similarity between two distributions. To handle multi-shot Re-ID, the proposed framework integrates a feature block, adapting the single-shot methodology to a multi-shot setting. Unlike conventional person Re-ID methods that employ a manually determined images of person, the proposed framework takes a query tracklet as input, which is automatically generated through human detection and tracking steps. To evaluate the proposed method, FAPR dataset (Fully Automated Person ReID) is used. This dataset is one of the few publicly available datasets built specifically for an end-to-end person Re-ID system. Various scenarios are rigorously examined to demonstrate the effectiveness of the proposed framework, especially in challenging conditions with strong occlusion. Across eight experimental scenarios, the proposed method achieves matching rates at rank-1 ranging from 76.3% to 100%. These results underscore the robustness and efficacy of our approach. Our source code is made available at: https://github.com/anhnhust/emd-person-reid.