{"title":"A Pedestrian Re-identification Algorithm Based on 3D Convolution and Non_Local Block","authors":"Xiaojun Bai, Feihu Jiang, Q. Zhao","doi":"10.1145/3532342.3532349","DOIUrl":null,"url":null,"abstract":"This paper applies deep learning to learn pedestrian feature representations for video-based pedestrian re-identification. To improve feature quality, a 3D convolution block is introduced as the backbone network to aggregate temporal and spatial features. To address human body occlusion in video frames, a Non_Local block is introduced to capture long-distance dependencies between frames and ultimately eliminate the impact of occlusion. The optimal scheme for embedding the 3D convolution and Non_Local blocks in the backbone network is determined experimentally, and the experiments show that this solution extracts rich pedestrian features from video frames, which helps improve re-identification accuracy.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"124 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Symposium on Signal Processing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3532342.3532349","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citation count: 1
Abstract
This paper applies deep learning to learn pedestrian feature representations for video-based pedestrian re-identification. To improve feature quality, a 3D convolution block is introduced as the backbone network to aggregate temporal and spatial features. To address human body occlusion in video frames, a Non_Local block is introduced to capture long-distance dependencies between frames and ultimately eliminate the impact of occlusion. The optimal scheme for embedding the 3D convolution and Non_Local blocks in the backbone network is determined experimentally, and the experiments show that this solution extracts rich pedestrian features from video frames, which helps improve re-identification accuracy.
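
The way a Non_Local block lets one frame draw on every other frame can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the commonly used embedded-Gaussian form of the non-local operation, with the learned embeddings replaced by identity mappings, and it treats each frame as a single feature vector instead of a full feature map. The names `non_local` and `clip` and the toy feature values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def non_local(frames):
    """Parameter-free non-local operation across frames (illustrative only).

    frames: list of T feature vectors, one per video frame, each of length C.
    Returns T vectors: each input plus a similarity-weighted average of
    all frames (the residual connection used by non-local blocks).
    """
    channels = len(frames[0])
    refined = []
    for query in frames:
        # Pairwise similarity of this frame to every frame, normalized
        # so the weights over all frames sum to 1.
        weights = softmax([dot(query, key) for key in frames])
        # Aggregate information from all frames, near or far in time.
        context = [
            sum(w * key[c] for w, key in zip(weights, frames))
            for c in range(channels)
        ]
        # Residual connection: keep the original feature, add the context.
        refined.append([q + ctx for q, ctx in zip(query, context)])
    return refined


# Hypothetical toy clip: the second frame stands in for an occluded frame
# whose feature differs from the other three.
clip = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 0.0]]
refined = non_local(clip)
```

In this toy clip, the refined feature of the second frame gains a nonzero first component, because the three unoccluded frames contribute to its context. A real block would add learned projections and operate on spatio-temporal feature maps.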