{"title":"Stronger Baseline for Vehicle Re-Identification in the Wild","authors":"Chih-Chung Hsu, Cing-Hao Hung, Chih-Yu Jian, Yi-Xiu Zhuang","doi":"10.1109/VCIP47243.2019.8965867","DOIUrl":null,"url":null,"abstract":"Recently, re-identification tasks in computer vision field draw attention. Vehicle re-identification can be used to find the suspect car (target) from a vast surveillance video dataset. One of the most critical issues in the vehicle re-identification task is how to learn the effective feature representation. In general, pairwise learning such as the contrastive and triplet loss functions is adopted to learn the discriminative feature based on the convolution neural network. A good backbone network will lead to a significant improvement in the car re-identification task. In this paper, a stronger baseline method is proposed to achieve a better feature representation ability. First, we integrate the shift-invariant convolutional neural network with ResNet backbone to enhance the consistency feature learning. Afterward, a multi-layer feature fusion module is proposed to incorporate the middle- and high-level features to further improve the performance of car re-identification. Experimental results demonstrated that the proposed stronger baseline method achieves state-of-the-art performance in terms of mean averaging precision.","PeriodicalId":388109,"journal":{"name":"2019 IEEE Visual Communications and Image Processing (VCIP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP47243.2019.8965867","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, re-identification tasks in computer vision field draw attention. Vehicle re-identification can be used to find the suspect car (target) from a vast surveillance video dataset. One of the most critical issues in the vehicle re-identification task is how to learn the effective feature representation. In general, pairwise learning such as the contrastive and triplet loss functions is adopted to learn the discriminative feature based on the convolution neural network. A good backbone network will lead to a significant improvement in the car re-identification task. In this paper, a stronger baseline method is proposed to achieve a better feature representation ability. First, we integrate the shift-invariant convolutional neural network with ResNet backbone to enhance the consistency feature learning. Afterward, a multi-layer feature fusion module is proposed to incorporate the middle- and high-level features to further improve the performance of car re-identification. Experimental results demonstrated that the proposed stronger baseline method achieves state-of-the-art performance in terms of mean averaging precision.