Inter-Intra Camera Identity Learning for Person Re-Identification with Training in Single Camera

2023 IEEE International Conference on Multimedia and Expo (ICME) Pub Date : 2023-07-01 DOI:10.1109/ICME55011.2023.00414

Guoqing Zhang, Zhiyuan Luo, Weisi Lin, Xuan Jing

{"title":"Inter-Intra Camera Identity Learning for Person Re-Identification with Training in Single Camera","authors":"Guoqing Zhang, Zhiyuan Luo, Weisi Lin, Xuan Jing","doi":"10.1109/ICME55011.2023.00414","DOIUrl":null,"url":null,"abstract":"Traditional person re-identification (re-ID) methods generally rely on inter-camera person images to smooth the domain disparities between cameras. However, collecting and annotating a large number of inter-camera identities is extremely difficult and time-consuming, and this makes it hard to deploy person re-ID systems in new locations. To tackle this challenge, this paper studies the single-camera-training (SCT) setting where every person in the training set only appears in one camera. In this work, we design a novel inter-intra camera identity learning (I2CIL) framework to effectively address the SCT person re-ID. Specifically, (i) we design a Dual-Branch Identity Learning (DBIL) network consisting of inter-camera and intra-camera learning branches to learn person ID discriminative information. The former learns camera-irrelevant feature representations by constraining the distance of inter-camera negative sample pairs closer than the distance of intra-camera negative sample pairs. The latter focuses on pulling the distance of intra-camera positive sample pairs closer and pushing the distance of intra-camera negative sample pairs further, partially alleviating weak ID discrimination caused by the lack of inter-camera annotations. (ii) We design a Mixed-Sampling Joint Learning (MSJL) strategy, which is capable to capture inter- and intra-camera samples and independently accomplish the inter- and intra-camera learning tasks at the same time, avoiding the mutual interference between the two tasks. Extensive experiments on two public SCT datasets prove the superiority of the proposed approach.","PeriodicalId":321830,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo (ICME)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Multimedia and Expo (ICME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME55011.2023.00414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional person re-identification (re-ID) methods generally rely on inter-camera person images to smooth the domain disparities between cameras. However, collecting and annotating a large number of inter-camera identities is extremely difficult and time-consuming, and this makes it hard to deploy person re-ID systems in new locations. To tackle this challenge, this paper studies the single-camera-training (SCT) setting where every person in the training set only appears in one camera. In this work, we design a novel inter-intra camera identity learning (I2CIL) framework to effectively address the SCT person re-ID. Specifically, (i) we design a Dual-Branch Identity Learning (DBIL) network consisting of inter-camera and intra-camera learning branches to learn person ID discriminative information. The former learns camera-irrelevant feature representations by constraining the distance of inter-camera negative sample pairs closer than the distance of intra-camera negative sample pairs. The latter focuses on pulling the distance of intra-camera positive sample pairs closer and pushing the distance of intra-camera negative sample pairs further, partially alleviating weak ID discrimination caused by the lack of inter-camera annotations. (ii) We design a Mixed-Sampling Joint Learning (MSJL) strategy, which is capable to capture inter- and intra-camera samples and independently accomplish the inter- and intra-camera learning tasks at the same time, avoiding the mutual interference between the two tasks. Extensive experiments on two public SCT datasets prove the superiority of the proposed approach.

查看原文本刊更多论文

基于单摄像机训练的人再识别的摄像机内识别学习

传统的人物再识别(re-ID)方法通常依靠相机间的人物图像来平滑相机间的域差异。然而，收集和标注大量的摄像头间身份是非常困难和耗时的，这使得在新的地点部署人员重新识别系统变得困难。为了解决这个问题，本文研究了单摄像机训练(SCT)设置，其中训练集中的每个人只出现在一个摄像机中。在这项工作中，我们设计了一个新的相机内部身份学习(I2CIL)框架来有效地解决SCT人员重新识别问题。具体来说，(i)我们设计了一个双分支身份学习(DBIL)网络，该网络由相机间和相机内的学习分支组成，用于学习人的身份识别信息。前者通过约束相机间负样本对的距离小于相机内负样本对的距离来学习与相机无关的特征表示。后者侧重于拉近相机内正样本对的距离，拉近相机内负样本对的距离，部分缓解由于缺乏相机间注释而导致的弱ID辨别。(ii)我们设计了一种混合采样联合学习(MSJL)策略，该策略能够捕获相机间和相机内的样本，同时独立完成相机间和相机内的学习任务，避免了两个任务之间的相互干扰。在两个公开的SCT数据集上的大量实验证明了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Multimedia and Expo (ICME)

自引率

0.00%

发文量