Optimizing Train-Test Data for Person Re-Identification in Real-World Applications

Herman G. J. Groot, Tunç Alkanat, E. Bondarev, P. D. With
{"title":"Optimizing Train-Test Data for Person Re-Identification in Real-World Applications","authors":"Herman G. J. Groot, Tunç Alkanat, E. Bondarev, P. D. With","doi":"10.1145/3523111.3523121","DOIUrl":null,"url":null,"abstract":"Person re-identification (re-ID) aims to recognize an identity in non-overlapping camera views. Recently, re-ID received increased attention due to the growth of deep learning and its prominent applications in the field of automated video surveillance. The performance of deep learning-based methods relies heavily on the quality of training datasets and protocols. Particularly, parameters associated to the train and test set construction affect the overall performance. However, public re-ID datasets usually come with a fixed set of parameters, which are partly suitable for optimizing re-ID applications. In this paper, we study dataset construction parameters to improve re-ID performance. To this end, we first experiment on the temporal subsampling rate of the sequence of bounding boxes. Second, an experiment is performed on the effects of bounding-box enlargement under various temporal sampling rates. Thirdly, we analyze how the optimal choice of such dataset design parameters change with the dataset size. The experiments reveal that a performance increase of 2.1% Rank-1 is possible over a state-of-the-art re-ID model when optimizing the dataset construction parameters, thereby increasing the state-of-the-art performance from 91.9% to 94.0% Rank-1 on the popular DukeMTMC-reID dataset. The obtained results are not specific for the applied model and likely generalize to others.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3523111.3523121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Person re-identification (re-ID) aims to recognize an identity in non-overlapping camera views. Recently, re-ID received increased attention due to the growth of deep learning and its prominent applications in the field of automated video surveillance. The performance of deep learning-based methods relies heavily on the quality of training datasets and protocols. Particularly, parameters associated to the train and test set construction affect the overall performance. However, public re-ID datasets usually come with a fixed set of parameters, which are partly suitable for optimizing re-ID applications. In this paper, we study dataset construction parameters to improve re-ID performance. To this end, we first experiment on the temporal subsampling rate of the sequence of bounding boxes. Second, an experiment is performed on the effects of bounding-box enlargement under various temporal sampling rates. Thirdly, we analyze how the optimal choice of such dataset design parameters change with the dataset size. The experiments reveal that a performance increase of 2.1% Rank-1 is possible over a state-of-the-art re-ID model when optimizing the dataset construction parameters, thereby increasing the state-of-the-art performance from 91.9% to 94.0% Rank-1 on the popular DukeMTMC-reID dataset. The obtained results are not specific for the applied model and likely generalize to others.
优化列车测试数据在现实世界中的再识别应用
人物再识别(re-ID)的目的是在不重叠的相机视图中识别身份。最近,由于深度学习的发展及其在自动视频监控领域的突出应用,重新识别受到了越来越多的关注。基于深度学习的方法的性能在很大程度上依赖于训练数据集和协议的质量。特别是,与训练和测试集结构相关的参数会影响整体性能。然而,公共re-ID数据集通常带有一组固定的参数,这在一定程度上适合于优化re-ID应用程序。在本文中,我们研究了数据集构建参数来提高re-ID性能。为此,我们首先对边界框序列的时间子采样率进行了实验。其次,对不同采样率下的边界盒放大效果进行了实验研究。第三,分析了数据集设计参数的最优选择随数据集大小的变化情况。实验表明,当优化数据集构建参数时,性能可能比最先进的re-ID模型提高2.1%,从而将流行的DukeMTMC-reID数据集上的最先进性能从91.9%提高到94.0% Rank-1。所获得的结果并不特定于所应用的模型,可能会推广到其他模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信