LSTIF: Long-short Temporal Information Fusion Architecture for Video-based Person Re-identification

Xingzhe Sun, Shanna Zhuang, Zhengyou Wang
{"title":"LSTIF:Long-short Temporal Information Fusion Architecture for Video-based Person Re-identification","authors":"Xingzhe Sun, Shanna Zhuang, Zhengyou Wang","doi":"10.1109/ICCEAI52939.2021.00027","DOIUrl":null,"url":null,"abstract":"Person re-identification is a major application of computer vision in reality. Since the data obtained by monitoring in real life is often in video format, and the walking poses of pedestrians are different, in addition to the appearance of pedestrians, how to obtain the motion features of pedestrians, is extremely important for video-based person re-identification. Therefore, for the temporal information of the video, we propose a Long-short Temporal Information Fusion (LSTIF) network. We aggregate temporal information from two perspectives, short-term features containing detailed information and long-term features containing global information. Simultaneously, in order to reduce the amount of calculation, this network also uses non-local blocks, and extend the outpu feature map to the same size as the input, which is convenient for calculation. This paper verifies the effectiveness of our method on two commonly used datasets iLIDS-VID and DukeMTMC-VideoReID.","PeriodicalId":331409,"journal":{"name":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEAI52939.2021.00027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Person re-identification is a major real-world application of computer vision. Since surveillance data is usually captured as video, and pedestrians walk with varying poses, capturing pedestrians' motion features, in addition to their appearance, is extremely important for video-based person re-identification. To exploit the temporal information in video, we propose a Long-short Temporal Information Fusion (LSTIF) network. We aggregate temporal information from two perspectives: short-term features that carry detailed information and long-term features that carry global information. At the same time, to reduce the amount of computation, the network uses non-local blocks and extends the output feature map to the same size as the input, which is convenient for subsequent computation. This paper verifies the effectiveness of our method on two commonly used datasets, iLIDS-VID and DukeMTMC-VideoReID.
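The abstract names two mechanisms without implementation details: a non-local block whose output is projected back to the input channel size (so it can be added residually), and a fusion of short-term and long-term temporal features. The sketch below is purely illustrative, not the authors' code: the non-local block follows the standard embedded-Gaussian formulation, while `long_short_fusion`, its window size `k`, and the concatenation scheme are hypothetical simplifications assumed here for clarity.

```python
import numpy as np

def softmax(a, axis=-1):
    # Numerically stable softmax.
    e = np.exp(a - a.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def non_local_block(x, w_theta, w_phi, w_g, w_z):
    """Embedded-Gaussian non-local block over N positions.

    x: (N, C) features; w_theta/w_phi/w_g: (C, C//2) projections that
    reduce channels to cut computation; w_z: (C//2, C) expands the
    response back to the input size so a residual add is possible.
    """
    theta = x @ w_theta            # (N, C/2) "query" embeddings
    phi = x @ w_phi                # (N, C/2) "key" embeddings
    g = x @ w_g                    # (N, C/2) "value" embeddings
    attn = softmax(theta @ phi.T, axis=-1)  # (N, N) pairwise affinities
    y = attn @ g                   # (N, C/2) aggregated response
    return x + y @ w_z             # restore (N, C), residual connection

def long_short_fusion(frames, k=2):
    """Hypothetical fusion of per-frame features, shape (T, C).

    Short-term: means over sliding windows of k adjacent frames
    (local detail); long-term: mean over all T frames (global view);
    the two pooled descriptors are concatenated into a (2C,) vector.
    """
    T, C = frames.shape
    short = np.stack([frames[t:t + k].mean(axis=0)
                      for t in range(T - k + 1)])
    long_term = frames.mean(axis=0)
    return np.concatenate([short.mean(axis=0), long_term])
```

With a zero `w_z`, the non-local block reduces to the identity, which illustrates why the final expansion matrix makes the block safe to insert into an existing backbone.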