{"title":"基于视频的人物再识别多尺度特征融合网络","authors":"Penggao Liu, M. Ai, Guozhi Shan","doi":"10.1109/ICETCI53161.2021.9563438","DOIUrl":null,"url":null,"abstract":"In recent years, person re-identification technology has been greatly developed. Image-based person re-identification algorithms have achieved excellent performance on open source datasets. In contrast, the development of video-based person re-identification technology is relatively backward. At present, the main research work of video-based person re-identification algorithms is focused on the processing of temporal information in the picture sequence. Complex appearance features are not effective when performing temporal fusion, so the frame-level features used are almost based on global features. This paper proposes a video person re-identification model based on multi-scale feature fusion. The multi-scale feature fusion of the model is embodied in the design of the frame-level feature extraction module. This module extracts the frame-level features of different scales, and then catenates them together into vectors, which not only improves the feature discrimination degree, but also makes the catenated frame-level features carry out effective temporal fusion, and the test results on the Mars dataset have reached a competitive level. At the same time, a series of comparative experiments were carried out on the model parameters to achieve further optimization of performance.","PeriodicalId":170858,"journal":{"name":"2021 IEEE International Conference on Electronic Technology, Communication and Information (ICETCI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Scale Feature Fusion Network for Video-Based Person Re-Identification\",\"authors\":\"Penggao Liu, M. Ai, Guozhi Shan\",\"doi\":\"10.1109/ICETCI53161.2021.9563438\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, person re-identification technology has been greatly developed. Image-based person re-identification algorithms have achieved excellent performance on open source datasets. In contrast, the development of video-based person re-identification technology is relatively backward. At present, the main research work of video-based person re-identification algorithms is focused on the processing of temporal information in the picture sequence. Complex appearance features are not effective when performing temporal fusion, so the frame-level features used are almost based on global features. This paper proposes a video person re-identification model based on multi-scale feature fusion. The multi-scale feature fusion of the model is embodied in the design of the frame-level feature extraction module. This module extracts the frame-level features of different scales, and then catenates them together into vectors, which not only improves the feature discrimination degree, but also makes the catenated frame-level features carry out effective temporal fusion, and the test results on the Mars dataset have reached a competitive level. 
At the same time, a series of comparative experiments were carried out on the model parameters to achieve further optimization of performance.\",\"PeriodicalId\":170858,\"journal\":{\"name\":\"2021 IEEE International Conference on Electronic Technology, Communication and Information (ICETCI)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Electronic Technology, Communication and Information (ICETCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICETCI53161.2021.9563438\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electronic Technology, Communication and Information (ICETCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETCI53161.2021.9563438","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Scale Feature Fusion Network for Video-Based Person Re-Identification
In recent years, person re-identification technology has developed rapidly. Image-based person re-identification algorithms have achieved excellent performance on open-source datasets, whereas video-based person re-identification still lags behind. Current research on video-based methods focuses mainly on processing the temporal information in an image sequence. Complex appearance features are difficult to fuse effectively over time, so the frame-level features in use are almost exclusively global features. This paper proposes a video person re-identification model based on multi-scale feature fusion. The multi-scale fusion is embodied in the design of the frame-level feature extraction module, which extracts frame-level features at different scales and concatenates them into a single vector. This not only improves the discriminative power of the features but also allows the concatenated frame-level features to be fused effectively along the temporal dimension; results on the MARS dataset reach a competitive level. In addition, a series of comparative experiments on the model parameters was carried out to further optimize performance.
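To make the pipeline described in the abstract concrete, the following is a minimal sketch of the general idea: per-frame features are pooled at several spatial scales, concatenated into one vector per frame, and then fused across time. The abstract does not specify the backbone, the pooling scales, the temporal fusion operator, or any module names, so the ResNet-50 backbone, the (1, 2, 4) pooling grid, the temporal average pooling, and all class/variable names below are illustrative assumptions, not the authors' architecture.

```python
# Hedged sketch of multi-scale frame-level feature extraction followed by
# temporal fusion for video re-ID. Backbone, scales, fusion operator, and
# identity count are assumptions; only the overall structure follows the abstract.
import torch
import torch.nn as nn
import torchvision.models as models


class MultiScaleFrameFeatures(nn.Module):
    """Per-frame features pooled at multiple spatial scales and concatenated."""

    def __init__(self, pool_scales=(1, 2, 4)):
        super().__init__()
        backbone = models.resnet50(weights=None)
        # Keep the convolutional trunk; drop the global pooling and fc head.
        self.backbone = nn.Sequential(*list(backbone.children())[:-2])
        self.pools = nn.ModuleList(nn.AdaptiveAvgPool2d(s) for s in pool_scales)

    def forward(self, frames):                 # frames: (B*T, 3, H, W)
        fmap = self.backbone(frames)           # (B*T, 2048, h, w)
        feats = [p(fmap).flatten(1) for p in self.pools]  # one vector per scale
        return torch.cat(feats, dim=1)         # concatenated multi-scale vector


class VideoReIDModel(nn.Module):
    """Frame-level multi-scale features followed by simple temporal fusion."""

    def __init__(self, num_ids=625, pool_scales=(1, 2, 4)):
        super().__init__()
        self.frame_encoder = MultiScaleFrameFeatures(pool_scales)
        feat_dim = 2048 * sum(s * s for s in pool_scales)
        self.classifier = nn.Linear(feat_dim, num_ids)

    def forward(self, clips):                  # clips: (B, T, 3, H, W)
        b, t = clips.shape[:2]
        frame_feats = self.frame_encoder(clips.flatten(0, 1))  # (B*T, D)
        frame_feats = frame_feats.view(b, t, -1)
        clip_feat = frame_feats.mean(dim=1)    # temporal fusion: average pooling
        return clip_feat, self.classifier(clip_feat)


if __name__ == "__main__":
    # Example: a batch of 2 tracklets, each with 4 frames of size 256x128.
    model = VideoReIDModel()
    clip_feat, logits = model(torch.randn(2, 4, 3, 256, 128))
    print(clip_feat.shape, logits.shape)       # (2, 43008) and (2, 625)
```

In this sketch, temporal fusion is plain averaging of the concatenated frame vectors; the paper's point is that the richer multi-scale frame representation remains compatible with such temporal fusion, whereas more complex appearance features often are not.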