{"title":"Multi - Task Learning of Confounding Factors in Pose-Based Gait Recognition","authors":"Adrian Cosma, I. Radoi","doi":"10.1109/RoEduNet51892.2020.9324873","DOIUrl":null,"url":null,"abstract":"This paper proposes a method for performing gait-recognition using skeletons extracted from human pose-estimation networks. Gait is a powerful biometric feature that has been used successfully to identify people, even in the presence of confounding factors such as different view angles and carrying/clothing variations. While most methods make use of Gait Energy Images (GEIs), we propose MFINet, a novel method for processing a sequence of skeletons extracted from an available pre-trained human pose estimation network, that incorporates confounding factors in the decision process. Inspired by methods in the area of activity recognition, we used a skeleton image representation (TSSI) in our experiments to capture temporal dynamics, as well as the skeleton spatial structure. Based on an extensive evaluation on the popular gait-recognition CASIA-B dataset, we show that MFINet is performing better than existing state-of-the-art pose-based methods, obtaining an accuracy of over 85% in scenarios with the same angle for both gallery and probe sets.","PeriodicalId":140521,"journal":{"name":"2020 19th RoEduNet Conference: Networking in Education and Research (RoEduNet)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 19th RoEduNet Conference: Networking in Education and Research (RoEduNet)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RoEduNet51892.2020.9324873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 3
Abstract
This paper proposes a method for performing gait recognition using skeletons extracted from human pose-estimation networks. Gait is a powerful biometric feature that has been used successfully to identify people, even in the presence of confounding factors such as different view angles and carrying/clothing variations. While most methods make use of Gait Energy Images (GEIs), we propose MFINet, a novel method that incorporates the confounding factors into the decision process while processing a sequence of skeletons extracted from an available pre-trained human pose-estimation network. Inspired by methods in the area of activity recognition, we use a skeleton image representation (TSSI) in our experiments to capture the temporal dynamics as well as the spatial structure of the skeleton. Based on an extensive evaluation on the popular CASIA-B gait-recognition dataset, we show that MFINet performs better than existing state-of-the-art pose-based methods, obtaining an accuracy of over 85% in scenarios with the same view angle for both the gallery and probe sets.
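To make the pipeline concrete, the sketch below shows one way a TSSI-style (Tree Structure Skeleton Image) representation can be built from a sequence of 2D skeletons: joints are reordered along a depth-first traversal of the skeleton tree so that adjacent rows are physically connected, frames become columns, and the (x, y) coordinates become channels. This is not the authors' code; the joint order is a hypothetical COCO-style traversal used only for illustration.

```python
# Minimal sketch (not the paper's implementation) of a TSSI-style skeleton
# image: rows follow a depth-first traversal of the skeleton tree, columns
# are frames, channels hold the (x, y) joint coordinates.
import numpy as np

# Hypothetical depth-first traversal over 17 COCO keypoints; joints are
# revisited so that every pair of adjacent rows shares a skeleton edge.
TSSI_JOINT_ORDER = [0, 1, 3, 1, 0, 2, 4, 2, 0, 5, 7, 9, 7, 5, 11, 13, 15,
                    13, 11, 5, 0, 6, 8, 10, 8, 6, 12, 14, 16, 14, 12, 6, 0]

def skeletons_to_tssi(skeletons: np.ndarray) -> np.ndarray:
    """Convert a (num_frames, num_joints, 2) array of joint coordinates
    into a (len(TSSI_JOINT_ORDER), num_frames, 2) image-like tensor."""
    reordered = skeletons[:, TSSI_JOINT_ORDER, :]   # (T, J', 2)
    image = np.transpose(reordered, (1, 0, 2))      # (J', T, 2)
    # Rescale to [0, 255] so the tensor can be treated like an ordinary image.
    image = (image - image.min()) / (image.max() - image.min() + 1e-8) * 255.0
    return image.astype(np.float32)

# Example: 60 frames of 17 joints with random normalised coordinates.
fake_sequence = np.random.rand(60, 17, 2)
tssi_image = skeletons_to_tssi(fake_sequence)
print(tssi_image.shape)  # (33, 60, 2)
```

The multi-task aspect named in the title can be pictured as a shared encoder over such skeleton images with an identity output plus auxiliary outputs for the confounding factors. The PyTorch sketch below is a hypothetical stand-in for MFINet's actual architecture, using the CASIA-B label spaces (124 subjects, 11 view angles, 3 carrying/clothing conditions) purely as example head sizes.

```python
# Hypothetical multi-task head (not MFINet itself): a shared CNN encoder with
# an identity head and auxiliary heads for view angle and walking condition.
import torch
import torch.nn as nn

class MultiTaskGaitNet(nn.Module):
    def __init__(self, num_ids=124, num_angles=11, num_conditions=3):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(2, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.id_head = nn.Linear(32, num_ids)          # subject identity
        self.angle_head = nn.Linear(32, num_angles)    # view angle (confounder)
        self.cond_head = nn.Linear(32, num_conditions) # carrying/clothing (confounder)

    def forward(self, x):
        z = self.encoder(x)
        return self.id_head(z), self.angle_head(z), self.cond_head(z)

# Example forward pass on a batch of one TSSI image (channels-first layout).
model = MultiTaskGaitNet()
dummy = torch.randn(1, 2, 33, 60)
id_logits, angle_logits, cond_logits = model(dummy)
```

Training such a model with a weighted sum of the three cross-entropy losses is one plausible way to "incorporate confounding factors into the decision process"; the paper itself should be consulted for the exact MFINet architecture and loss.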