Accuracy and Performance Comparison of Video Action Recognition Approaches

2020 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2020-08-20 DOI:10.1109/HPEC43674.2020.9286249

Matthew Hutchinson, S. Samsi, W. Arcand, David Bestor, Bill Bergeron, C. Byun, Micheal Houle, M. Hubbell, Michael J. Jones, J. Kepner, Andrew Kirby, P. Michaleas, Lauren Milechin, J. Mullen, Andrew Prout, Antonio Rosa, A. Reuther, Charles Yee, V. Gadepally

{"title":"Accuracy and Performance Comparison of Video Action Recognition Approaches","authors":"Matthew Hutchinson, S. Samsi, W. Arcand, David Bestor, Bill Bergeron, C. Byun, Micheal Houle, M. Hubbell, Michael J. Jones, J. Kepner, Andrew Kirby, P. Michaleas, Lauren Milechin, J. Mullen, Andrew Prout, Antonio Rosa, A. Reuther, Charles Yee, V. Gadepally","doi":"10.1109/HPEC43674.2020.9286249","DOIUrl":null,"url":null,"abstract":"Over the past few years, there has been significant interest in video action recognition systems and models. However, direct comparison of accuracy and computational performance results remain clouded by differing training environments, hardware specifications, hyperparameters, pipelines, and inference methods. This article provides a direct comparison between fourteen “off-the-shelf” and state-of-the-art models by ensuring consistency in these training characteristics in order to provide readers with a meaningful comparison across different types of video action recognition algorithms. Accuracy of the models is evaluated using standard Top-1 and Top-5 accuracy metrics in addition to a proposed new accuracy metric. Additionally, we compare computational performance of distributed training from two to sixty-four GPUs on a state-of-the-art HPC system.","PeriodicalId":168544,"journal":{"name":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPEC43674.2020.9286249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Over the past few years, there has been significant interest in video action recognition systems and models. However, direct comparison of accuracy and computational performance results remain clouded by differing training environments, hardware specifications, hyperparameters, pipelines, and inference methods. This article provides a direct comparison between fourteen “off-the-shelf” and state-of-the-art models by ensuring consistency in these training characteristics in order to provide readers with a meaningful comparison across different types of video action recognition algorithms. Accuracy of the models is evaluated using standard Top-1 and Top-5 accuracy metrics in addition to a proposed new accuracy metric. Additionally, we compare computational performance of distributed training from two to sixty-four GPUs on a state-of-the-art HPC system.

查看原文本刊更多论文

视频动作识别方法的准确率和性能比较

在过去的几年中，人们对视频动作识别系统和模型产生了极大的兴趣。然而，由于不同的训练环境、硬件规格、超参数、管道和推理方法，对准确性和计算性能结果的直接比较仍然模糊不清。本文通过确保这些训练特征的一致性，对14种“现成的”和最先进的模型进行了直接比较，以便为读者提供不同类型视频动作识别算法之间有意义的比较。除了提出的新精度度量外，还使用标准的Top-1和Top-5精度度量来评估模型的精度。此外，我们在最先进的高性能计算系统上比较了2到64个gpu的分布式训练的计算性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量