Human action recognition with hidden Markov models and neural network derived poses
E. Gedat, Pascal Fechner, Richard Fiebelkorn, R. Vandenhouten
2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY), September 2017. DOI: 10.1109/SISY.2017.8080544
A human action recognition method is introduced that detects a set of actions in videos by temporally expanding, with hidden Markov models, a frame-wise pose detection performed by an artificial neural network. The method was set up and tested using eleven actions from the MOCAP motion capture database, comprising 3,947 frames. A pose alphabet of fourteen relevant poses was defined to be learned by an artificial neural network, which was trained with the skeletons and a manual pose classification of 370 key frame images from the motions, resulting in an accuracy of 83.5 %. Three actions prevalent in the used motions were chosen to be recognized through the interplay of one separate hidden Markov model per action. From the output of the trained artificial neural network for all 1,891 frames of seven of the eleven motions, together with a manual pose and action classification, the matrices of the hidden Markov models for the actions were calculated. A tailored maximum likelihood estimation based on the Viterbi algorithm generated an action proposal for each frame. The resulting overall performance for action recognition was 83.2 % precision and 83.7 % recall.
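To make the described pipeline concrete (a neural network labels each frame with a pose from a fixed alphabet, one hidden Markov model is kept per action, and a Viterbi-based maximum likelihood decision proposes an action per frame), the Python sketch below scores a pose sequence against per-action HMMs. It is only an illustration under stated assumptions: the sliding window, the randomly drawn HMM parameters, the action names, and the number of hidden states are placeholders and not the paper's actual method or values; only the fourteen-pose alphabet size is taken from the abstract.

```python
# Minimal sketch: per-action HMMs scored with the Viterbi algorithm over a
# sequence of pose labels produced frame-by-frame by a pose classifier.
# All matrices, sizes (except the 14-pose alphabet), window length, and action
# names are illustrative assumptions, not the paper's parameters.
import numpy as np

def viterbi_log_likelihood(pose_seq, log_pi, log_A, log_B):
    """Log-likelihood of the best hidden-state path (Viterbi score) for a
    sequence of observed pose indices under one action HMM."""
    # delta[s] = best log-probability of any path ending in state s
    delta = log_pi + log_B[:, pose_seq[0]]
    for obs in pose_seq[1:]:
        # Best predecessor for each next state, plus the emission term.
        delta = np.max(delta[:, None] + log_A, axis=0) + log_B[:, obs]
    return np.max(delta)

def propose_actions(pose_seq, hmms, window=15):
    """Slide a short window over the pose sequence and, for each frame, propose
    the action whose HMM gives the highest Viterbi score on that window."""
    proposals = []
    for t in range(len(pose_seq)):
        segment = pose_seq[max(0, t - window + 1): t + 1]
        scores = {name: viterbi_log_likelihood(segment, *params)
                  for name, params in hmms.items()}
        proposals.append(max(scores, key=scores.get))
    return proposals

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_poses = 14    # size of the pose alphabet, as in the paper
    n_states = 3    # illustrative number of hidden states per action HMM

    def random_hmm():
        # Random start, transition, and emission probabilities in log space.
        pi = rng.dirichlet(np.ones(n_states))
        A = rng.dirichlet(np.ones(n_states), size=n_states)
        B = rng.dirichlet(np.ones(n_poses), size=n_states)
        return np.log(pi), np.log(A), np.log(B)

    # One separate HMM per action, as in the paper; action names are examples.
    hmms = {"walk": random_hmm(), "sit": random_hmm(), "wave": random_hmm()}

    # Stand-in for the neural network output: one predicted pose index per frame.
    pose_sequence = rng.integers(0, n_poses, size=60)
    print(propose_actions(pose_sequence, hmms)[:10])
```

In a real setting the HMM matrices would be estimated from the labelled training frames (as the abstract describes) rather than drawn at random, and the pose sequence would come from the trained pose classifier instead of a random generator.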