{"title":"基于实时二维单姿态估计框架的时间卷积神经网络活动识别模型","authors":"Devansh Srivastav, A. Bajpai, Abhishek Singhal","doi":"10.1109/confluence52989.2022.9734159","DOIUrl":null,"url":null,"abstract":"Human pose estimation and human activity recognition are two of the most researched domains in computer vision applications. Surveillance, human-computer interaction and video retrieval, all benefit from robust solutions of this domain. Due to disparities in inter-personal parameters, mobility efficiency, and recording configurations, the process is challenging. In this paper, deep learning algorithms are used for both pose estimation and activity recognition models. The pose estimation model generates an array of objects for each keypoint with their coordinated over a specific interval. This time series data is then consumed by the activity recognition model which was trained using a temporal convolutional neural network. The attained accuracy of the model was found to be 92.7% with a loss of 0.19.","PeriodicalId":261941,"journal":{"name":"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Temporal Convolutional Neural Network Based Activity Recognition Model using a Real-Time Two-Dimensional Single Pose Estimation Framework\",\"authors\":\"Devansh Srivastav, A. Bajpai, Abhishek Singhal\",\"doi\":\"10.1109/confluence52989.2022.9734159\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human pose estimation and human activity recognition are two of the most researched domains in computer vision applications. Surveillance, human-computer interaction and video retrieval, all benefit from robust solutions of this domain. Due to disparities in inter-personal parameters, mobility efficiency, and recording configurations, the process is challenging. In this paper, deep learning algorithms are used for both pose estimation and activity recognition models. The pose estimation model generates an array of objects for each keypoint with their coordinated over a specific interval. This time series data is then consumed by the activity recognition model which was trained using a temporal convolutional neural network. The attained accuracy of the model was found to be 92.7% with a loss of 0.19.\",\"PeriodicalId\":261941,\"journal\":{\"name\":\"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/confluence52989.2022.9734159\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/confluence52989.2022.9734159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Temporal Convolutional Neural Network Based Activity Recognition Model using a Real-Time Two-Dimensional Single Pose Estimation Framework
Human pose estimation and human activity recognition are two of the most researched domains in computer vision applications. Surveillance, human-computer interaction and video retrieval, all benefit from robust solutions of this domain. Due to disparities in inter-personal parameters, mobility efficiency, and recording configurations, the process is challenging. In this paper, deep learning algorithms are used for both pose estimation and activity recognition models. The pose estimation model generates an array of objects for each keypoint with their coordinated over a specific interval. This time series data is then consumed by the activity recognition model which was trained using a temporal convolutional neural network. The attained accuracy of the model was found to be 92.7% with a loss of 0.19.