Comparison of CNN-Based Methods for Yoga Pose Classification

Turkish Journal of Engineering and Environmental Sciences Pub Date : 2023-05-11 DOI:10.31127/tuje.1275826

Vildan ATALAY AYDIN

{"title":"Comparison of CNN-Based Methods for Yoga Pose Classification","authors":"Vildan ATALAY AYDIN","doi":"10.31127/tuje.1275826","DOIUrl":null,"url":null,"abstract":"Yoga is an exercise developed in ancient India. People perform yoga in order to have mental, physical, and spiritual benefits. While yoga helps build strength in the mind and body, incorrect postures might result in serious injuries. Therefore, yoga exercisers need either an expert or a platform to receive feedback on their performance. Since access to experts is not an option for everyone, a system to provide feedback on the yoga poses is required. To this end, commercial products such as smart yoga mats and smart pants are produced; Kinect cameras, sensors, and wearable devices are used. However, these solutions are either uncomfortable to wear or not affordable for everyone. Nonetheless, a system that employs computer vision techniques is a requirement. In this paper, we propose a deep-learning model for yoga pose classification, which is the first step of a quality assessment system. We introduce a wavelet-based model that first takes wavelet transform of input images. The acquired subbands, i.e., approximation, horizontal, vertical, and diagonal coefficients of the wavelet transform are then fed into separate convolutional neural networks (CNN). The obtained probability results for each group are fused in order to have the final yoga class prediction. A publicly available dataset with 5 yoga poses is used. Since the number of images in the dataset is not enough for a deep learning model, we also perform data augmentation to increase the number of images. We compare our results to a CNN model and the three models that employ the subbands separately. Results obtained using the proposed model outperforms the accuracy output achieved with the compared models.","PeriodicalId":23377,"journal":{"name":"Turkish Journal of Engineering and Environmental Sciences","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Engineering and Environmental Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31127/tuje.1275826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Yoga is an exercise developed in ancient India. People perform yoga in order to have mental, physical, and spiritual benefits. While yoga helps build strength in the mind and body, incorrect postures might result in serious injuries. Therefore, yoga exercisers need either an expert or a platform to receive feedback on their performance. Since access to experts is not an option for everyone, a system to provide feedback on the yoga poses is required. To this end, commercial products such as smart yoga mats and smart pants are produced; Kinect cameras, sensors, and wearable devices are used. However, these solutions are either uncomfortable to wear or not affordable for everyone. Nonetheless, a system that employs computer vision techniques is a requirement. In this paper, we propose a deep-learning model for yoga pose classification, which is the first step of a quality assessment system. We introduce a wavelet-based model that first takes wavelet transform of input images. The acquired subbands, i.e., approximation, horizontal, vertical, and diagonal coefficients of the wavelet transform are then fed into separate convolutional neural networks (CNN). The obtained probability results for each group are fused in order to have the final yoga class prediction. A publicly available dataset with 5 yoga poses is used. Since the number of images in the dataset is not enough for a deep learning model, we also perform data augmentation to increase the number of images. We compare our results to a CNN model and the three models that employ the subbands separately. Results obtained using the proposed model outperforms the accuracy output achieved with the compared models.

查看原文本刊更多论文

基于cnn的瑜伽姿势分类方法比较

瑜伽是古印度发展起来的一种运动。人们练习瑜伽是为了获得心理、身体和精神上的好处。虽然瑜伽有助于增强身心的力量，但不正确的姿势可能会导致严重的伤害。因此，瑜伽练习者需要一个专家或一个平台来接收他们的表现反馈。因为并不是每个人都能获得专家的帮助，所以需要一个系统来提供关于瑜伽姿势的反馈。为此，生产出智能瑜伽垫、智能裤子等商业产品;使用Kinect摄像头、传感器和可穿戴设备。然而，这些解决方案要么穿着不舒服，要么不是每个人都负担得起。尽管如此，一个采用计算机视觉技术的系统是必需的。在本文中，我们提出了一个瑜伽姿势分类的深度学习模型，这是质量评估系统的第一步。我们引入了一种基于小波的模型，该模型首先对输入图像进行小波变换。获得的子带，即小波变换的近似、水平、垂直和对角系数，然后被馈送到单独的卷积神经网络(CNN)中。将得到的每组概率结果进行融合，从而得到最终的瑜伽课预测。使用了一个公开的数据集，其中包含5个瑜伽姿势。由于数据集中的图像数量不足以用于深度学习模型，我们还执行数据增强以增加图像数量。我们将我们的结果与CNN模型和分别使用子带的三种模型进行比较。使用该模型获得的结果优于使用比较模型获得的精度输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Turkish Journal of Engineering and Environmental Sciences

自引率

0.00%

发文量