LaBelle: A Deep Learning APP that Helps You Learn Ballet

Signal & Image Processing Trends Pub Date : 2022-10-22 DOI:10.5121/csit.2022.121714

Sarah Fan, Kevin Guo, Yu Sun

{"title":"LaBelle: A Deep Learning APP that Helps You Learn Ballet","authors":"Sarah Fan, Kevin Guo, Yu Sun","doi":"10.5121/csit.2022.121714","DOIUrl":null,"url":null,"abstract":"Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.","PeriodicalId":170432,"journal":{"name":"Signal & Image Processing Trends","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal & Image Processing Trends","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2022.121714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.

查看原文本刊更多论文

LaBelle:一个深度学习应用程序，帮助你学习芭蕾

人体姿势估计已被证明在改善医疗保健，体育等现实世界应用中的多功能性[1]。正确的站位、姿势和动作是这些活动成功的关键。本文将解释深度学习移动芭蕾应用LaBelle[2]背后的研究过程。LaBelle拍摄了两个短视频:一个是老师的，一个是学生的。该应用程序利用MediaPipe Pose来识别、分析和存储两位舞者的姿势和动作数据，计算不同关节和主要身体部位之间产生的角度。该应用的AI模型使用K-means聚类算法为学生数据集和教师数据集创建一组聚类[3]。使用这两组聚类，LaBelle识别学生视频中的关键帧，并在教师聚类集中搜索匹配的属性和帧集。它会评估配对帧之间的差异，并产生最终分数以及需要改进的姿势的反馈。我们提出了一种无监督的引导学习方法，提高了视频比较的效率，这种方法通常既费时又耗资源。这种有效的模型不仅可以用于舞蹈，也可以用于体育和医学(物理治疗之类的活动)，在这些领域，姿势、形式和动作通常很难用肉眼追踪。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Signal & Image Processing Trends

自引率

0.00%

发文量