{"title":"Skeleton-Based Motion Recognition for Labanotation Generation Based on the Fusion of Neural Networks","authors":"Jiasheng Du, Jiaji Wang, Jianpo Li","doi":"10.1002/cav.70073","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Labanotation is a scientific method for documenting dance movements that has been widely adopted globally. Existing methods for Labanotation action recognition perform poorly in handling complex movements and integrating spatiotemporal information. To address this, we propose a multi-branch spatiotemporal fusion network with attention mechanisms aimed at accurately recognizing Labanotation actions from motion capture data. Initially, we convert motion capture data into three-dimensional coordinates and extract skeleton vector features. Subsequently, we enhance feature representation by extracting temporal difference features and skeleton angle features from the skeleton vectors. These features are processed using gated recurrent units and residual networks to effectively integrate spatiotemporal information. Finally, attention mechanisms are applied in the model to differentiate the importance of different positions in the features. This method effectively models spatiotemporal relationships, thereby improving the accuracy of Labanotation action recognition. We conducted experiments on two segmented motion capture datasets, demonstrating the effectiveness of each module. Compared to existing methods, our approach shows superior performance and strong generalization ability. Given the relative simplicity of upper limb action recognition, our focus primarily lies on lower limb action recognition. Notably, this marks the first application of skeleton angle features in the field of Labanotation action recognition.</p>\n </div>","PeriodicalId":50645,"journal":{"name":"Computer Animation and Virtual Worlds","volume":"36 5","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Animation and Virtual Worlds","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cav.70073","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Labanotation is a scientific method for documenting dance movements that has been widely adopted globally. Existing methods for Labanotation action recognition perform poorly in handling complex movements and integrating spatiotemporal information. To address this, we propose a multi-branch spatiotemporal fusion network with attention mechanisms aimed at accurately recognizing Labanotation actions from motion capture data. Initially, we convert motion capture data into three-dimensional coordinates and extract skeleton vector features. Subsequently, we enhance feature representation by extracting temporal difference features and skeleton angle features from the skeleton vectors. These features are processed using gated recurrent units and residual networks to effectively integrate spatiotemporal information. Finally, attention mechanisms are applied in the model to differentiate the importance of different positions in the features. This method effectively models spatiotemporal relationships, thereby improving the accuracy of Labanotation action recognition. We conducted experiments on two segmented motion capture datasets, demonstrating the effectiveness of each module. Compared to existing methods, our approach shows superior performance and strong generalization ability. Given the relative simplicity of upper limb action recognition, our focus primarily lies on lower limb action recognition. Notably, this marks the first application of skeleton angle features in the field of Labanotation action recognition.
期刊介绍:
With the advent of very powerful PCs and high-end graphics cards, there has been an incredible development in Virtual Worlds, real-time computer animation and simulation, games. But at the same time, new and cheaper Virtual Reality devices have appeared allowing an interaction with these real-time Virtual Worlds and even with real worlds through Augmented Reality. Three-dimensional characters, especially Virtual Humans are now of an exceptional quality, which allows to use them in the movie industry. But this is only a beginning, as with the development of Artificial Intelligence and Agent technology, these characters will become more and more autonomous and even intelligent. They will inhabit the Virtual Worlds in a Virtual Life together with animals and plants.