{"title":"掌握击剑技术与机器学习:基于视频的分类和校正系统","authors":"Solange Emmenegger, Matthias Egli, M. Pouly","doi":"10.1109/SDS57534.2023.00025","DOIUrl":null,"url":null,"abstract":"In fencing and other sports, athletes must continually execute numerous movements absent of expert supervision. In this article, we address this issue by creating a smart coach that classffies and corrects fencing movements using video footage. To avoid contextual bias, a variety of machine learning models, including LSTM, CNN-BiLSTM and attention based models were trained on fencers’ keypoints. For this purpose, we collected and annotated a video dataset featuring more than 1200 videos of four fencing movements and corresponding error patterns. This will be published along with this work under the Creative Commons License 4.0. The actions in this dataset can be classified with the masked self-attention architecture attaining a macro-averaged F1 score of over 99% on the test set. In an additional effort to show the impact of label quality, the performances are boosted on average by 3% with the introduction of multi-labeling.","PeriodicalId":150544,"journal":{"name":"2023 10th IEEE Swiss Conference on Data Science (SDS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mastering Fencing Techniques with Machine Learning: A Video-Based Classification and Correction System\",\"authors\":\"Solange Emmenegger, Matthias Egli, M. Pouly\",\"doi\":\"10.1109/SDS57534.2023.00025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In fencing and other sports, athletes must continually execute numerous movements absent of expert supervision. In this article, we address this issue by creating a smart coach that classffies and corrects fencing movements using video footage. To avoid contextual bias, a variety of machine learning models, including LSTM, CNN-BiLSTM and attention based models were trained on fencers’ keypoints. For this purpose, we collected and annotated a video dataset featuring more than 1200 videos of four fencing movements and corresponding error patterns. This will be published along with this work under the Creative Commons License 4.0. The actions in this dataset can be classified with the masked self-attention architecture attaining a macro-averaged F1 score of over 99% on the test set. In an additional effort to show the impact of label quality, the performances are boosted on average by 3% with the introduction of multi-labeling.\",\"PeriodicalId\":150544,\"journal\":{\"name\":\"2023 10th IEEE Swiss Conference on Data Science (SDS)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 10th IEEE Swiss Conference on Data Science (SDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SDS57534.2023.00025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 10th IEEE Swiss Conference on Data Science (SDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SDS57534.2023.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mastering Fencing Techniques with Machine Learning: A Video-Based Classification and Correction System
In fencing and other sports, athletes must continually execute numerous movements absent of expert supervision. In this article, we address this issue by creating a smart coach that classffies and corrects fencing movements using video footage. To avoid contextual bias, a variety of machine learning models, including LSTM, CNN-BiLSTM and attention based models were trained on fencers’ keypoints. For this purpose, we collected and annotated a video dataset featuring more than 1200 videos of four fencing movements and corresponding error patterns. This will be published along with this work under the Creative Commons License 4.0. The actions in this dataset can be classified with the masked self-attention architecture attaining a macro-averaged F1 score of over 99% on the test set. In an additional effort to show the impact of label quality, the performances are boosted on average by 3% with the introduction of multi-labeling.