Classifying tennis movements from video data presents significant challenges, including overfitting, limited datasets, low accuracy, and difficulty in capturing dynamic, real-world conditions such as variable lighting, camera angles, and complex player movements. Existing approaches lack the robustness and practicality required for real-time applications, which are crucial for sports analysts and coaches. To address these challenges, this paper proposes an architecture that combines a Bidirectional Long Short-Term Memory network (BiLSTM) with transfer learning from MobileNetV2, a lightweight Convolutional Neural Network (CNN). This work is motivated by the need to enable coaches to objectively analyze player performance and tailor training strategies based on precise movement recognition. The model is designed to capture richer video representations, improve action classification accuracy, and operate efficiently under real-world conditions. Validation on the THETIS dataset demonstrates state-of-the-art results, achieving 96.72% accuracy and 96.97% recall, significantly outperforming existing methods. Additionally, the integration of cloud and edge computing capabilities enables real-time detection of tennis actions, providing immediate, actionable insights for practitioners. A motivating case study shows how the method identifies and analyzes complex movements such as smashes and slices, addressing long-standing challenges in video-based tennis training. This research offers a robust and adaptable solution for classifying tennis actions, with promising implications for trainers and sports analysts seeking efficient and scalable tools for video analysis.
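To make the described pipeline concrete, the sketch below illustrates one plausible realization of the architecture in PyTorch: a frozen, ImageNet-pretrained MobileNetV2 backbone extracts per-frame features, a BiLSTM models the temporal sequence in both directions, and a linear head produces the action logits. This is a minimal illustration, not the authors' implementation; the class count (here set to the 12 THETIS shot types), hidden size, and clip length are assumptions chosen for the example.

```python
import torch
import torch.nn as nn
from torchvision.models import mobilenet_v2, MobileNet_V2_Weights

class TennisActionClassifier(nn.Module):
    """Illustrative MobileNetV2 + BiLSTM video classifier (not the paper's code)."""

    def __init__(self, num_classes=12, hidden_size=256):
        super().__init__()
        backbone = mobilenet_v2(weights=MobileNet_V2_Weights.IMAGENET1K_V1)
        # Transfer learning: reuse the convolutional feature extractor,
        # drop the ImageNet classifier head, and freeze the backbone.
        self.features = backbone.features
        for p in self.features.parameters():
            p.requires_grad = False
        self.pool = nn.AdaptiveAvgPool2d(1)  # (B*T, 1280, 1, 1)
        self.bilstm = nn.LSTM(input_size=1280, hidden_size=hidden_size,
                              batch_first=True, bidirectional=True)
        self.classifier = nn.Linear(2 * hidden_size, num_classes)

    def forward(self, clips):                       # clips: (B, T, 3, H, W)
        b, t = clips.shape[:2]
        x = self.features(clips.flatten(0, 1))      # per-frame CNN features
        x = self.pool(x).flatten(1).view(b, t, -1)  # (B, T, 1280)
        out, _ = self.bilstm(x)                     # temporal context, both directions
        return self.classifier(out[:, -1])          # logits from the final time step

# Example: a batch of two hypothetical 16-frame clips at 224x224 resolution.
logits = TennisActionClassifier()(torch.randn(2, 16, 3, 224, 224))
print(logits.shape)  # torch.Size([2, 12])
```

Freezing the pretrained backbone reflects the transfer-learning strategy named in the abstract: only the BiLSTM and classifier are trained, which suits the limited-data regime the paper highlights; whether the authors additionally fine-tune backbone layers is not specified here.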