Yijia Wen , Xiaoyan Luo , Lei Zheng , Liangnan Qi , Xiaofeng Shi
{"title":"vAQA-SS: Vision-based action quality assessment for style-based skiing","authors":"Yijia Wen , Xiaoyan Luo , Lei Zheng , Liangnan Qi , Xiaofeng Shi","doi":"10.1016/j.displa.2025.103020","DOIUrl":null,"url":null,"abstract":"<div><div>Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed <em>or</em>AQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed <em>wr</em>AQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (<em>or</em>AQA) and relative evaluation (<em>wr</em>AQA), surpassing other similar sports AQA methods.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"88 ","pages":"Article 103020"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000575","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed orAQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed wrAQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (orAQA) and relative evaluation (wrAQA), surpassing other similar sports AQA methods.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.