Frame-Level Event Detection in Athletics Videos with Pose-Based Convolutional Sequence Networks
Moritz Einfalt, Charles Dampeyrou, D. Zecha, R. Lienhart
MMSports '19, 2019-10-15. DOI: 10.1145/3347318.3355525
In this paper we address the problem of automatic event detection in athlete motion for automated performance analysis in athletics. We specifically consider the detection of stride-, jump- and landing-related events from monocular recordings of the long and triple jump. Existing work on event detection in sports often uses manually designed features of the athlete's body and pose configuration to infer the occurrence of events. We present a two-step approach in which temporal 2D pose sequences extracted from the videos form the basis for learning an event detection model. We formulate the detection of discrete events as a sequence translation task and propose a convolutional sequence network that can accurately predict the timing of event occurrences. Our best-performing architecture achieves a precision/recall of 92.3%/89.0% in detecting the start and end of ground contact during the run-up and jump of an athlete, at a temporal precision of ±1 frame at 200 Hz. The results show that 2D pose sequences are a suitable motion representation for learning event detection in a sequence-to-sequence framework.
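The abstract does not specify the network in detail. As a rough illustration of the sequence-to-sequence idea (not the authors' implementation), the sketch below shows a minimal 1D temporal convolutional network in PyTorch that maps a 2D pose sequence (one set of keypoint coordinates per frame) to per-frame event class scores. The number of keypoints, layer sizes, dilation factors, and event classes are assumptions chosen only for illustration.

```python
# Minimal sketch (not the authors' code): a temporal convolutional network that
# translates a 2D pose sequence into per-frame event probabilities.
# Assumed for illustration: 14 keypoints per frame, 4 event classes + background.
import torch
import torch.nn as nn


class PoseEventNet(nn.Module):
    def __init__(self, num_keypoints=14, num_events=5, channels=128):
        super().__init__()
        in_dim = 2 * num_keypoints  # (x, y) coordinates per keypoint per frame
        self.backbone = nn.Sequential(
            nn.Conv1d(in_dim, channels, kernel_size=9, padding=4),
            nn.ReLU(),
            nn.Conv1d(channels, channels, kernel_size=9, padding=4),
            nn.ReLU(),
            # Dilated layer to enlarge the temporal receptive field.
            nn.Conv1d(channels, channels, kernel_size=9, padding=8, dilation=2),
            nn.ReLU(),
        )
        # Per-frame classification head: one score per event class (incl. background).
        self.head = nn.Conv1d(channels, num_events, kernel_size=1)

    def forward(self, poses):
        # poses: (batch, T, 2 * num_keypoints) -> logits: (batch, T, num_events)
        x = poses.transpose(1, 2)           # to (batch, features, T) for Conv1d
        logits = self.head(self.backbone(x))
        return logits.transpose(1, 2)       # back to (batch, T, num_events)


# Example: score a 400-frame sequence (2 s at 200 Hz) of dummy pose data.
model = PoseEventNet()
poses = torch.randn(1, 400, 28)
frame_probs = model(poses).softmax(dim=-1)  # per-frame event probabilities
frame_labels = frame_probs.argmax(dim=-1)   # discrete per-frame event labels
```

At inference, a simple post-processing step (for example, taking local maxima of the per-frame probabilities for each event class) would turn these dense predictions into discrete event timestamps, which could then be matched against ground-truth annotations within a ±1 frame tolerance as in the evaluation described above.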