Detecting and Tracking Player in Football Videos Using Two-Stage Mask R-CNN Approach

A. M. Husein, Chalvin, Kalvintirta Ciptady Ciptady, Raymond Suryadi, M. Harahap
{"title":"Detecting and Tracking Player in Football Videos Using Two-Stage Mask R-CNN Approach","authors":"A. M. Husein, Chalvin, Kalvintirta Ciptady Ciptady, Raymond Suryadi, M. Harahap","doi":"10.34306/conferenceseries.v4i1.643","DOIUrl":null,"url":null,"abstract":"Football is one of the most popular sports worldwide and capable of attracting the attention of millions of fans to a single match in the top leagues. The English Premier League, Spanish LaLiga, German Bundesliga, Italian Serie A, and French Ligue 1 are the five best leagues in the world today. There was an experiment where researchers want to analyze the efficiency and accuracy percentage of tracking and detection using the deep learning method of the Mask R-CNN model in classifying positive and negative X-Ray images in football matches. In this study, we applied Mask R-CNN for the segmentation and detection of football players. This model was based on two different backbones, namely ResNet101 and DenseNet. Both backbones produced accuracy values that were not significantly different, but the DenseNet approach performed better than ResNet101 based on testing results in the validation and testing sets. Based on comprehensive experiment results on the dataset, it has been shown that the Mask R-CNN approach with DenseNet can achieve better results compared to Mask R-CNN with ResNet101. Due to insufficient understanding of the characteristics of image types and the uneven distribution of various types of data sourced from random videos, there was still room for improvement in the trained model.","PeriodicalId":505674,"journal":{"name":"Conference Series","volume":"382 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34306/conferenceseries.v4i1.643","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Football is one of the most popular sports worldwide and capable of attracting the attention of millions of fans to a single match in the top leagues. The English Premier League, Spanish LaLiga, German Bundesliga, Italian Serie A, and French Ligue 1 are the five best leagues in the world today. There was an experiment where researchers want to analyze the efficiency and accuracy percentage of tracking and detection using the deep learning method of the Mask R-CNN model in classifying positive and negative X-Ray images in football matches. In this study, we applied Mask R-CNN for the segmentation and detection of football players. This model was based on two different backbones, namely ResNet101 and DenseNet. Both backbones produced accuracy values that were not significantly different, but the DenseNet approach performed better than ResNet101 based on testing results in the validation and testing sets. Based on comprehensive experiment results on the dataset, it has been shown that the Mask R-CNN approach with DenseNet can achieve better results compared to Mask R-CNN with ResNet101. Due to insufficient understanding of the characteristics of image types and the uneven distribution of various types of data sourced from random videos, there was still room for improvement in the trained model.
使用两级掩码 R-CNN 方法检测和跟踪足球视频中的球员
足球是世界上最受欢迎的运动之一,顶级联赛的一场比赛就能吸引数百万球迷的关注。英格兰足球超级联赛、西班牙足球甲级联赛、德国足球甲级联赛、意大利足球甲级联赛和法国足球甲级联赛是当今世界上最好的五大联赛。在一项实验中,研究人员希望分析使用 Mask R-CNN 模型的深度学习方法对足球比赛中的正负 X 光图像进行分类时,跟踪和检测的效率和准确率。在这项研究中,我们将 Mask R-CNN 应用于足球运动员的分割和检测。该模型基于两个不同的骨干网,即 ResNet101 和 DenseNet。两个骨干网产生的准确率值没有明显差异,但根据验证集和测试集的测试结果,DenseNet 方法的表现优于 ResNet101。基于数据集的综合实验结果表明,与使用 ResNet101 的掩码 R-CNN 方法相比,使用 DenseNet 的掩码 R-CNN 方法能取得更好的结果。由于对图像类型的特征了解不够,以及随机视频中各类数据分布不均,训练出的模型仍有改进空间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信