NPIPVis: A visualization system involving NBA visual analysis and integrated learning model prediction

Q1 Computer Science
Zhuo Shi , Mingrui Li , Meng Wang , Jing Shen , Wei Chen , Xiaonan Luo
{"title":"NPIPVis: A visualization system involving NBA visual analysis and integrated learning model prediction","authors":"Zhuo Shi ,&nbsp;Mingrui Li ,&nbsp;Meng Wang ,&nbsp;Jing Shen ,&nbsp;Wei Chen ,&nbsp;Xiaonan Luo","doi":"10.1016/j.vrih.2022.08.008","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Data-driven event analysis has gradually become the backbone of modern competitive sports analysis. Competitive sports data analysis tasks increasingly use computer vision and machine-learning models for intelligent data analysis. Existing sports visualization systems focus on the player–team data visualization, which is not intuitive enough for team season win–loss data and game time-series data visualization and neglects the prediction of all-star players.</p></div><div><h3>Methods</h3><p>This study used an interactive visualization system designed with parallel aggregated ordered hypergraph dynamic hypergraphs, Calliope visualization data story technology, and iStoryline narrative visualization technology to visualize the regular statistics and game time data of players and teams. NPIPVis includes dynamic hypergraphs of a teamʹs wins and losses and game plot narrative visualization components. In addition, an integrated learning-based all-star player prediction model, SRR-voting, which starts from the existing minority and majority samples, was proposed using the synthetic minority oversampling technique and RandomUnderSampler methods to generate and eliminate samples of a certain size to balance the number of allstar and average players in the datasets. Next, a random forest algorithm was introduced to extract and construct the features of players and combined with the voting integrated model to predict the all-star players, using Grid- SearchCV, to optimize the hyperparameters of each model in integrated learning and then combined with five-fold cross-validation to improve the generalization ability of the model. Finally, the SHapley Additive exPlanations (SHAP) model was introduced to enhance the interpretability of the model.</p></div><div><h3>Results</h3><p>The experimental results of comparing the SRR-voting model with six common models show that the accuracy, F1-score, and recall metrics are significantly improved, which verifies the effectiveness and practicality of the SRR-voting model.</p></div><div><h3>Conclusions</h3><p>This study combines data visualization and machine learning to design a National Basketball Association data visualization system to help the general audience visualize game data and predict all-star players; this can also be extended to other sports events or related fields.</p></div>","PeriodicalId":33538,"journal":{"name":"Virtual Reality Intelligent Hardware","volume":"4 5","pages":"Pages 444-458"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2096579622000833/pdf?md5=1a85324d30377a749ed5c9c70fb6f227&pid=1-s2.0-S2096579622000833-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Reality Intelligent Hardware","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096579622000833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Data-driven event analysis has gradually become the backbone of modern competitive sports analysis. Competitive sports data analysis tasks increasingly use computer vision and machine-learning models for intelligent data analysis. Existing sports visualization systems focus on the player–team data visualization, which is not intuitive enough for team season win–loss data and game time-series data visualization and neglects the prediction of all-star players.

Methods

This study used an interactive visualization system designed with parallel aggregated ordered hypergraph dynamic hypergraphs, Calliope visualization data story technology, and iStoryline narrative visualization technology to visualize the regular statistics and game time data of players and teams. NPIPVis includes dynamic hypergraphs of a teamʹs wins and losses and game plot narrative visualization components. In addition, an integrated learning-based all-star player prediction model, SRR-voting, which starts from the existing minority and majority samples, was proposed using the synthetic minority oversampling technique and RandomUnderSampler methods to generate and eliminate samples of a certain size to balance the number of allstar and average players in the datasets. Next, a random forest algorithm was introduced to extract and construct the features of players and combined with the voting integrated model to predict the all-star players, using Grid- SearchCV, to optimize the hyperparameters of each model in integrated learning and then combined with five-fold cross-validation to improve the generalization ability of the model. Finally, the SHapley Additive exPlanations (SHAP) model was introduced to enhance the interpretability of the model.

Results

The experimental results of comparing the SRR-voting model with six common models show that the accuracy, F1-score, and recall metrics are significantly improved, which verifies the effectiveness and practicality of the SRR-voting model.

Conclusions

This study combines data visualization and machine learning to design a National Basketball Association data visualization system to help the general audience visualize game data and predict all-star players; this can also be extended to other sports events or related fields.

NPIPVis:一个包含NBA可视化分析和综合学习模型预测的可视化系统
数据驱动的赛事分析已逐渐成为现代竞技体育分析的中坚力量。竞技体育数据分析任务越来越多地使用计算机视觉和机器学习模型进行智能数据分析。现有的体育可视化系统主要集中在球员-球队数据的可视化上,对球队赛季胜负数据和比赛时间序列数据的可视化不够直观,忽略了对全明星球员的预测。方法采用并行聚合有序超图动态超图、Calliope可视化数据故事技术和iStoryline叙事可视化技术设计的交互式可视化系统,对运动员和球队的常规统计数据和比赛时间数据进行可视化。NPIPVis包括球队输赢的动态超图和游戏情节叙事可视化组件。此外,提出了基于学习的全明星球员综合预测模型SRR-voting,该模型从现有的少数和多数样本出发,采用合成少数过采样技术和RandomUnderSampler方法生成和剔除一定规模的样本,以平衡数据集中全明星和普通球员的数量。接下来,引入随机森林算法提取和构造球员特征,结合投票综合模型预测全明星球员,利用Grid- SearchCV对综合学习中各模型的超参数进行优化,再结合五重交叉验证提高模型的泛化能力。最后,引入SHapley加性解释(SHAP)模型,增强模型的可解释性。结果将SRR-voting模型与6种常用模型进行对比,结果表明该模型在准确率、F1-score和召回率指标上均有显著提高,验证了SRR-voting模型的有效性和实用性。本研究将数据可视化与机器学习相结合,设计一个nba数据可视化系统,帮助普通观众将比赛数据可视化,预测全明星球员;这也可以扩展到其他体育赛事或相关领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Virtual Reality  Intelligent Hardware
Virtual Reality Intelligent Hardware Computer Science-Computer Graphics and Computer-Aided Design
CiteScore
6.40
自引率
0.00%
发文量
35
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信