Senne Michielssen, Adam Maloof, Joe Haumacher, Alexander Dreger, Kyle Bonicki, Karl Hallgren
{"title":"Using large language models to generate baseball spray charts in the absence of numerical data","authors":"Senne Michielssen, Adam Maloof, Joe Haumacher, Alexander Dreger, Kyle Bonicki, Karl Hallgren","doi":"10.1177/17543371241257734","DOIUrl":null,"url":null,"abstract":"Although baseball has been revolutionized by analytics, not all teams have access to high quality data. While many high school, collegiate, and club teams do not have high speed cameras and radars, they often do record a text-based play-by-play account of the game. The purpose of this study is to demonstrate how to use large language models to convert play-by-play information into quantitative data. We walk through the specific example of spray charts, which depict where on the baseball diamond a hitter tends to put the ball in play. Spray charts are a particularly relevant example because of their use in informing in-game strategy decisions (e.g., the infield shift). This study successfully generates spray charts for collegiate baseball players with 95% accuracy.","PeriodicalId":20674,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology","volume":"67 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/17543371241257734","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Although baseball has been revolutionized by analytics, not all teams have access to high quality data. While many high school, collegiate, and club teams do not have high speed cameras and radars, they often do record a text-based play-by-play account of the game. The purpose of this study is to demonstrate how to use large language models to convert play-by-play information into quantitative data. We walk through the specific example of spray charts, which depict where on the baseball diamond a hitter tends to put the ball in play. Spray charts are a particularly relevant example because of their use in informing in-game strategy decisions (e.g., the infield shift). This study successfully generates spray charts for collegiate baseball players with 95% accuracy.
期刊介绍:
The Journal of Sports Engineering and Technology covers the development of novel sports apparel, footwear, and equipment; and the materials, instrumentation, and processes that make advances in sports possible.