棒球统计数据的模糊聚类

B. Bushong
{"title":"棒球统计数据的模糊聚类","authors":"B. Bushong","doi":"10.1109/NAFIPS.2007.383812","DOIUrl":null,"url":null,"abstract":"A previous investigation into the ability of fuzzy clustering to be a sound method for comparing Major League Baseball players' batting averages yielded promising results. Yet, the study involved a rather small sample of 90 batting averages, which were fuzzy clustered into three categories. Furthermore, the primary study focused on batting averages, a statistic that is incapable of reflecting a player's hitting ability on its own, and it certainly does not account for the defensive skill of a player. While the original work highlighted some of the inherent advantages to fuzzy clustering, the small sample size and number of groups did not allow for a complete spectrum to be generated. Without an uncondensed continuum from which to compare the relative overall skills of all players, the original results limited the practical applications of fuzzy clustering in Major League Baseball. The current research aims to greatly improve upon the first study and emphasize the potential gains from the implementation of fuzzy clustering in the practices of Major League Baseball. In an effort to provide a more comprehensive analysis of baseball statistics, this investigation includes two additional hitting statistics, on base percentage and slugging percentage, and incorporates fielding percentage. The three added statistics reflect a player's bat control, power, and defensive reliability, respectively, all of which teams use to gauge a player's skills. All three offensive statistics are averaged to generate an inclusive measure of a player's offensive capabilities, and the corresponding fielding percentage was added as a second dimension into the fuzzy clustering program. The new four-input model is a more developed and more applicable version of the one produced in the original research. Fuzzy clustering of batting averages, on base percentages, slugging percentages, and fielding percentages is an innovative way for teams to compare an individual's skills to that of all professional players simultaneously, since fuzzy clustering is ideal for establishing relationships between data that would not normally be associated. Baseball statisticians will no longer be forced to merely note the numerical difference between players' three key hitting statistics and a critical defensive measure. Instead, players can be grouped according to their relative production, providing organizations with a more comprehensive view of players' capabilities. In this investigation, 968 Major League Baseball players' selected statistics were fuzzy clustered into nine groups, in an effort to better express the range of baseball skills. The results of the research offer insight into an amount of data that cannot efficiently be processed by an individual, which would make fuzzy clustering of batting averages an invaluable tool for Major League Baseball. Motivational resources are greatly needed in such a mentally and emotionally draining sport, and fuzzy-clustered statistics would provide organizations with such a resource. Players can be greatly uplifted when shown that they rank relatively well among their peers. Successful players can also use their relative groupings to secure better contracts. Owners, however, can save money in contract negotiations and retain desired players that are currently not performing well, according to the fuzzy cluster to which they belong. An additional way for owners to conserve funds is by utilizing fuzzy clustering in the scouting process. Instead of large travel budgets, fuzzy clustering can be used to compare the skill of a prospect among his peers. Finally, the teams that are the quickest to apply fuzzy clustering of baseball data to player trades will gain a competitive edge. Teams will seek trades for players of the same relative skill level, and possibly receive two or more players for one that has slightly better nominal numbers. Major League Baseball is filled with countless amounts of data. Fuzzy clustering of statistics would alleviate some of the inefficient data processing, and the pioneering organization will accrue the most benefits from the implementation of fuzzy clustering of baseball statistics.","PeriodicalId":292853,"journal":{"name":"NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Fuzzy Clustering of Baseball Statistics\",\"authors\":\"B. Bushong\",\"doi\":\"10.1109/NAFIPS.2007.383812\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A previous investigation into the ability of fuzzy clustering to be a sound method for comparing Major League Baseball players' batting averages yielded promising results. Yet, the study involved a rather small sample of 90 batting averages, which were fuzzy clustered into three categories. Furthermore, the primary study focused on batting averages, a statistic that is incapable of reflecting a player's hitting ability on its own, and it certainly does not account for the defensive skill of a player. While the original work highlighted some of the inherent advantages to fuzzy clustering, the small sample size and number of groups did not allow for a complete spectrum to be generated. Without an uncondensed continuum from which to compare the relative overall skills of all players, the original results limited the practical applications of fuzzy clustering in Major League Baseball. The current research aims to greatly improve upon the first study and emphasize the potential gains from the implementation of fuzzy clustering in the practices of Major League Baseball. In an effort to provide a more comprehensive analysis of baseball statistics, this investigation includes two additional hitting statistics, on base percentage and slugging percentage, and incorporates fielding percentage. The three added statistics reflect a player's bat control, power, and defensive reliability, respectively, all of which teams use to gauge a player's skills. All three offensive statistics are averaged to generate an inclusive measure of a player's offensive capabilities, and the corresponding fielding percentage was added as a second dimension into the fuzzy clustering program. The new four-input model is a more developed and more applicable version of the one produced in the original research. Fuzzy clustering of batting averages, on base percentages, slugging percentages, and fielding percentages is an innovative way for teams to compare an individual's skills to that of all professional players simultaneously, since fuzzy clustering is ideal for establishing relationships between data that would not normally be associated. Baseball statisticians will no longer be forced to merely note the numerical difference between players' three key hitting statistics and a critical defensive measure. Instead, players can be grouped according to their relative production, providing organizations with a more comprehensive view of players' capabilities. In this investigation, 968 Major League Baseball players' selected statistics were fuzzy clustered into nine groups, in an effort to better express the range of baseball skills. The results of the research offer insight into an amount of data that cannot efficiently be processed by an individual, which would make fuzzy clustering of batting averages an invaluable tool for Major League Baseball. Motivational resources are greatly needed in such a mentally and emotionally draining sport, and fuzzy-clustered statistics would provide organizations with such a resource. Players can be greatly uplifted when shown that they rank relatively well among their peers. Successful players can also use their relative groupings to secure better contracts. Owners, however, can save money in contract negotiations and retain desired players that are currently not performing well, according to the fuzzy cluster to which they belong. An additional way for owners to conserve funds is by utilizing fuzzy clustering in the scouting process. Instead of large travel budgets, fuzzy clustering can be used to compare the skill of a prospect among his peers. Finally, the teams that are the quickest to apply fuzzy clustering of baseball data to player trades will gain a competitive edge. Teams will seek trades for players of the same relative skill level, and possibly receive two or more players for one that has slightly better nominal numbers. Major League Baseball is filled with countless amounts of data. Fuzzy clustering of statistics would alleviate some of the inefficient data processing, and the pioneering organization will accrue the most benefits from the implementation of fuzzy clustering of baseball statistics.\",\"PeriodicalId\":292853,\"journal\":{\"name\":\"NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NAFIPS.2007.383812\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAFIPS.2007.383812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

之前对模糊聚类作为比较美国职业棒球大联盟球员击球率的可靠方法的能力进行了调查,结果令人鼓舞。然而,这项研究涉及了一个相当小的90个击球率样本,这些样本被模糊地分为三类。此外,主要的研究集中在击球率上,这是一个不能反映球员击球能力的统计数据,当然也不能说明球员的防守技能。虽然最初的工作强调了模糊聚类的一些固有优势,但小样本量和组数不允许生成完整的谱。由于没有一个非浓缩的连续体来比较所有球员的相对整体技能,原始结果限制了模糊聚类在美国职业棒球大联盟中的实际应用。目前的研究旨在大大改进第一项研究,并强调在美国职业棒球大联盟实践中实施模糊聚类的潜在收益。为了对棒球统计数据进行更全面的分析,本调查包括了两个额外的打击统计数据,上垒率和击球率,并纳入了外野击球率。这三个增加的数据分别反映了球员的击球控制、力量和防守可靠性,所有这些都是球队用来衡量球员技术的指标。将所有三个进攻统计数据进行平均,以生成一个球员进攻能力的包容性指标,并将相应的防守率作为第二个维度添加到模糊聚类程序中。新的四输入模型是原始研究中产生的模型的更完善和更适用的版本。击球率、上垒率、击球率和守备率的模糊聚类是团队将个人技能与所有职业球员同时进行比较的一种创新方式,因为模糊聚类对于建立通常不相关的数据之间的关系是理想的。棒球统计学家将不再被迫仅仅注意球员的三个关键打击数据和关键防守措施之间的数字差异。相反,玩家可以根据他们的相对产出进行分组,为组织提供更全面的玩家能力视图。在本次调查中,968名美国职业棒球大联盟球员的统计数据被模糊聚类为9组,以更好地表达棒球技能的范围。研究结果提供了对个人无法有效处理的大量数据的见解,这将使击球率的模糊聚类成为美国职业棒球大联盟的宝贵工具。在这样一项耗费精神和情感的运动中,激励资源是非常需要的,而模糊聚类统计将为组织提供这样的资源。当玩家在同伴中排名相对较高时,他们会受到极大的鼓舞。成功的球员也可以利用他们的相对分组来获得更好的合同。然而,老板们可以在合同谈判中节省资金,并根据他们所属的模糊集群保留目前表现不佳的理想球员。业主节省资金的另一种方法是在侦察过程中利用模糊聚类。模糊聚类可以用来在同行中比较潜在客户的技能,而不是大的旅行预算。最后,最快将棒球数据模糊聚类应用于球员交易的球队将获得竞争优势。球队将寻求交易具有相同相对技术水平的球员,并且可能以一个名义数据稍好的球员换取两个或更多的球员。美国职业棒球大联盟充满了无数的数据。统计数据的模糊聚类可以缓解一些低效的数据处理,并且先锋组织将从棒球统计数据的模糊聚类实现中获得最大的收益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fuzzy Clustering of Baseball Statistics
A previous investigation into the ability of fuzzy clustering to be a sound method for comparing Major League Baseball players' batting averages yielded promising results. Yet, the study involved a rather small sample of 90 batting averages, which were fuzzy clustered into three categories. Furthermore, the primary study focused on batting averages, a statistic that is incapable of reflecting a player's hitting ability on its own, and it certainly does not account for the defensive skill of a player. While the original work highlighted some of the inherent advantages to fuzzy clustering, the small sample size and number of groups did not allow for a complete spectrum to be generated. Without an uncondensed continuum from which to compare the relative overall skills of all players, the original results limited the practical applications of fuzzy clustering in Major League Baseball. The current research aims to greatly improve upon the first study and emphasize the potential gains from the implementation of fuzzy clustering in the practices of Major League Baseball. In an effort to provide a more comprehensive analysis of baseball statistics, this investigation includes two additional hitting statistics, on base percentage and slugging percentage, and incorporates fielding percentage. The three added statistics reflect a player's bat control, power, and defensive reliability, respectively, all of which teams use to gauge a player's skills. All three offensive statistics are averaged to generate an inclusive measure of a player's offensive capabilities, and the corresponding fielding percentage was added as a second dimension into the fuzzy clustering program. The new four-input model is a more developed and more applicable version of the one produced in the original research. Fuzzy clustering of batting averages, on base percentages, slugging percentages, and fielding percentages is an innovative way for teams to compare an individual's skills to that of all professional players simultaneously, since fuzzy clustering is ideal for establishing relationships between data that would not normally be associated. Baseball statisticians will no longer be forced to merely note the numerical difference between players' three key hitting statistics and a critical defensive measure. Instead, players can be grouped according to their relative production, providing organizations with a more comprehensive view of players' capabilities. In this investigation, 968 Major League Baseball players' selected statistics were fuzzy clustered into nine groups, in an effort to better express the range of baseball skills. The results of the research offer insight into an amount of data that cannot efficiently be processed by an individual, which would make fuzzy clustering of batting averages an invaluable tool for Major League Baseball. Motivational resources are greatly needed in such a mentally and emotionally draining sport, and fuzzy-clustered statistics would provide organizations with such a resource. Players can be greatly uplifted when shown that they rank relatively well among their peers. Successful players can also use their relative groupings to secure better contracts. Owners, however, can save money in contract negotiations and retain desired players that are currently not performing well, according to the fuzzy cluster to which they belong. An additional way for owners to conserve funds is by utilizing fuzzy clustering in the scouting process. Instead of large travel budgets, fuzzy clustering can be used to compare the skill of a prospect among his peers. Finally, the teams that are the quickest to apply fuzzy clustering of baseball data to player trades will gain a competitive edge. Teams will seek trades for players of the same relative skill level, and possibly receive two or more players for one that has slightly better nominal numbers. Major League Baseball is filled with countless amounts of data. Fuzzy clustering of statistics would alleviate some of the inefficient data processing, and the pioneering organization will accrue the most benefits from the implementation of fuzzy clustering of baseball statistics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信