M. Makino, Tomohiro Odaka, J. Kuroiwa, Izumi Suwa, Hideyuki Shirai
{"title":"特征选择,以赢得ATP网球选手的点使用拉力赛信息","authors":"M. Makino, Tomohiro Odaka, J. Kuroiwa, Izumi Suwa, Hideyuki Shirai","doi":"10.2478/ijcss-2020-0003","DOIUrl":null,"url":null,"abstract":"Abstract In tennis, the accumulation of data has progressed and research on tactical analysis has been conducted. Estimating strategically important factors would have the benefit of providing players with useful advice and helping audience members understand what tennis players are good at. Previous research has been conducted into ways of predicting Association of Tennis Professionals (ATP) tennis match outcomes as well as estimating factors that are important for victories using machine learning models. The challenge of previous research is that the victory factor lacks concreteness. Since we thought the root of the abovementioned problem was that previous researchers used game summary as a feature and did not consider the process of rallies between points, this research focused on calculating the frequency of single shots, two-shot patterns, and specific effective shot patterns from each point rally of ATP singles matches. We then used those data to predict point winners and useful features using L1-regularized logistic regression. The highest accuracy obtained was 66.5%, and the area under the curve (AUC) was 0.689. The most prominent feature we found was the ratio of specific shots by specific players. From these results, our method could reveal more concretely tactical factors than previous studies.","PeriodicalId":38466,"journal":{"name":"International Journal of Computer Science in Sport","volume":"19 1","pages":"37 - 50"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Feature Selection to Win the Point of ATP Tennis Players Using Rally Information\",\"authors\":\"M. Makino, Tomohiro Odaka, J. Kuroiwa, Izumi Suwa, Hideyuki Shirai\",\"doi\":\"10.2478/ijcss-2020-0003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract In tennis, the accumulation of data has progressed and research on tactical analysis has been conducted. Estimating strategically important factors would have the benefit of providing players with useful advice and helping audience members understand what tennis players are good at. Previous research has been conducted into ways of predicting Association of Tennis Professionals (ATP) tennis match outcomes as well as estimating factors that are important for victories using machine learning models. The challenge of previous research is that the victory factor lacks concreteness. Since we thought the root of the abovementioned problem was that previous researchers used game summary as a feature and did not consider the process of rallies between points, this research focused on calculating the frequency of single shots, two-shot patterns, and specific effective shot patterns from each point rally of ATP singles matches. We then used those data to predict point winners and useful features using L1-regularized logistic regression. The highest accuracy obtained was 66.5%, and the area under the curve (AUC) was 0.689. The most prominent feature we found was the ratio of specific shots by specific players. From these results, our method could reveal more concretely tactical factors than previous studies.\",\"PeriodicalId\":38466,\"journal\":{\"name\":\"International Journal of Computer Science in Sport\",\"volume\":\"19 1\",\"pages\":\"37 - 50\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computer Science in Sport\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/ijcss-2020-0003\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Science in Sport","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ijcss-2020-0003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
Feature Selection to Win the Point of ATP Tennis Players Using Rally Information
Abstract In tennis, the accumulation of data has progressed and research on tactical analysis has been conducted. Estimating strategically important factors would have the benefit of providing players with useful advice and helping audience members understand what tennis players are good at. Previous research has been conducted into ways of predicting Association of Tennis Professionals (ATP) tennis match outcomes as well as estimating factors that are important for victories using machine learning models. The challenge of previous research is that the victory factor lacks concreteness. Since we thought the root of the abovementioned problem was that previous researchers used game summary as a feature and did not consider the process of rallies between points, this research focused on calculating the frequency of single shots, two-shot patterns, and specific effective shot patterns from each point rally of ATP singles matches. We then used those data to predict point winners and useful features using L1-regularized logistic regression. The highest accuracy obtained was 66.5%, and the area under the curve (AUC) was 0.689. The most prominent feature we found was the ratio of specific shots by specific players. From these results, our method could reveal more concretely tactical factors than previous studies.