Mabliny Thuany, Katja Weiss, David Valero, Elias Villiger, Marilia S Andrade, Pantelis T Nikolaidis, Volker Scheer, Claudio Andre Barbosa de Lira, Rodrigo Luiz Vancini, Ivan Cuk, Lorin Braschler, Thomas Rosemann, Beat Knechtle
{"title":"使用机器学习方法对6小时超级马拉松比赛进行分析。","authors":"Mabliny Thuany, Katja Weiss, David Valero, Elias Villiger, Marilia S Andrade, Pantelis T Nikolaidis, Volker Scheer, Claudio Andre Barbosa de Lira, Rodrigo Luiz Vancini, Ivan Cuk, Lorin Braschler, Thomas Rosemann, Beat Knechtle","doi":"10.3389/fspor.2025.1577470","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ultra-marathon running popularity is increasing, with the 6-h run being the shortest time-limited ultra-marathon. Since very little is known regarding the country from which the fastest 6-h runners originate, the fastest age group, and where the fastest 6-h race courses are located, this study aims to close this gap.</p><p><strong>Methods: </strong>A machine learning model based on the XG Boost algorithm was built to predict running speed based on the athletés age, gender, country of origin, and the country where the race takes place. Model explainability tools were used to investigate how each independent variable would influence the predicted running speed. To assess the impact of individual performance against the other variables under study, a Mixed Effects Linear Model was also built.</p><p><strong>Results: </strong>A total of 117,882 race records from 51,018 unique runners from 65 countries participating in races held in 56 different countries were analyzed. Participation is spread across a wide range of countries, with a high correlation between the country of origin and the country of the event. Most runners originated from Germany, Italy, France, the USA, and Sweden, with Europe (Belgium, Russia, Spain, Poland, Romania, and Lithuania), being the fastest. Most athletes competed in Italy, Germany, France, the USA, and The Netherlands. The fastest average running speeds were also achieved in European countries (Russia, Belgium, Poland, Netherlands, Romania, Croatia, and Lithuania).</p><p><strong>Conclusions: </strong>For athletes competing in a 6-h ultramarathon, gender was the most important predictor, followed by the origin of the athlete, the age, and the race location. The 6-h running event seems to be dominated by European athletes regarding both participation and performance.</p>","PeriodicalId":12716,"journal":{"name":"Frontiers in Sports and Active Living","volume":"7 ","pages":"1577470"},"PeriodicalIF":2.6000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12499350/pdf/","citationCount":"0","resultStr":"{\"title\":\"An analysis of the 6-h ultra-marathon race using a machine learning approach.\",\"authors\":\"Mabliny Thuany, Katja Weiss, David Valero, Elias Villiger, Marilia S Andrade, Pantelis T Nikolaidis, Volker Scheer, Claudio Andre Barbosa de Lira, Rodrigo Luiz Vancini, Ivan Cuk, Lorin Braschler, Thomas Rosemann, Beat Knechtle\",\"doi\":\"10.3389/fspor.2025.1577470\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Ultra-marathon running popularity is increasing, with the 6-h run being the shortest time-limited ultra-marathon. Since very little is known regarding the country from which the fastest 6-h runners originate, the fastest age group, and where the fastest 6-h race courses are located, this study aims to close this gap.</p><p><strong>Methods: </strong>A machine learning model based on the XG Boost algorithm was built to predict running speed based on the athletés age, gender, country of origin, and the country where the race takes place. Model explainability tools were used to investigate how each independent variable would influence the predicted running speed. To assess the impact of individual performance against the other variables under study, a Mixed Effects Linear Model was also built.</p><p><strong>Results: </strong>A total of 117,882 race records from 51,018 unique runners from 65 countries participating in races held in 56 different countries were analyzed. Participation is spread across a wide range of countries, with a high correlation between the country of origin and the country of the event. Most runners originated from Germany, Italy, France, the USA, and Sweden, with Europe (Belgium, Russia, Spain, Poland, Romania, and Lithuania), being the fastest. Most athletes competed in Italy, Germany, France, the USA, and The Netherlands. The fastest average running speeds were also achieved in European countries (Russia, Belgium, Poland, Netherlands, Romania, Croatia, and Lithuania).</p><p><strong>Conclusions: </strong>For athletes competing in a 6-h ultramarathon, gender was the most important predictor, followed by the origin of the athlete, the age, and the race location. The 6-h running event seems to be dominated by European athletes regarding both participation and performance.</p>\",\"PeriodicalId\":12716,\"journal\":{\"name\":\"Frontiers in Sports and Active Living\",\"volume\":\"7 \",\"pages\":\"1577470\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12499350/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Sports and Active Living\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fspor.2025.1577470\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"SPORT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Sports and Active Living","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fspor.2025.1577470","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"SPORT SCIENCES","Score":null,"Total":0}
An analysis of the 6-h ultra-marathon race using a machine learning approach.
Background: Ultra-marathon running popularity is increasing, with the 6-h run being the shortest time-limited ultra-marathon. Since very little is known regarding the country from which the fastest 6-h runners originate, the fastest age group, and where the fastest 6-h race courses are located, this study aims to close this gap.
Methods: A machine learning model based on the XG Boost algorithm was built to predict running speed based on the athletés age, gender, country of origin, and the country where the race takes place. Model explainability tools were used to investigate how each independent variable would influence the predicted running speed. To assess the impact of individual performance against the other variables under study, a Mixed Effects Linear Model was also built.
Results: A total of 117,882 race records from 51,018 unique runners from 65 countries participating in races held in 56 different countries were analyzed. Participation is spread across a wide range of countries, with a high correlation between the country of origin and the country of the event. Most runners originated from Germany, Italy, France, the USA, and Sweden, with Europe (Belgium, Russia, Spain, Poland, Romania, and Lithuania), being the fastest. Most athletes competed in Italy, Germany, France, the USA, and The Netherlands. The fastest average running speeds were also achieved in European countries (Russia, Belgium, Poland, Netherlands, Romania, Croatia, and Lithuania).
Conclusions: For athletes competing in a 6-h ultramarathon, gender was the most important predictor, followed by the origin of the athlete, the age, and the race location. The 6-h running event seems to be dominated by European athletes regarding both participation and performance.