Carson Evans, Laiyin Zhu, Kathleen Baker, Lei Meng
{"title":"Unveiling multiscale drivers of wind speed in Michigan using machine learning","authors":"Carson Evans, Laiyin Zhu, Kathleen Baker, Lei Meng","doi":"10.1038/s41612-025-01166-x","DOIUrl":null,"url":null,"abstract":"<p>The Great Lakes region has a unique climatology due to its large water bodies. Dynamic seasonal wind speeds are an important component in this climate that requires further study. Using 10-m wind data from ERA5-Land, this study employs remote teleconnection indices and local climate features to predict low-level wind speeds using Extreme Gradient Boosting (XGBoost) machine learning. The model for monthly winds achieves high accuracy, with an <i>R</i>² of 0.96 and a Root Mean Square Error (RMSE) of 0.12 m/s<sup>−1</sup>. The Shapley Additive Values (SHAP) analysis reveals that local climate variables, including the proximity to the nearest Great Lake, surface roughness, and surface temperature, are the most influential predictors and are most important in the model. Teleconnections such as the El Niño-Southern Oscillation and the Arctic Oscillation play minor roles. This study provides a new multiscale perspective on wind speed characteristics, drivers, and insights into the region’s wind energy potential.</p>","PeriodicalId":19438,"journal":{"name":"npj Climate and Atmospheric Science","volume":"24 1","pages":""},"PeriodicalIF":8.4000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj Climate and Atmospheric Science","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1038/s41612-025-01166-x","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The Great Lakes region has a unique climatology due to its large water bodies. Dynamic seasonal wind speeds are an important component in this climate that requires further study. Using 10-m wind data from ERA5-Land, this study employs remote teleconnection indices and local climate features to predict low-level wind speeds using Extreme Gradient Boosting (XGBoost) machine learning. The model for monthly winds achieves high accuracy, with an R² of 0.96 and a Root Mean Square Error (RMSE) of 0.12 m/s−1. The Shapley Additive Values (SHAP) analysis reveals that local climate variables, including the proximity to the nearest Great Lake, surface roughness, and surface temperature, are the most influential predictors and are most important in the model. Teleconnections such as the El Niño-Southern Oscillation and the Arctic Oscillation play minor roles. This study provides a new multiscale perspective on wind speed characteristics, drivers, and insights into the region’s wind energy potential.
五大湖地区由于其巨大的水体而具有独特的气候。动态季节性风速是这种气候的一个重要组成部分,需要进一步研究。本研究利用ERA5-Land的10米风数据,利用远程遥相关指数和当地气候特征,利用极端梯度增强(XGBoost)机器学习预测低层风速。月风模型具有较高的精度,R²为0.96,均方根误差(RMSE)为0.12 m/s - 1。Shapley相加值(SHAP)分析表明,当地气候变量,包括与最近的五大湖的接近程度、地表粗糙度和地表温度,是最具影响力的预测因子,也是模式中最重要的预测因子。像厄尔尼诺Niño-Southern涛动和北极涛动这样的遥相关作用较小。这项研究提供了一个新的多尺度视角来研究该地区的风速特征、驱动因素和风能潜力。
期刊介绍:
npj Climate and Atmospheric Science is an open-access journal encompassing the relevant physical, chemical, and biological aspects of atmospheric and climate science. The journal places particular emphasis on regional studies that unveil new insights into specific localities, including examinations of local atmospheric composition, such as aerosols.
The range of topics covered by the journal includes climate dynamics, climate variability, weather and climate prediction, climate change, ocean dynamics, weather extremes, air pollution, atmospheric chemistry (including aerosols), the hydrological cycle, and atmosphere–ocean and atmosphere–land interactions. The journal welcomes studies employing a diverse array of methods, including numerical and statistical modeling, the development and application of in situ observational techniques, remote sensing, and the development or evaluation of new reanalyses.