Xuedong Wang, Yaqi Li, Yinghong Zuo, Jinhui Zhu, Jun Zhuo, Xiazhi Li, Li Liu, Shengli Niu
{"title":"利用扩散模型和注意机制的气象风场快速降尺度方法","authors":"Xuedong Wang, Yaqi Li, Yinghong Zuo, Jinhui Zhu, Jun Zhuo, Xiazhi Li, Li Liu, Shengli Niu","doi":"10.1016/j.jweia.2025.106215","DOIUrl":null,"url":null,"abstract":"<div><div>Extensive applications have been found for rapid downscaling on meteorological wind field data in extreme weather warning, renewable energy assessment and site selection optimization, as well as dynamic simulation of nuclear accident emergency responses. This study proposes a progressive deep learning downscaling model (from 0.25° to 3 km) for generating high-resolution wind field data by integrating diffusion models with multi-head cross-attention mechanisms. Specifically, a progressive diffusion framework is developed to generate high-resolution wind fields through iterative denoising of latent states initialized from Gaussian noise distributions. Compared to Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) architecture, our approach achieved approximately 32 % reduction in Mean Absolute Error (MAE) and 46 % reduction in Mean Squared Error (MSE), while Structural Similarity Index (SSIM) was significantly improved from 0.7 to over 0.8. It is deduced that this approach significantly improves generation quality and alleviates non-physical artifacts caused by the mode collapse within Generative Adversarial Network (GAN). The integration of multi-head cross-attention mechanisms and 2D Rotary Positional Embedding (RoPE) enables effective fusion of cross-modal information from low-resolution wind fields, terrain data, and diffusion model latent states. Ablation studies demonstrate that the attention mechanism reduced both MAE and MSE by approximately 20 %, while improving Peak Signal-to-Noise Ratio (PSNR) and SSIM values. This indicates that the attention mechanisms enable the model to capture long-range spatial dependencies, significantly enhancing its representational capacity and physical consistency in spatial feature reconstruction. Additionally, to address the slow reasoning speed of traditional diffusion models, this work implements the Denoising Diffusion Implicit Model (DDIM) sampling algorithm to achieve deterministic latent space mapping, substantially reducing inference steps while maintaining generation accuracy. This framework ultimately realizes rapid generation of high-precision wind field data within seconds. The proposed methodology provides a novel solution for meteorological wind field downscaling, directly serving various engineering applications requiring refined meteorological data<strong>.</strong></div></div>","PeriodicalId":54752,"journal":{"name":"Journal of Wind Engineering and Industrial Aerodynamics","volume":"266 ","pages":"Article 106215"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A rapid downscaling approach for meteorological wind fields using diffusion models and attention mechanisms\",\"authors\":\"Xuedong Wang, Yaqi Li, Yinghong Zuo, Jinhui Zhu, Jun Zhuo, Xiazhi Li, Li Liu, Shengli Niu\",\"doi\":\"10.1016/j.jweia.2025.106215\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Extensive applications have been found for rapid downscaling on meteorological wind field data in extreme weather warning, renewable energy assessment and site selection optimization, as well as dynamic simulation of nuclear accident emergency responses. This study proposes a progressive deep learning downscaling model (from 0.25° to 3 km) for generating high-resolution wind field data by integrating diffusion models with multi-head cross-attention mechanisms. Specifically, a progressive diffusion framework is developed to generate high-resolution wind fields through iterative denoising of latent states initialized from Gaussian noise distributions. Compared to Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) architecture, our approach achieved approximately 32 % reduction in Mean Absolute Error (MAE) and 46 % reduction in Mean Squared Error (MSE), while Structural Similarity Index (SSIM) was significantly improved from 0.7 to over 0.8. It is deduced that this approach significantly improves generation quality and alleviates non-physical artifacts caused by the mode collapse within Generative Adversarial Network (GAN). The integration of multi-head cross-attention mechanisms and 2D Rotary Positional Embedding (RoPE) enables effective fusion of cross-modal information from low-resolution wind fields, terrain data, and diffusion model latent states. Ablation studies demonstrate that the attention mechanism reduced both MAE and MSE by approximately 20 %, while improving Peak Signal-to-Noise Ratio (PSNR) and SSIM values. This indicates that the attention mechanisms enable the model to capture long-range spatial dependencies, significantly enhancing its representational capacity and physical consistency in spatial feature reconstruction. Additionally, to address the slow reasoning speed of traditional diffusion models, this work implements the Denoising Diffusion Implicit Model (DDIM) sampling algorithm to achieve deterministic latent space mapping, substantially reducing inference steps while maintaining generation accuracy. This framework ultimately realizes rapid generation of high-precision wind field data within seconds. The proposed methodology provides a novel solution for meteorological wind field downscaling, directly serving various engineering applications requiring refined meteorological data<strong>.</strong></div></div>\",\"PeriodicalId\":54752,\"journal\":{\"name\":\"Journal of Wind Engineering and Industrial Aerodynamics\",\"volume\":\"266 \",\"pages\":\"Article 106215\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Wind Engineering and Industrial Aerodynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167610525002119\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Wind Engineering and Industrial Aerodynamics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167610525002119","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
A rapid downscaling approach for meteorological wind fields using diffusion models and attention mechanisms
Extensive applications have been found for rapid downscaling on meteorological wind field data in extreme weather warning, renewable energy assessment and site selection optimization, as well as dynamic simulation of nuclear accident emergency responses. This study proposes a progressive deep learning downscaling model (from 0.25° to 3 km) for generating high-resolution wind field data by integrating diffusion models with multi-head cross-attention mechanisms. Specifically, a progressive diffusion framework is developed to generate high-resolution wind fields through iterative denoising of latent states initialized from Gaussian noise distributions. Compared to Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) architecture, our approach achieved approximately 32 % reduction in Mean Absolute Error (MAE) and 46 % reduction in Mean Squared Error (MSE), while Structural Similarity Index (SSIM) was significantly improved from 0.7 to over 0.8. It is deduced that this approach significantly improves generation quality and alleviates non-physical artifacts caused by the mode collapse within Generative Adversarial Network (GAN). The integration of multi-head cross-attention mechanisms and 2D Rotary Positional Embedding (RoPE) enables effective fusion of cross-modal information from low-resolution wind fields, terrain data, and diffusion model latent states. Ablation studies demonstrate that the attention mechanism reduced both MAE and MSE by approximately 20 %, while improving Peak Signal-to-Noise Ratio (PSNR) and SSIM values. This indicates that the attention mechanisms enable the model to capture long-range spatial dependencies, significantly enhancing its representational capacity and physical consistency in spatial feature reconstruction. Additionally, to address the slow reasoning speed of traditional diffusion models, this work implements the Denoising Diffusion Implicit Model (DDIM) sampling algorithm to achieve deterministic latent space mapping, substantially reducing inference steps while maintaining generation accuracy. This framework ultimately realizes rapid generation of high-precision wind field data within seconds. The proposed methodology provides a novel solution for meteorological wind field downscaling, directly serving various engineering applications requiring refined meteorological data.
期刊介绍:
The objective of the journal is to provide a means for the publication and interchange of information, on an international basis, on all those aspects of wind engineering that are included in the activities of the International Association for Wind Engineering http://www.iawe.org/. These are: social and economic impact of wind effects; wind characteristics and structure, local wind environments, wind loads and structural response, diffusion, pollutant dispersion and matter transport, wind effects on building heat loss and ventilation, wind effects on transport systems, aerodynamic aspects of wind energy generation, and codification of wind effects.
Papers on these subjects describing full-scale measurements, wind-tunnel simulation studies, computational or theoretical methods are published, as well as papers dealing with the development of techniques and apparatus for wind engineering experiments.