Kang-En Huang, Minghuai Wang, Daniel Rosenfeld, Yannian Zhu
{"title":"利用物理信息自编码器探索液滴大小分布的可接受潜在表示","authors":"Kang-En Huang, Minghuai Wang, Daniel Rosenfeld, Yannian Zhu","doi":"10.1029/2024MS004821","DOIUrl":null,"url":null,"abstract":"<p>Investigating the role of clouds and precipitation in the Earth system necessitates microphysical schemes capable of accurately describing the evolution of hydrometeor particle size distribution (PSD), while maintaining low computational costs implementable in atmospheric models. Machine learning (ML) offers a promising approach to replace computationally expensive bin microphysical schemes with efficient emulations. However, many existing ML emulations predict moments of PSDs as prognostic variables, inheriting structural limitations from traditional bulk schemes. In contrast, latent variables directly discovered by ML have the potential to represent PSDs more accurately. However, their inherent nonlinearity breaks the conservation property under advection and diffusion, limiting their applicability in online simulations. To address this dilemma, we propose Non-negative weighted integrals (NWIs), formulated as weighted integrals of PSD with learnable non-negative weight functions. NWI provides the most general mathematical form for advectable microphysical prognostic variables. We conducted unsupervised learning over a liquid droplet PSD data set generated from ensemble large eddy simulations with Spectral Bin Microphysics (SBM). We used autoencoders that are physics-informed by NWI’s formulation to learn the optimal PSD representations from the data, and compared NWIs with traditional moment approaches in bulk schemes on their ability to represent PSDs in actual bin scheme simulations. Results show that NWIs can capture the critical information of medium-sized droplets, and outperform traditional cloud-rain moment approaches in terms of PSD reconstruction error, indicating improved PSD information compression efficiency. With these properties, NWIs are advantageous over moments as fully prognostic variables to build accurate ML-based bin-emulating schemes.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 9","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004821","citationCount":"0","resultStr":"{\"title\":\"Exploring Advectable Latent Representations for Droplet Size Distributions With Physics-Informed Autoencoders\",\"authors\":\"Kang-En Huang, Minghuai Wang, Daniel Rosenfeld, Yannian Zhu\",\"doi\":\"10.1029/2024MS004821\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Investigating the role of clouds and precipitation in the Earth system necessitates microphysical schemes capable of accurately describing the evolution of hydrometeor particle size distribution (PSD), while maintaining low computational costs implementable in atmospheric models. Machine learning (ML) offers a promising approach to replace computationally expensive bin microphysical schemes with efficient emulations. However, many existing ML emulations predict moments of PSDs as prognostic variables, inheriting structural limitations from traditional bulk schemes. In contrast, latent variables directly discovered by ML have the potential to represent PSDs more accurately. However, their inherent nonlinearity breaks the conservation property under advection and diffusion, limiting their applicability in online simulations. To address this dilemma, we propose Non-negative weighted integrals (NWIs), formulated as weighted integrals of PSD with learnable non-negative weight functions. NWI provides the most general mathematical form for advectable microphysical prognostic variables. We conducted unsupervised learning over a liquid droplet PSD data set generated from ensemble large eddy simulations with Spectral Bin Microphysics (SBM). We used autoencoders that are physics-informed by NWI’s formulation to learn the optimal PSD representations from the data, and compared NWIs with traditional moment approaches in bulk schemes on their ability to represent PSDs in actual bin scheme simulations. Results show that NWIs can capture the critical information of medium-sized droplets, and outperform traditional cloud-rain moment approaches in terms of PSD reconstruction error, indicating improved PSD information compression efficiency. With these properties, NWIs are advantageous over moments as fully prognostic variables to build accurate ML-based bin-emulating schemes.</p>\",\"PeriodicalId\":14881,\"journal\":{\"name\":\"Journal of Advances in Modeling Earth Systems\",\"volume\":\"17 9\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004821\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advances in Modeling Earth Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024MS004821\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024MS004821","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
研究云和降水在地球系统中的作用需要能够准确描述水流星粒径分布(PSD)演变的微物理方案,同时保持在大气模式中实现的低计算成本。机器学习(ML)提供了一种很有前途的方法,可以用有效的模拟来取代计算昂贵的bin微物理方案。然而,许多现有的ML模拟预测psd的矩作为预测变量,继承了传统批量方案的结构限制。相比之下,由ML直接发现的潜在变量有可能更准确地表示psd。然而,它们固有的非线性破坏了它们在平流和扩散下的守恒性质,限制了它们在在线模拟中的适用性。为了解决这一难题,我们提出了非负加权积分(nwi),将其表述为具有可学习的非负权重函数的PSD的加权积分。NWI为可预测的微物理预后变量提供了最通用的数学形式。我们利用光谱Bin微物理(Spectral Bin Microphysics, SBM)对集合大涡模拟生成的液滴PSD数据集进行了无监督学习。我们使用基于NWI公式的自编码器从数据中学习最佳PSD表示,并将NWI与批量方案中的传统矩方法在实际bin方案模拟中表示PSD的能力进行了比较。结果表明,NWIs可以捕获中等大小液滴的关键信息,在PSD重构误差方面优于传统的云雨矩方法,表明PSD信息压缩效率有所提高。有了这些特性,nwi比矩更有优势,可以作为完全预测变量来构建准确的基于ml的bin模拟方案。
Exploring Advectable Latent Representations for Droplet Size Distributions With Physics-Informed Autoencoders
Investigating the role of clouds and precipitation in the Earth system necessitates microphysical schemes capable of accurately describing the evolution of hydrometeor particle size distribution (PSD), while maintaining low computational costs implementable in atmospheric models. Machine learning (ML) offers a promising approach to replace computationally expensive bin microphysical schemes with efficient emulations. However, many existing ML emulations predict moments of PSDs as prognostic variables, inheriting structural limitations from traditional bulk schemes. In contrast, latent variables directly discovered by ML have the potential to represent PSDs more accurately. However, their inherent nonlinearity breaks the conservation property under advection and diffusion, limiting their applicability in online simulations. To address this dilemma, we propose Non-negative weighted integrals (NWIs), formulated as weighted integrals of PSD with learnable non-negative weight functions. NWI provides the most general mathematical form for advectable microphysical prognostic variables. We conducted unsupervised learning over a liquid droplet PSD data set generated from ensemble large eddy simulations with Spectral Bin Microphysics (SBM). We used autoencoders that are physics-informed by NWI’s formulation to learn the optimal PSD representations from the data, and compared NWIs with traditional moment approaches in bulk schemes on their ability to represent PSDs in actual bin scheme simulations. Results show that NWIs can capture the critical information of medium-sized droplets, and outperform traditional cloud-rain moment approaches in terms of PSD reconstruction error, indicating improved PSD information compression efficiency. With these properties, NWIs are advantageous over moments as fully prognostic variables to build accurate ML-based bin-emulating schemes.
期刊介绍:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.