Scott A. Martin, Georgy E. Manucharyan, Patrice Klein
{"title":"Generative Data Assimilation for Surface Ocean State Estimation From Multi-Modal Satellite Observations","authors":"Scott A. Martin, Georgy E. Manucharyan, Patrice Klein","doi":"10.1029/2025MS005063","DOIUrl":null,"url":null,"abstract":"<p>Estimating the surface ocean state at mesoscale eddy-resolving scales is essential for understanding the role of eddies in climate and marine ecosystems. Satellites provide multi-modal observations through sea surface height, temperature (SST), and salinity (SSS). However, each variable is observed with varying resolutions and sparsity, while some variables, such as surface currents, are not yet observed by satellites. All these variables must be accurately reconstructed across scales to study eddy dynamics. Dynamical data assimilation (DA) struggles to accurately reconstruct eddies since, to respect the equations of motion, it must reconstruct both the surface and interior ocean state, but the interior is sparsely observed. Relaxing this requirement and focusing only on the surface could improve surface state estimation, but a new method is required to ensure reconstructions remain physically realistic. Here, we introduce a score-based generative data assimilation (GenDA) framework for jointly reconstructing key surface ocean variables at eddy-resolving scales from multi-modal satellite observations. GenDA uses a two-stage approach: training a score-based diffusion model on a simulation to generate realistic ocean states before employing this as a Bayesian prior to assimilate sparse observations and generate state estimates. The learned diffusion prior leads to coherence between variables and realism across scales. By synergizing low-resolution SSS with high-resolution SST observations, GenDA improves the SSS resolution. Remarkably, GenDA can infer unobserved surface currents using only satellite observables, suggesting the learned prior encodes physical relationships between variables. Applied to real observations, GenDA demonstrates strong generalizability compared to regression-based deep learning and outperforms state-of-the-art dynamical DA.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 8","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025MS005063","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025MS005063","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Estimating the surface ocean state at mesoscale eddy-resolving scales is essential for understanding the role of eddies in climate and marine ecosystems. Satellites provide multi-modal observations through sea surface height, temperature (SST), and salinity (SSS). However, each variable is observed with varying resolutions and sparsity, while some variables, such as surface currents, are not yet observed by satellites. All these variables must be accurately reconstructed across scales to study eddy dynamics. Dynamical data assimilation (DA) struggles to accurately reconstruct eddies since, to respect the equations of motion, it must reconstruct both the surface and interior ocean state, but the interior is sparsely observed. Relaxing this requirement and focusing only on the surface could improve surface state estimation, but a new method is required to ensure reconstructions remain physically realistic. Here, we introduce a score-based generative data assimilation (GenDA) framework for jointly reconstructing key surface ocean variables at eddy-resolving scales from multi-modal satellite observations. GenDA uses a two-stage approach: training a score-based diffusion model on a simulation to generate realistic ocean states before employing this as a Bayesian prior to assimilate sparse observations and generate state estimates. The learned diffusion prior leads to coherence between variables and realism across scales. By synergizing low-resolution SSS with high-resolution SST observations, GenDA improves the SSS resolution. Remarkably, GenDA can infer unobserved surface currents using only satellite observables, suggesting the learned prior encodes physical relationships between variables. Applied to real observations, GenDA demonstrates strong generalizability compared to regression-based deep learning and outperforms state-of-the-art dynamical DA.
期刊介绍:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.