Steffen Mauceri, William Keely, Josh Laughner, Christopher W. O’Dell, Steven Massie, Robert Nelson, David Baker, Matthäus Kiel, Otto Lamminpää, Jonathan Hobbs, Abhishek Chatterjee, Tommy Taylor, Paul Wennberg, Sean Crowell, Britton Stephens, Vivienne H. Payne
{"title":"Uncertainty-Aware Machine Learning Bias Correction and Filtering for OCO-2: 1","authors":"Steffen Mauceri, William Keely, Josh Laughner, Christopher W. O’Dell, Steven Massie, Robert Nelson, David Baker, Matthäus Kiel, Otto Lamminpää, Jonathan Hobbs, Abhishek Chatterjee, Tommy Taylor, Paul Wennberg, Sean Crowell, Britton Stephens, Vivienne H. Payne","doi":"10.1029/2025EA004328","DOIUrl":null,"url":null,"abstract":"<p>The Orbiting Carbon Observatory-2 (OCO-2) makes space-based radiance measurements of reflected sunlight. Using a physics-based retrieval algorithm, these measurements are inverted to estimate column-averaged atmospheric carbon dioxide dry-air mole fractions (XCO<sub>2</sub>). However, biases are present in the retrieved XCO<sub>2</sub> due to sensor calibration errors and discrepancies between the physics-based retrieval and nature. We propose a Random Forest (RF), a non-linear, interpretable machine learning (ML) technique, to correct these biases. The approach is rigorously validated, comes with quantified uncertainties, and is derived independent of carbon flux models. Compared to the operational approach, our method reduces unphysical variability over land and ocean and shows closer agreement with independent ground-based observations from the Total Carbon Column Observing Network. The RF-bias correction is suitable for integration into the operational processing pipeline for the next version of OCO-2 products, pending additional testing and validation. It is inherently generalizable to other existing and planned greenhouse gas monitoring missions. This paper (Part 1) describes the RF bias correction, while a second paper (Part 2) describes the development of a data filtering strategy specifically designed for a subset of retrievals exhibiting irreducible errors that remain inadequately corrected by the ML bias correction.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 7","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2025EA004328","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2025EA004328","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
The Orbiting Carbon Observatory-2 (OCO-2) makes space-based radiance measurements of reflected sunlight. Using a physics-based retrieval algorithm, these measurements are inverted to estimate column-averaged atmospheric carbon dioxide dry-air mole fractions (XCO2). However, biases are present in the retrieved XCO2 due to sensor calibration errors and discrepancies between the physics-based retrieval and nature. We propose a Random Forest (RF), a non-linear, interpretable machine learning (ML) technique, to correct these biases. The approach is rigorously validated, comes with quantified uncertainties, and is derived independent of carbon flux models. Compared to the operational approach, our method reduces unphysical variability over land and ocean and shows closer agreement with independent ground-based observations from the Total Carbon Column Observing Network. The RF-bias correction is suitable for integration into the operational processing pipeline for the next version of OCO-2 products, pending additional testing and validation. It is inherently generalizable to other existing and planned greenhouse gas monitoring missions. This paper (Part 1) describes the RF bias correction, while a second paper (Part 2) describes the development of a data filtering strategy specifically designed for a subset of retrievals exhibiting irreducible errors that remain inadequately corrected by the ML bias correction.
期刊介绍:
Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.