Kerri Lu , Dan M. Kluger , Stephen Bates , Sherrie Wang
{"title":"Regression coefficient estimation from remote sensing maps","authors":"Kerri Lu , Dan M. Kluger , Stephen Bates , Sherrie Wang","doi":"10.1016/j.rse.2025.114949","DOIUrl":null,"url":null,"abstract":"<div><div>Regressions are commonly used in environmental science and economics to identify causal or associative relationships between variables. In these settings, remote sensing-derived map products increasingly serve as sources of variables, enabling estimation of effects such as the impact of conservation zones on deforestation. However, the quality of map products varies, and — because maps are outputs of complex machine learning algorithms that take in a variety of remotely sensed variables as inputs — errors are difficult to characterize. Thus, population-level estimators from such maps may be biased. In this paper, we apply prediction-powered inference (PPI) to estimate regression coefficients relating a response variable and covariates to each other. PPI is a method that estimates parameters of interest by using a small amount of randomly sampled ground truth data to correct for bias in large-scale remote sensing map products. Applying PPI across multiple remote sensing use cases in regression coefficient estimation, we find that it results in estimates that are (1) more reliable than using the map product as if it were 100% accurate and (2) have lower uncertainty than using only the ground truth sample data and ignoring the map product. Empirically, we observe effective sample size increases of up to 17-fold using PPI compared to only using ground truth data. This is the first work to estimate remote sensing regression coefficients without assumptions on the structure of map product errors. Data and code are available at <span><span>https://github.com/Earth-Intelligence-Lab/uncertainty-quantification</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"330 ","pages":"Article 114949"},"PeriodicalIF":11.4000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425725003530","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Regressions are commonly used in environmental science and economics to identify causal or associative relationships between variables. In these settings, remote sensing-derived map products increasingly serve as sources of variables, enabling estimation of effects such as the impact of conservation zones on deforestation. However, the quality of map products varies, and — because maps are outputs of complex machine learning algorithms that take in a variety of remotely sensed variables as inputs — errors are difficult to characterize. Thus, population-level estimators from such maps may be biased. In this paper, we apply prediction-powered inference (PPI) to estimate regression coefficients relating a response variable and covariates to each other. PPI is a method that estimates parameters of interest by using a small amount of randomly sampled ground truth data to correct for bias in large-scale remote sensing map products. Applying PPI across multiple remote sensing use cases in regression coefficient estimation, we find that it results in estimates that are (1) more reliable than using the map product as if it were 100% accurate and (2) have lower uncertainty than using only the ground truth sample data and ignoring the map product. Empirically, we observe effective sample size increases of up to 17-fold using PPI compared to only using ground truth data. This is the first work to estimate remote sensing regression coefficients without assumptions on the structure of map product errors. Data and code are available at https://github.com/Earth-Intelligence-Lab/uncertainty-quantification.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.