{"title":"Advancing Earth science applications through a semi-automatic monoplotting framework for efficient 3D geo-referencing of monocular oblique visual Data","authors":"Behzad Golparvar, Ruo-Qian Wang","doi":"10.1016/j.cageo.2025.105915","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread availability of high-quality images from smartphones, drones, and digital cameras presents an unprecedented opportunity for global geospatial data collection. However, these images are often captured at oblique angles, making geo-referencing challenging and limiting their usability. Monoplotting, a technique that requires only a single image and a Digital Elevation Model (DEM), addresses these challenges by establishing pixel-level correspondence between imagery and real-world coordinates. However, traditional monoplotting methods are labor-intensive, requiring manual identification of control points in both the image and DEM, as well as manual tuning of camera parameters, which restricts scalability for large-scale databases and near-real-time applications. This paper proposes a novel semi-automatic monoplotting framework that minimizes human intervention. The framework integrates key point detection, geo-referenced 3D point retrieval, regularized gradient-based optimization, pose estimation, back-projection, and pixel mapping to enable efficient geo-referencing. To the best of the authors’ knowledge, this is the first study to incorporate key point detection into monoplotting, and apply regularized gradient-based optimization for camera position and parameter determination, even with unequal numbers of key points from the image and DEM. Numerical experiments with a historical image and a corresponding real-world DEM demonstrate the framework’s effectiveness. The robustness of the method is further evaluated on distorted images, where the distortion strength coefficient is treated as an unknown and estimated through projection optimization. The results confirm the framework’s ability to establish accurate correspondence between the image pixel domain and real-world 3D coordinates. Additionally, integrating machine learning models, such as semantic segmentation, highlights the framework’s advantages in Earth science applications, including snow and glacier characterization.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"200 ","pages":"Article 105915"},"PeriodicalIF":4.2000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300425000652","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The widespread availability of high-quality images from smartphones, drones, and digital cameras presents an unprecedented opportunity for global geospatial data collection. However, these images are often captured at oblique angles, making geo-referencing challenging and limiting their usability. Monoplotting, a technique that requires only a single image and a Digital Elevation Model (DEM), addresses these challenges by establishing pixel-level correspondence between imagery and real-world coordinates. However, traditional monoplotting methods are labor-intensive, requiring manual identification of control points in both the image and DEM, as well as manual tuning of camera parameters, which restricts scalability for large-scale databases and near-real-time applications. This paper proposes a novel semi-automatic monoplotting framework that minimizes human intervention. The framework integrates key point detection, geo-referenced 3D point retrieval, regularized gradient-based optimization, pose estimation, back-projection, and pixel mapping to enable efficient geo-referencing. To the best of the authors’ knowledge, this is the first study to incorporate key point detection into monoplotting, and apply regularized gradient-based optimization for camera position and parameter determination, even with unequal numbers of key points from the image and DEM. Numerical experiments with a historical image and a corresponding real-world DEM demonstrate the framework’s effectiveness. The robustness of the method is further evaluated on distorted images, where the distortion strength coefficient is treated as an unknown and estimated through projection optimization. The results confirm the framework’s ability to establish accurate correspondence between the image pixel domain and real-world 3D coordinates. Additionally, integrating machine learning models, such as semantic segmentation, highlights the framework’s advantages in Earth science applications, including snow and glacier characterization.
期刊介绍:
Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.