Bubryur Kim , Yuvaraj Natarajan , K.R. Sri Preethaa , V. Danushkumar , Ryan Shamet , Jiannan Chen , Rui Xie , Timothy Copeland , Boo Hyun Nam , Jinwoo An
{"title":"Advancements in Sinkhole Remediation: Field data-driven Sinkhole grout volume prediction model via machine learning-based regression Analysis","authors":"Bubryur Kim , Yuvaraj Natarajan , K.R. Sri Preethaa , V. Danushkumar , Ryan Shamet , Jiannan Chen , Rui Xie , Timothy Copeland , Boo Hyun Nam , Jinwoo An","doi":"10.1016/j.aiig.2025.100159","DOIUrl":null,"url":null,"abstract":"<div><div>Sinkhole formation poses a significant geohazard in karst regions, where unpredictable subsurface erosion often necessitates costly grouting for stabilization. Accurate estimation of grout volume remains a persistent challenge due to spatial variability, site-specific conditions, and the limitations of traditional empirical methods. This study introduces a novel machine learning-based regression model for grout volume prediction that integrates cone penetration test (CPT)-derived Sinkhole Resistance Ratio (SRR) values, spatial correlations between CPT and grouting points (GPs), and field-recorded grout volumes from six sinkhole sites in Florida. Three data transformation methods, the Proximal Allocation Method (PAM), the Equitable Distribution Method (EDM), and the Threshold-based Equitable Distribution Method (TEDM), were applied to distribute grout influence across CPTs, with TEDM demonstrating superior predictive performance. Synthetic data augmentation using spline methodology further improved model robustness. A high-degree polynomial regression model, optimized with ridge regularization, achieved high accuracy (R<sup>2</sup> = 0.95; PEV = 0.94) and significantly outperformed existing linear and logarithmic models. Results confirm that lower SRR values correlate with higher grout demand, and the proposed model reliably captures these nonlinear relationships. This research advances sinkhole remediation practice by providing a data-driven, accurate, and generalizable framework for grout volume estimation, enabling more efficient resource allocation and improved project outcomes.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100159"},"PeriodicalIF":4.2000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544125000553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sinkhole formation poses a significant geohazard in karst regions, where unpredictable subsurface erosion often necessitates costly grouting for stabilization. Accurate estimation of grout volume remains a persistent challenge due to spatial variability, site-specific conditions, and the limitations of traditional empirical methods. This study introduces a novel machine learning-based regression model for grout volume prediction that integrates cone penetration test (CPT)-derived Sinkhole Resistance Ratio (SRR) values, spatial correlations between CPT and grouting points (GPs), and field-recorded grout volumes from six sinkhole sites in Florida. Three data transformation methods, the Proximal Allocation Method (PAM), the Equitable Distribution Method (EDM), and the Threshold-based Equitable Distribution Method (TEDM), were applied to distribute grout influence across CPTs, with TEDM demonstrating superior predictive performance. Synthetic data augmentation using spline methodology further improved model robustness. A high-degree polynomial regression model, optimized with ridge regularization, achieved high accuracy (R2 = 0.95; PEV = 0.94) and significantly outperformed existing linear and logarithmic models. Results confirm that lower SRR values correlate with higher grout demand, and the proposed model reliably captures these nonlinear relationships. This research advances sinkhole remediation practice by providing a data-driven, accurate, and generalizable framework for grout volume estimation, enabling more efficient resource allocation and improved project outcomes.