{"title":"机器学习辅助增强了英国西南部Cornubian岩基裂缝Variscan花岗岩的岩石物性数据集","authors":"A. Turan , E. Artun , I. Sass","doi":"10.1016/j.aiig.2025.100151","DOIUrl":null,"url":null,"abstract":"<div><div>Outcrop analogue studies play an important role in advancing our comprehension of reservoir architectures, offering insights into hidden reservoir rocks prior to drilling, in a cost-effective manner. These studies contribute to the delineation of the three-dimensional geometry of geological structures, the characterization of petro- and thermo-physical properties, and the structural geological aspects of reservoir rocks. Nevertheless, several challenges, including inaccessible sampling sites, limited resources, and the dimensional constraints of different laboratories hinder the acquisition of comprehensive datasets. In this study, we employ machine learning techniques to estimate missing data in a petrophysical dataset of fractured Variscan granites from the Cornubian Batholith in Southwest UK. The utilization of mean, k-nearest neighbors, and random forest imputation methods addresses the challenge of missing data, thereby revealing the effectiveness of random forest imputation in providing realistic estimations. Subsequently, supervised classification models are trained to classify samples according to their pluton origins, with promising accuracy achieved by models trained with imputed values. Variable importance ranking of the models showed that the choice of imputation method influences the inferred importance of specific petrophysical properties. While porosity (POR) and grain density (GD) were among important variables, variables with high missingness ratio were not among the top variables. This study demonstrates the value of machine learning in enhancing petrophysical datasets, while emphasizing the importance of careful method selection and model validation for reliable results. The findings contribute to a more informed decision-making process in geothermal exploration and reservoir characterization efforts, thereby demonstrating the potential of machine learning in advancing subsurface characterization techniques.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100151"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning assisted enhancement of petrophysical property dataset of fractured Variscan granites of the Cornubian Batholith, SW UK\",\"authors\":\"A. Turan , E. Artun , I. Sass\",\"doi\":\"10.1016/j.aiig.2025.100151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Outcrop analogue studies play an important role in advancing our comprehension of reservoir architectures, offering insights into hidden reservoir rocks prior to drilling, in a cost-effective manner. These studies contribute to the delineation of the three-dimensional geometry of geological structures, the characterization of petro- and thermo-physical properties, and the structural geological aspects of reservoir rocks. Nevertheless, several challenges, including inaccessible sampling sites, limited resources, and the dimensional constraints of different laboratories hinder the acquisition of comprehensive datasets. In this study, we employ machine learning techniques to estimate missing data in a petrophysical dataset of fractured Variscan granites from the Cornubian Batholith in Southwest UK. The utilization of mean, k-nearest neighbors, and random forest imputation methods addresses the challenge of missing data, thereby revealing the effectiveness of random forest imputation in providing realistic estimations. Subsequently, supervised classification models are trained to classify samples according to their pluton origins, with promising accuracy achieved by models trained with imputed values. Variable importance ranking of the models showed that the choice of imputation method influences the inferred importance of specific petrophysical properties. While porosity (POR) and grain density (GD) were among important variables, variables with high missingness ratio were not among the top variables. This study demonstrates the value of machine learning in enhancing petrophysical datasets, while emphasizing the importance of careful method selection and model validation for reliable results. The findings contribute to a more informed decision-making process in geothermal exploration and reservoir characterization efforts, thereby demonstrating the potential of machine learning in advancing subsurface characterization techniques.</div></div>\",\"PeriodicalId\":100124,\"journal\":{\"name\":\"Artificial Intelligence in Geosciences\",\"volume\":\"6 2\",\"pages\":\"Article 100151\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Geosciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666544125000474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544125000474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine learning assisted enhancement of petrophysical property dataset of fractured Variscan granites of the Cornubian Batholith, SW UK
Outcrop analogue studies play an important role in advancing our comprehension of reservoir architectures, offering insights into hidden reservoir rocks prior to drilling, in a cost-effective manner. These studies contribute to the delineation of the three-dimensional geometry of geological structures, the characterization of petro- and thermo-physical properties, and the structural geological aspects of reservoir rocks. Nevertheless, several challenges, including inaccessible sampling sites, limited resources, and the dimensional constraints of different laboratories hinder the acquisition of comprehensive datasets. In this study, we employ machine learning techniques to estimate missing data in a petrophysical dataset of fractured Variscan granites from the Cornubian Batholith in Southwest UK. The utilization of mean, k-nearest neighbors, and random forest imputation methods addresses the challenge of missing data, thereby revealing the effectiveness of random forest imputation in providing realistic estimations. Subsequently, supervised classification models are trained to classify samples according to their pluton origins, with promising accuracy achieved by models trained with imputed values. Variable importance ranking of the models showed that the choice of imputation method influences the inferred importance of specific petrophysical properties. While porosity (POR) and grain density (GD) were among important variables, variables with high missingness ratio were not among the top variables. This study demonstrates the value of machine learning in enhancing petrophysical datasets, while emphasizing the importance of careful method selection and model validation for reliable results. The findings contribute to a more informed decision-making process in geothermal exploration and reservoir characterization efforts, thereby demonstrating the potential of machine learning in advancing subsurface characterization techniques.