{"title":"Use of AI tools to understand and model surface-interaction based EOR processes","authors":"Tony Thomas, P. Sharma, Dharmendra Kumar","doi":"10.1016/j.acags.2022.100111","DOIUrl":"https://doi.org/10.1016/j.acags.2022.100111","url":null,"abstract":"","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"1 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"53922107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing reservoir porosity prediction from acoustic impedance and lithofacies using a weighted ensemble deep learning approach","authors":"Munezero Ntibahanana , Moïse Luemba , Keto Tondozi","doi":"10.1016/j.acags.2022.100106","DOIUrl":"https://doi.org/10.1016/j.acags.2022.100106","url":null,"abstract":"<div><p>Inferring underground porosity and evaluating its spatial distribution is of great significance in a wide range of Earth sciences and engineering, including hydrocarbon reservoir characterization and geothermal energy exploitation. Popular methods are largely based on the analysis of lithological cores, well logs, and seismic inversion. These methods are reliable, but they are still time-consuming, expensive, and difficult to conduct. In addition, seismic inversion confronts problems of nonlinearity and has multiple solutions. However, deep learning (DL) can provide a more flexible, efficient, and accurate capability, mapping directly from acoustic impedance and lithofacies data to porosity. To prove the point, in this paper, we trained an ensemble of DL models and then proposed a weight combination of every single trained model’s strength to improve the result. We evaluated the method's reliability using a number of metrics. Further, we compared it with traditional ones. The weighted ensemble resulted in a lower error than the simple ensemble and the single model. Its spatial distribution map showed the best connectivity with that of historical porosity. Finally, we tested our method's effectiveness using a dataset that was used in a previously published study. Our method improved the prediction of the latter.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100106"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000283/pdfft?md5=1185ea453aa66de2b8376d1e57b80fcd&pid=1-s2.0-S2590197422000283-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137187461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Przemysław Juda , Philippe Renard , Julien Straubhaar
{"title":"A parsimonious parametrization of the Direct Sampling algorithm for multiple-point statistical simulations","authors":"Przemysław Juda , Philippe Renard , Julien Straubhaar","doi":"10.1016/j.acags.2022.100091","DOIUrl":"10.1016/j.acags.2022.100091","url":null,"abstract":"<div><p>Multiple-point statistics algorithms allow modeling spatial variability from training images. Among these techniques, the Direct Sampling (DS) algorithm has advanced capabilities, such as multivariate simulations, treatment of non-stationarity, multi-resolution capabilities, conditioning by inequality or connectivity data. However, finding the right trade-off between computing time and simulation quality requires tuning three main parameters, which can be complicated since simulation time and quality are affected by these parameters in a complex manner. To facilitate the parameter selection, we propose the Direct Sampling Best Candidate (DSBC) parametrization approach. It consists in setting the distance threshold to 0. The two other parameters are kept (the number of neighbors and the scan fraction) as well as all the advantages of DS. We present three test cases that prove that the DSBC approach allows to identify efficiently parameters leading to comparable or better quality and computational time than the standard DS parametrization. We conclude that the DSBC approach could be used as a default mode when using DS, and that the standard parametrization should only be used when the DSBC approach is not sufficient.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100091"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000131/pdfft?md5=ef0cdd29b7ef9c5061d475c38a70c937&pid=1-s2.0-S2590197422000131-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41692243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A framework for constructing machine learning models with feature set optimisation for evapotranspiration partitioning","authors":"Adam Stapleton , Elke Eichelmann , Mark Roantree","doi":"10.1016/j.acags.2022.100105","DOIUrl":"https://doi.org/10.1016/j.acags.2022.100105","url":null,"abstract":"<div><p>A deeper understanding of the drivers of evapotranspiration and the modelling of its constituent parts (evaporation and transpiration) may be of significant importance to the monitoring and management of water resources globally over the coming decades. In this work a framework was developed to identify the best performing machine learning algorithm from a candidate set, select optimal predictive features and rank features in terms of their importance to predictive accuracy. The experiments conducted in this work used 3 separate feature sets across 4 wetland sites as input into 8 candidate machine learning algorithms, providing 96 sets of experimental configurations. Given this high number of parameters, our results show strong evidence that there is no singularly optimal machine learning algorithm or feature set across all of the wetland sites studied despite their similarities. At each of the sites at least one model was identified that improved on the predictive performance of our baseline. A key finding discovered when examining feature importance is that methane flux, a feature whose relationship with evapotranspiration is not generally examined, may contribute to further biophysical process understanding. This work demonstrates the applicability of a machine learning framework for evapotranspiration partitioning that is independent of domain knowledge, producing improved models for partitioning and identifying new and useful predictive features.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100105"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000271/pdfft?md5=4bb0fbb0ea2eccd1e035569ab227461d&pid=1-s2.0-S2590197422000271-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137187460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Georg H. Erharter , Thomas Wagner , Gerfried Winkler , Thomas Marcher
{"title":"Machine learning – An approach for consistent rock glacier mapping and inventorying – Example of Austria","authors":"Georg H. Erharter , Thomas Wagner , Gerfried Winkler , Thomas Marcher","doi":"10.1016/j.acags.2022.100093","DOIUrl":"10.1016/j.acags.2022.100093","url":null,"abstract":"<div><p>Rock glaciers (RG) are landforms that occur in high latitudes or elevations and — in their active state — consist of a mixture of rock debris and ice. Despite serving as a form of groundwater storage, they are an indicator for the occurrence of (former) permafrost and therefore carry significance in the research for the ongoing climate change. For these reasons, the past years have shown rising interest in the establishment of RG inventories to investigate the extent of permafrost and quantify water storages. Creating these inventories, however, usually involves manual, laborious, and subjective mapping of the landforms based on aerial image - and digital elevation model analysis. We propose an approach for RG mapping based on supervised machine learning which can help to increase the mapping efficiency and permits rapid RG mapping in vast and not yet covered areas. We found deep convolutional artificial neural networks (ANN) that are specifically designed for image segmentation (U-Net architecture) to be well suited for this classification problem. The general workflow consists of training the ANNs with orthophotos and slope maps of digital elevation models as input. The output (RG label-maps) is derived from a recently published RG inventory of the Austrian Alps that features 5769 individual RGs and was compiled manually by several scientists. To increase the generalization capabilities, we use live data augmentation during training. Based on this inventory, the ANNs have learned the average expert opinion and the RG map generated by the ANN can be used to increase the consistency and completeness of already existing RG inventories. Moreover, this ANN approach might be valuable for other landform mapping tasks beyond rock glaciers (e.g., other mass movements).</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100093"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000155/pdfft?md5=83b1e291fe3fb5fbed0ea26111dfb50b&pid=1-s2.0-S2590197422000155-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43590280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Irida: A machine learning based code for the automated derivation of site-specific rock type logs and their properties using Kimeleon colourlith image logs","authors":"Achyut Mishra , Apoorv Jyoti , Ralf R. Haese","doi":"10.1016/j.acags.2022.100102","DOIUrl":"10.1016/j.acags.2022.100102","url":null,"abstract":"<div><p>High resolution characterization of sub-surface geology is critical to improving the performance of reservoir models in fluid flow and reactive transport simulation studies in the fields of groundwater, CO<sub>2</sub> geo-sequestration and oil and gas research. The modern improvements in wireline logging technology allow for the deduction of depth continuous records of individual rock properties at cm-scale resolution. However, to the best of the authors’ knowledge, no method exists to obtain high-resolution lithotype logs. Traditional methods for rock typing are based on core sample analysis and are therefore limited to discrete depth. The Kimeleon colourlith log output is based on the combination of wireline logs and is probably one of the few tools which creates continuous lithotype logs at high resolution primarily for the purpose of visualization. There is currently no automated method to transform the colourlith images into quantitative rock type logs which can be used as an input for reservoir modelling software. Additionally, colourlith logs are based solely on wireline information and do not incorporate local lithological information from core samples. This study addresses these issues by combining discrete core sample data with continuous colourlith logs. A code (Irida) has been developed to enhance the usability of the Kimeleon software by transforming colourlith logs into high resolution lithotype logs of rock types identified in core plugs. The code takes the colourlith image log as an input along with the names and properties of site-specific rock types identified and measured in discrete core samples. The code then uses k-means clustering, an unsupervised machine learning method, to compute a high-resolution log of rock types and their properties which are continuous with depth. The properties of interest include porosity, anisotropic absolute and relative permeabilities and capillary pressure curves. The code significantly reduces the efforts required for the preparation of lithotype logs.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100102"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000246/pdfft?md5=a27cbcf2c2997fd76c8295d630868122&pid=1-s2.0-S2590197422000246-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43179814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yasmin Uchôa da Silva , Gutemberg Borges França , Heloisa Musetti Ruivo , Haroldo Fraga de Campos Velho
{"title":"Forecast of convective events via hybrid model: WRF and machine learning algorithms","authors":"Yasmin Uchôa da Silva , Gutemberg Borges França , Heloisa Musetti Ruivo , Haroldo Fraga de Campos Velho","doi":"10.1016/j.acags.2022.100099","DOIUrl":"https://doi.org/10.1016/j.acags.2022.100099","url":null,"abstract":"<div><p>This presents a novel hybrid 24-h forecasting model of convective weather events based on numerical simulation and machine learning algorithms. To characterize the convective events, 13-year from 2008 up to 2020 of precipitation data from the main airport stations in Rio de Janeiro, Brazil, and atmospheric discharges from the surrounding area of around 150 km are investigated. The Weather Research and Forecasting (WRF) model was used to numerically simulate atmospheric conditions for every day in February, as it is the month with the greatest daily rate of atmospheric discharge for the data period. The p-value hypothesis test (with <span><math><mrow><mi>α</mi><mspace></mspace><mo>=</mo><mspace></mspace><mn>0.05</mn></mrow></math></span>) was applied to each grid point of the numerically predicted variables (defined as an independent attribute) to find those most associated with convective events using the output of the 3-D WRF grid. This one identified 36 attributes (or predictors) that were used as input in the machine learning algorithms' training-test process in this study. Several cross-validation training and testing experiments were carried out using the nine-selected categorical machine learning algorithms and the 36 defined predictors. After applying the boosting technique to the nine previously trained-tested algorithms, the results of the 24-h predictions of convective occurrences were deemed satisfactory. The RandomForest method produced the best results, with statistics values close to perfection, such as POD = 1.00, FAR = 0.02, and CSI = 0.98. The 24-h hindcast utilizing the nine algorithms for the 28 days of February 2019 was very encouraging because it was able to almost recreate the maturation phase of events and their eventual failures were noted during the formation and dissipation phases. The best and worst 24-h hindcast had POD = 0.97 and 0.88, FAR = 0.02 and 0.12, and CSI = 0.94 and 0.78, respectively.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100099"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000210/pdfft?md5=1d2ce0355dabd75829d084b9b8de2eaf&pid=1-s2.0-S2590197422000210-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137187529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Proxy-based Robust Production Optimization with Multilayer Perceptron","authors":"Cuthbert Shang Wui Ng, Ashkan Jahanbani Ghahfarokhi","doi":"10.1016/j.acags.2022.100103","DOIUrl":"10.1016/j.acags.2022.100103","url":null,"abstract":"<div><p>Machine learning (ML) has been a technique employed to build data-driven models that can map the relationship between the input and output data provided. ML-based data-driven models offer an alternative path to solving optimization problems, which are conventionally resolved by applying simulation models. Higher computational cost is induced if the simulation model is computationally intensive. Such a situation aptly applies to petroleum engineering, especially when different geological realizations of numerical reservoir simulation (NRS) models are considered for production optimization. Therefore, data-driven models are suggested as a substitute for NRS. In this work, we demonstrated how multilayer perceptron could be implemented to build data-driven models based on 10 realizations of the Egg Model. These models were then coupled with two nature-inspired algorithms, viz. particle swarm optimization and grey wolf optimizer to solve waterflooding optimization. These data-driven models were adaptively re-trained by applying a training database that was updated via the addition of extra samples retrieved from optimization with the proxy models. The details of the methodology will be divulged in the paper. According to the results obtained, we could deduce that the methodology generated reliable data-driven models to solve the optimization problem, as justified by the excellent performance of the ML-based proxy model (with a coefficient of determination, R<sup>2</sup> exceeding 0.98 in training, testing, and blind validation) and accurate optimization result (less than 1% error between the Expected Net Present Values optimized using NRS and proxy models). This study aids in an enhanced understanding of implementing adaptive training in tandem with optimization in ML-based proxy modeling.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100103"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000258/pdfft?md5=a172463c1b387d74115d3915a128258e&pid=1-s2.0-S2590197422000258-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44921835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yerniyaz Abildin , Chaoshui Xu , Peter Dowd , Amir Adeli
{"title":"A hybrid framework for modelling domains using quantitative covariates","authors":"Yerniyaz Abildin , Chaoshui Xu , Peter Dowd , Amir Adeli","doi":"10.1016/j.acags.2022.100107","DOIUrl":"10.1016/j.acags.2022.100107","url":null,"abstract":"<div><p>Domains define the boundaries of mineralisation zones, within which the grade distribution of the target minerals can be quantified via an established mineral resource estimation procedure. Available domain modelling techniques include manual interpretation, implicit modelling and advanced geostatistical approaches. In mining applications, the most commonly used method is manual domaining, which is labour-intensive and prone to subjective judgement errors. In addition, the output is largely deterministic and ignores the significant uncertainty associated with the domain interpretation and boundary definitions. There is, therefore, a need for a more objective framework that can automatically define mineral domains and quantify the associated uncertainty. This paper describes such a framework, which consists of a hybrid approach based on simulated grade distributions and a machine learning (ML) classification technique, XGBoost, trained on lithological properties. Data from an Iron Oxide Copper Gold (IOCG) deposit are used as a case study to demonstrate the application of the proposed method. The study shows that the approach can handle complex multi-class problems with imbalanced features, and it can quantify the uncertainty of domain boundaries. A noise filtering method is used as a pre-processing step to improve the performance of the ML classification, especially in the case of problematic classes where domain boundaries are difficult to predict due to the similarity in geological characteristics.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100107"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000295/pdfft?md5=2eddf2c8c4fbf2254243d8f71b09b5b8&pid=1-s2.0-S2590197422000295-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48836983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Contributions of machine learning to quantitative and real-time mud gas data analysis: A critical review","authors":"Fatai Anifowose, Mokhles Mezghani, Saleh Badawood, Javed Ismail","doi":"10.1016/j.acags.2022.100095","DOIUrl":"10.1016/j.acags.2022.100095","url":null,"abstract":"<div><p>The current utility of mud gas data is typically limited to geological and petrophysical correlation, formation evaluation, and fluid typing. A critical and comprehensive review of the literature on mud gas data revealed that the mud gas data is abundantly acquired during drilling but not sufficiently utilized in real time. There is the need to leverage the current advances in machine learning technology and the race towards the digital transformation of the petroleum industry to create new opportunities for more extensive utility of mud gas data. Now that data is the new “oil” or “gold”, the utility of the rich and abundant mud gas data could be explored for real-time applications. Such new possibilities are capable of adding more value to the reservoir characterization workflow ahead of geophysical logging, geological core data analysis, and well testing. Achieving this will facilitate early decision-making, improve safety, reduce nonproductive time, and ultimately accelerate the attainment of the digital transformation objective of the petroleum industry. We conclude with identifying possible future directions for the ultimate attainment of maximizing the utility of mud gas data through real-time and more advanced applications.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100095"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000179/pdfft?md5=05e1d93af07412f49d3f45e78cc62d49&pid=1-s2.0-S2590197422000179-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44840264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}