{"title":"Benchmarking data handling strategies for landslide susceptibility modeling using random forest workflows","authors":"Guruh Samodra , Ngadisih , Ferman Setia Nugroho","doi":"10.1016/j.aiig.2024.100093","DOIUrl":"10.1016/j.aiig.2024.100093","url":null,"abstract":"<div><div>Machine learning (ML) algorithms are frequently used in landslide susceptibility modeling. Different data handling strategies may generate variations in landslide susceptibility modeling, even when using the same ML algorithm. This research aims to compare the combinations of inventory data handling, cross validation (CV), and hyperparameter tuning strategies to generate landslide susceptibility maps. The results are expected to provide a general strategy for landslide susceptibility modeling using ML techniques. The authors employed eight landslide inventory data handling scenarios to convert a landslide polygon into a landslide point, i.e., the landslide point is located on the toe (minimum height), on the scarp (maximum height), at the center of the landslide, randomly inside the polygon (1 point), randomly inside the polygon (3 points), randomly inside the polygon (5 points), randomly inside the polygon (10 points), and 15 m grid sampling. Random forest models using CV–nonspatial hyperparameter tuning, spatial CV–spatial hyperparameter tuning, and spatial CV–forward feature selection–no hyperparameter tuning were applied for each data handling strategy. The combination generated 24 random forest ML workflows, which are applied using a complete inventory of 743 landslides triggered by Tropical Cyclone Cempaka (2017) in Pacitan Regency, Indonesia, and 11 landslide controlling factors. The results show that grid sampling with spatial CV and spatial hyperparameter tuning is favorable because the strategy can minimize overfitting, generate a relatively high-performance predictive model, and reduce the appearance of susceptibility artifacts in the landslide area. Careful data inventory handling, CV, and hyperparameter tuning strategies should be considered in landslide susceptibility modeling to increase the applicability of landslide susceptibility maps in practical application.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100093"},"PeriodicalIF":0.0,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142532855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haiying Fu , Shuai Wang , Guicheng He , Zhonghua Zhu , Qing Yu , Dexin Ding
{"title":"A 3D convolutional neural network model with multiple outputs for simultaneously estimating the reactive transport parameters of sandstone from its CT images","authors":"Haiying Fu , Shuai Wang , Guicheng He , Zhonghua Zhu , Qing Yu , Dexin Ding","doi":"10.1016/j.aiig.2024.100092","DOIUrl":"10.1016/j.aiig.2024.100092","url":null,"abstract":"<div><div>Porosity, tortuosity, specific surface area (SSA), and permeability are four key parameters of reactive transport modeling in sandstone, which are important for understanding solute transport and geochemical reaction processes in sandstone aquifers. These four parameters reflect the characteristics of pore structure of sandstone from different perspectives, and the traditional empirical formulas cannot make accurate predictions of them due to their complexity and heterogeneity. In this paper, eleven types of sandstone CT images were firstly segmented into numerous subsample images, the porosity, tortuosity, SSA, and permeability of the subsamples were calculated, and the dataset was established. The 3D convolutional neural network (CNN) models were subsequently established and trained to predict the key reactive transport parameters based on subsample CT images of sandstones. The results demonstrated that the 3D CNN model with multiple outputs exhibited excellent prediction ability for the four parameters compared to the traditional empirical formulas. In particular, for the prediction of tortuosity and permeability, the 3D CNN model with multiple outputs even showed slightly better prediction ability than its single-output variant model. Additionally, it demonstrated good generalization performance on sandstone CT images not included in the training dataset. The study showed that the 3D CNN model with multiple outputs has the advantages of simplifying operation and saving computational resources, which has the prospect of popularization and application.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100092"},"PeriodicalIF":0.0,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142318693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Binsen Xu , Zhou Feng , Jun Zhou , Rongbo Shao , Hongliang Wu , Peng Liu , Han Tian , Weizhong Li , Lizhi Xiao
{"title":"Transfer learning for well logging formation evaluation using similarity weights","authors":"Binsen Xu , Zhou Feng , Jun Zhou , Rongbo Shao , Hongliang Wu , Peng Liu , Han Tian , Weizhong Li , Lizhi Xiao","doi":"10.1016/j.aiig.2024.100091","DOIUrl":"10.1016/j.aiig.2024.100091","url":null,"abstract":"<div><div>Machine learning has been widely applied in well logging formation evaluation studies. However, several challenges negatively impacted the generalization capabilities of machine learning models in practical implementations, such as the mismatch of data domain between training and testing datasets, imbalances among sample categories, and inadequate representation of data model. These issues have led to substantial insufficient identification for reservoir and significant deviations in subsequent evaluations. To improve the transferability of machine learning models within limited sample sets, this study proposes a weight transfer learning framework based on the similarity of the labels. The similarity weighting method includes both hard weights and soft weights. By evaluating the similarity between test and training sets of logging data, the similarity results are used to estimate the weights of training samples, thereby optimizing the model learning process. We develop a double experts' network and a bidirectional gated neural network based on hierarchical attention and multi-head attention (BiGRU-MHSA) for well logs reconstruction and lithofacies classification tasks. Oil field data results for the shale strata in the Gulong area of the Songliao Basin of China indicate that the double experts’ network model performs well in curve reconstruction tasks. However, it may not be effective in lithofacies classification tasks, while BiGRU-MHSA performs well in that area. In the study of constructing large-scale well logging processing and formation interpretation models, it is maybe more beneficial by employing different expert models for combined evaluations. In addition, although the improvement is limited, hard or soft weighting methods is better than unweighted (i.e., average-weighted) in significantly different adjacent wells. The code and data are open and available for subsequent studies on other lithofacies layers.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100091"},"PeriodicalIF":0.0,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142323362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhanced permeability prediction in porous media using particle swarm optimization with multi-source integration","authors":"Zhiping Chen , Jia Zhang , Daren Zhang , Xiaolin Chang , Wei Zhou","doi":"10.1016/j.aiig.2024.100090","DOIUrl":"10.1016/j.aiig.2024.100090","url":null,"abstract":"<div><p>Accurately and efficiently predicting the permeability of porous media is essential for addressing a wide range of hydrogeological issues. However, the complexity of porous media often limits the effectiveness of individual prediction methods. This study introduces a novel Particle Swarm Optimization-based Permeability Integrated Prediction model (PSO-PIP), which incorporates a particle swarm optimization algorithm enhanced with dynamic clustering and adaptive parameter tuning (KGPSO). The model integrates multi-source data from the Lattice Boltzmann Method (LBM), Pore Network Modeling (PNM), and Finite Difference Method (FDM). By assigning optimal weight coefficients to the outputs of these methods, the model minimizes deviations from actual values and enhances permeability prediction performance. Initially, the computational performances of the LBM, PNM, and FDM are comparatively analyzed on datasets consisting of sphere packings and real rock samples. It is observed that these methods exhibit computational biases in certain permeability ranges. The PSO-PIP model is proposed to combine the strengths of each computational approach and mitigate their limitations. The PSO-PIP model consistently produces predictions that are highly congruent with actual permeability values across all prediction intervals, significantly enhancing prediction accuracy. The outcomes of this study provide a new tool and perspective for the comprehensive, rapid, and accurate prediction of permeability in porous media.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100090"},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000315/pdfft?md5=b1d09a2cb0aeba96843adc16601b4089&pid=1-s2.0-S2666544124000315-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data","authors":"Bas Peters , Eldad Haber , Keegan Lensink","doi":"10.1016/j.aiig.2024.100087","DOIUrl":"10.1016/j.aiig.2024.100087","url":null,"abstract":"<div><p>The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with network depth. This work focuses on a fully invertible network based on the telegraph equation. While reversibility saves the major amount of memory used in deep networks by the data, the convolutional kernels can take up most memory if fully invertible networks contain multiple invertible pooling/coarsening layers. We address the explosion of the number of convolutional kernels by combining fully invertible networks with layers that contain the convolutional kernels in a compressed form directly. A second challenge is that invertible networks output a tensor the same size as its input. This property prevents the straightforward application of invertible networks to applications that map between different input–output dimensions, need to map to outputs with more channels than present in the input data, or desire outputs that decrease/increase the resolution compared to the input data. However, we show that by employing invertible networks in a non-standard fashion, we can still use them for these tasks. Examples in hyperspectral land-use classification, airborne geophysical surveying, and seismic imaging illustrate that we can input large data volumes in one chunk and do not need to work on small patches, use dimensionality reduction, or employ methods that classify a patch to a single central pixel.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100087"},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000285/pdfft?md5=aefb3645cc92ad5ad25d7d3f97a32057&pid=1-s2.0-S2666544124000285-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142087583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"EQGraphNet: Advancing single-station earthquake magnitude estimation via deep graph networks with residual connections","authors":"Zhiguo Wang , Ziwei Chen , Huai Zhang","doi":"10.1016/j.aiig.2024.100089","DOIUrl":"10.1016/j.aiig.2024.100089","url":null,"abstract":"<div><p>Magnitude estimation is a critical task in seismology, and conventional methods usually require dense seismic station arrays to provide data with sufficient spatiotemporal distribution. In this context, we propose the Earthquake Graph Network (EQGraphNet) to enhance the performance of single-station magnitude estimation. The backbone of the proposed model consists of eleven convolutional neural network layers and ten RCGL modules, where a RCGL combines a Residual Connection and a Graph convolutional Layer capable of mitigating the over-smoothing problem and simultaneously extracting temporal features of seismic signals. Our work uses the STanford EArthquake Dataset for model training and performance testing. Compared with three existing deep learning models, EQGraphNet demonstrates improved accuracy for both local magnitude and duration magnitude scales. To evaluate the robustness, we add natural background noise to the model input and find that EQGraphNet achieves the best results, particularly for signals with lower signal-to-noise ratios. Additionally, by replacing various network components and comparing their estimation performances, we illustrate the contribution of each part of EQGraphNet, validating the rationality of our approach. We also demonstrate the generalization capability of our model across different earthquakes occurring environments, achieving mean errors of <span><math><mo>±</mo></math></span>0.1 units. Furthermore, by demonstrating the effectiveness of deeper architectures, this work encourages further exploration of deeper GNN models for both multi-station and single-station magnitude estimation.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100089"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000303/pdfft?md5=c749cb3c43c7b43360a083719cc07ae7&pid=1-s2.0-S2666544124000303-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gayantha R.L. Kodikara , Lindsay J. McHenry , Ian G. Stanistreet , Harald Stollhofen , Jackson K. Njau , Nicholas Toth , Kathy Schick
{"title":"Wide & deep learning for predicting relative mineral compositions of sediment cores solely based on XRF scans, a case study from Pleistocene Paleolake Olduvai, Tanzania","authors":"Gayantha R.L. Kodikara , Lindsay J. McHenry , Ian G. Stanistreet , Harald Stollhofen , Jackson K. Njau , Nicholas Toth , Kathy Schick","doi":"10.1016/j.aiig.2024.100088","DOIUrl":"10.1016/j.aiig.2024.100088","url":null,"abstract":"<div><p>This study develops a method to use deep learning models to predict the mineral assemblages and their relative abundances in paleolake cores using high-resolution XRF core scan elemental data and X-ray diffraction (XRD) mineralogical results from the same core taken at coarser resolution. It uses the XRF core scan data along with published mineralogical information from the Olduvai Gorge Coring Project (OGCP) 2014 sediment cores 1A, 2A, and 3A from Paleolake Olduvai, Tanzania. Both regression and classification models were developed using a Keras deep learning framework to assess the predictability of mineral assemblages with their relative abundances (in regression models) or at least the mineral assemblages (in classification models) using XRF core scan data. Models were created using the Sequential class and Functional API with different model architectures. The correlation matrix of element ratios calculated from XRF element intensity records from the cores and XRD-derived mineralogical information was used to select the most useful features to train the models. 1057 training data records were used for the models. Lithological classes were also used for some models using Wide & Deep neural networks since those combine the benefits of memorization and generalization for mineral prediction. The results were validated using 265 validation data records unseen by the model and discuss the accuracy of models using six test records. The optimized Deep Neural Network (DNN) classification model achieved over 86% binary accuracy while the regression models were also able to predict the relative mineral abundances of samples with high accuracies. Overall, the study shows the efficacy of a carefully crafted Deep Learning (DL) model for predicting mineral assemblages and abundances using high-resolution XRF core scan data.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100088"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000297/pdfft?md5=8bd7402c96f4a311b4dbf3ffa0c2ef1b&pid=1-s2.0-S2666544124000297-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142075965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abolfazl Komeazi , Georg Rümpker , Johannes Faber , Fabian Limberger , Nishtha Srivastava
{"title":"When linear inversion fails: Neural-network optimization for sparse-ray travel-time tomography of a volcanic edifice","authors":"Abolfazl Komeazi , Georg Rümpker , Johannes Faber , Fabian Limberger , Nishtha Srivastava","doi":"10.1016/j.aiig.2024.100086","DOIUrl":"10.1016/j.aiig.2024.100086","url":null,"abstract":"<div><p>In this study, we present an artificial neural network (ANN)-based approach for travel-time tomography of a volcanic edifice under sparse-ray coverage. We employ ray tracing to simulate the propagation of seismic waves through the heterogeneous medium of a volcanic edifice, and an inverse modeling algorithm that uses an ANN to estimate the velocity structure from the “observed” travel-time data. The performance of the approach is evaluated through a 2-dimensional numerical study that simulates i) an active source seismic experiment with a few (explosive) sources placed on one side of the edifice and a dense line of receivers placed on the other side, and ii) earthquakes located inside the edifice with receivers placed on both sides of the edifice. The results are compared with those obtained from conventional damped linear inversion. The average Root Mean Square Error (RMSE) between the input and output models is approximately 0.03 km/s for the ANN inversions, whereas it is about 0.4 km/s for the linear inversions, demonstrating that the ANN-based approach outperforms the classical approach, particularly in situations with sparse ray coverage. Our study emphasizes the advantages of employing a relatively simple ANN architecture in conjunction with second-order optimizers to minimize the loss function. Compared to using first-order optimizers, our ANN architecture shows a ∼25% reduction in RMSE. The ANN-based approach is computationally efficient. We observed that even though the ANN is trained based on completely random velocity models, it is still capable of resolving previously unseen anomalous structures within the edifice with about 5% anomalous discrepancies, making it a potentially valuable tool for the detection of low velocity anomalies related to magmatic intrusions or mush.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100086"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000273/pdfft?md5=7b60a887781291fb9c1bbd214c747929&pid=1-s2.0-S2666544124000273-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chanjuan Liu , Jing Xu , Xi’an Li , Zhongyao Yu , Jinran Wu
{"title":"Water resource forecasting with machine learning and deep learning: A scientometric analysis","authors":"Chanjuan Liu , Jing Xu , Xi’an Li , Zhongyao Yu , Jinran Wu","doi":"10.1016/j.aiig.2024.100084","DOIUrl":"10.1016/j.aiig.2024.100084","url":null,"abstract":"<div><p>Water prediction plays a crucial role in modern-day water resource management, encompassing both hydrological patterns and demand forecasts. To gain insights into its current focus, status, and emerging themes, this study analyzed 876 articles published between 2015 and 2022, retrieved from the Web of Science database. Leveraging CiteSpace visualization software, bibliometric techniques, and literature review methodologies, the investigation identified essential literature related to water prediction using machine learning and deep learning approaches. Through a comprehensive analysis, the study identified significant countries, institutions, authors, journals, and keywords in this field. By exploring this data, the research mapped out prevailing trends and cutting-edge areas, providing valuable insights for researchers and practitioners involved in water prediction through machine learning and deep learning. The study aims to guide future inquiries by highlighting key research domains and emerging areas of interest.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100084"},"PeriodicalIF":0.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266654412400025X/pdfft?md5=8bb63629925bdc6599eb399ca1cbfe94&pid=1-s2.0-S266654412400025X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke
{"title":"Exploring emerald global geochemical provenance through fingerprinting and machine learning methods","authors":"Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke","doi":"10.1016/j.aiig.2024.100085","DOIUrl":"10.1016/j.aiig.2024.100085","url":null,"abstract":"<div><p>Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (<1 wt%) and trace levels (<1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100085"},"PeriodicalIF":0.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000261/pdfft?md5=8ac6027d08bf9d1a8618e5ecdb9f25b3&pid=1-s2.0-S2666544124000261-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}