A semi-supervised learning framework for intelligent mineral prospectivity mapping: Incorporation of the CatBoost and Gaussian mixture model algorithms
Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash
{"title":"A semi-supervised learning framework for intelligent mineral prospectivity mapping: Incorporation of the CatBoost and Gaussian mixture model algorithms","authors":"Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash","doi":"10.1016/j.gexplo.2025.107755","DOIUrl":null,"url":null,"abstract":"<div><div>Semi-supervised learning warrants more significant consideration for machine learning-based mapping in mineral exploration, since mineral deposits frequently exhibit imbalances in occurrence frequencies. It can potentially address challenges associated with class imbalances via the efficient use of labeled data and the extrapolation of patterns from unlabeled data. This research endeavors to present a prospective model for Mississippi Valley-Type lead and zinc deposits employing a semi-supervised approach in the Varcheh district, western Iran. To achieve this goal, diverse exploratory criteria related to mineralization, encompassing geological, remote sensing, geochemical, and structural layers, have been incorporated to develop a semi-supervised mineral prospectivity model. The model strengthens the advantages of supervised and unsupervised learning approaches by incorporating the Categorical gradient Boosting (CatBoost) and Gaussian mixture model algorithms into a semi-supervised framework. This approach effectively utilizes limited labeled data, while capturing spatial patterns and relationships in the unlabeled dataset, ultimately contributing to a more robust mineral prospectivity mapping model. Indeed, the regions with high posterior probability include most lead and zinc deposits in this strategy, suggesting that the locations of known deposits are significantly tied to areas connected to high posterior probability. The semi-supervised proposed framework in this paper is also compared with supervised approach to validate the performance improvement. The implemented approach can be highly valuable for exploring resources.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"274 ","pages":"Article 107755"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674225000871","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Semi-supervised learning warrants more significant consideration for machine learning-based mapping in mineral exploration, since mineral deposits frequently exhibit imbalances in occurrence frequencies. It can potentially address challenges associated with class imbalances via the efficient use of labeled data and the extrapolation of patterns from unlabeled data. This research endeavors to present a prospective model for Mississippi Valley-Type lead and zinc deposits employing a semi-supervised approach in the Varcheh district, western Iran. To achieve this goal, diverse exploratory criteria related to mineralization, encompassing geological, remote sensing, geochemical, and structural layers, have been incorporated to develop a semi-supervised mineral prospectivity model. The model strengthens the advantages of supervised and unsupervised learning approaches by incorporating the Categorical gradient Boosting (CatBoost) and Gaussian mixture model algorithms into a semi-supervised framework. This approach effectively utilizes limited labeled data, while capturing spatial patterns and relationships in the unlabeled dataset, ultimately contributing to a more robust mineral prospectivity mapping model. Indeed, the regions with high posterior probability include most lead and zinc deposits in this strategy, suggesting that the locations of known deposits are significantly tied to areas connected to high posterior probability. The semi-supervised proposed framework in this paper is also compared with supervised approach to validate the performance improvement. The implemented approach can be highly valuable for exploring resources.
期刊介绍:
Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics.
Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to:
define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas.
analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation.
evaluate effects of historical mining activities on the surface environment.
trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices.
assess and quantify natural and technogenic radioactivity in the environment.
determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis.
assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches.
Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.