Advanced machine learning based gold prospectivity mapping in the Dharwar Craton, India: A hybrid knowledge-data driven paradigm integrating ensemble and deep learning

Geosystems and Geoenvironment Pub Date : 2026-05-01 Epub Date: 2025-11-07 DOI:10.1016/j.geogeo.2025.100473
Soumya Mitra , Saptarshi Mallick , Santu Biswas , Kshounish Patra
{"title":"Advanced machine learning based gold prospectivity mapping in the Dharwar Craton, India: A hybrid knowledge-data driven paradigm integrating ensemble and deep learning","authors":"Soumya Mitra ,&nbsp;Saptarshi Mallick ,&nbsp;Santu Biswas ,&nbsp;Kshounish Patra","doi":"10.1016/j.geogeo.2025.100473","DOIUrl":null,"url":null,"abstract":"<div><div>Developing more sophisticated as well as efficient exploration methods to identify the hidden ore bodies are necessary to meet the world wide increasing demand of mineral resources. In this regard, mineral prospectivity mapping (MPM) is crucial. This study undertakes a pioneering effort to apply and compare four machine learning (ML) models—random forest (RF), XGBoost (XGB), support vector classifier (SVC) and artificial neural network (ANN)—for gold prospectivity mapping within the Archean Dharwar Craton, India. The primary goals included the development and evaluation of these models, a systematic assessment of their comparative performance through cross-validation, feature important analysis and ultimately, production of prospectivity map. The culmination of this work is a high-resolution, combined prospectivity map, designed to produce a new prospectivity areas. Diverse geospatial data was meticulously integrated as per mineral system of the area, including geological maps, structural lineaments, geochemical, geophysical and ASTER remote sensing imagery. For model training, 79 known gold occurrences were carefully collected alongside an equal number of selected non-occurrence locations, framing the task as a supervised binary classification problem. Rigorous evaluation, employing 5-fold cross-validation and a 70:30 train-test split, confirmed the exceptional capabilities of these models. XGB and RF consistently emerged as top performers, with impressive AUC-ROC values of 0.9992 and 0.9965, respectively, coupled with high precision, recall and F1-scores and few false positives or negatives. While ANN also showed excellent performance, SVC, though strong, yielded comparatively lower metrics. A detailed feature importance analysis exhibits the positive role of Meta-Basalt, geochemical principal component 1 and Bouguer gravity anomaly and its derivative maps. Success-rate curves vividly illustrated the models' efficiency capturing over 76% of known occurrences within just 20% of the highest-ranked areas shows targeting precision. The generated combined prospectivity map, a robust synthesis from XGB, RF and ANN, based on a stringent consensus criterion validates existing knowledge and precisely delineates high-priority exploration targets, fundamentally reshaping the approach to future mineral exploration.</div></div>","PeriodicalId":100582,"journal":{"name":"Geosystems and Geoenvironment","volume":"5 2","pages":"Article 100473"},"PeriodicalIF":0.0000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geosystems and Geoenvironment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772883825001219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/11/7 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Developing more sophisticated as well as efficient exploration methods to identify the hidden ore bodies are necessary to meet the world wide increasing demand of mineral resources. In this regard, mineral prospectivity mapping (MPM) is crucial. This study undertakes a pioneering effort to apply and compare four machine learning (ML) models—random forest (RF), XGBoost (XGB), support vector classifier (SVC) and artificial neural network (ANN)—for gold prospectivity mapping within the Archean Dharwar Craton, India. The primary goals included the development and evaluation of these models, a systematic assessment of their comparative performance through cross-validation, feature important analysis and ultimately, production of prospectivity map. The culmination of this work is a high-resolution, combined prospectivity map, designed to produce a new prospectivity areas. Diverse geospatial data was meticulously integrated as per mineral system of the area, including geological maps, structural lineaments, geochemical, geophysical and ASTER remote sensing imagery. For model training, 79 known gold occurrences were carefully collected alongside an equal number of selected non-occurrence locations, framing the task as a supervised binary classification problem. Rigorous evaluation, employing 5-fold cross-validation and a 70:30 train-test split, confirmed the exceptional capabilities of these models. XGB and RF consistently emerged as top performers, with impressive AUC-ROC values of 0.9992 and 0.9965, respectively, coupled with high precision, recall and F1-scores and few false positives or negatives. While ANN also showed excellent performance, SVC, though strong, yielded comparatively lower metrics. A detailed feature importance analysis exhibits the positive role of Meta-Basalt, geochemical principal component 1 and Bouguer gravity anomaly and its derivative maps. Success-rate curves vividly illustrated the models' efficiency capturing over 76% of known occurrences within just 20% of the highest-ranked areas shows targeting precision. The generated combined prospectivity map, a robust synthesis from XGB, RF and ANN, based on a stringent consensus criterion validates existing knowledge and precisely delineates high-priority exploration targets, fundamentally reshaping the approach to future mineral exploration.

Abstract Image

印度Dharwar克拉通基于先进机器学习的金矿远景映射:集成集成和深度学习的混合知识数据驱动范式
为了满足世界范围内对矿产资源日益增长的需求,必须开发更先进、更有效的找矿方法来识别隐伏矿体。在这方面,矿产远景测绘(MPM)至关重要。本研究开创性地应用和比较了四种机器学习(ML)模型——随机森林(RF)、XGBoost (XGB)、支持向量分类器(SVC)和人工神经网络(ANN)——用于印度太古代Dharwar克拉通的金矿远景映射。主要目标包括开发和评估这些模型,通过交叉验证对其比较性能进行系统评估,特征重要分析并最终生成前景图。这项工作的最终成果是一张高分辨率的综合远景图,旨在产生一个新的远景区。根据矿区的矿产系统,对包括地质图、构造地貌、地球化学、地球物理和ASTER遥感影像在内的多种地理空间数据进行了精心整合。对于模型训练,我们仔细收集了79个已知的金矿点,以及相同数量的选定的非金矿点,将该任务构建为一个监督二元分类问题。严格的评估,采用5倍交叉验证和70:30训练测试分割,证实了这些模型的卓越能力。XGB和RF一直是表现最好的,AUC-ROC值分别为0.9992和0.9965,加上高精度、召回率和f1得分高,假阳性或阴性少。虽然人工神经网络也表现出出色的性能,但SVC虽然很强,但产生的指标相对较低。详细的特征重要性分析表明,变质玄武岩、地球化学主成分1和布格重力异常及其衍生图在该区的积极作用。成功率曲线生动地说明了模型的效率,在排名最高的20%的区域内捕获了超过76%的已知事件,显示了目标精度。生成的组合远景图是XGB、RF和ANN的强大综合,基于严格的共识标准,验证了现有知识,并精确地划定了高优先级的勘探目标,从根本上重塑了未来矿产勘探的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.70
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书