中国南方南岭成矿带钨多金属成矿资源的远景性和不确定性分析:AdaBoost、GBDT 和 XgBoost 算法的比较研究

IF 4.8 2区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY
Tongfei Li, Qinglin Xia, Yongpeng Ouyang, Runling Zeng, Qiankun Liu, Taotao Li
{"title":"中国南方南岭成矿带钨多金属成矿资源的远景性和不确定性分析:AdaBoost、GBDT 和 XgBoost 算法的比较研究","authors":"Tongfei Li, Qinglin Xia, Yongpeng Ouyang, Runling Zeng, Qiankun Liu, Taotao Li","doi":"10.1007/s11053-024-10321-9","DOIUrl":null,"url":null,"abstract":"<p>Supervised machine learning algorithms are utilized to predict undiscovered mineral resources by analyzing the correlation between geological data and mineral deposits. The scarcity of mineralization and the uncertainty arising from the selection of training samples also the accuracy and generalization of such algorithms. This study employed the adaptive boosting (AdaBoost), gradient boosting decision tree (GBDT), and extreme gradient boosting (XgBoost) algorithms to map the prospectivity of tungsten polymetallic mineral resources in the Nanling metallogenic belt. Firstly, the under-sampling and synthetic minority oversampling technique (SMOTE) methods were used to generate training datasets. Secondly, 50 groups of training datasets were generated using under-sampling, and another 50 groups of training datasets were generated using the SMOTE method. These datasets were used to separately train different boosting algorithms in order to assess the uncertainty associated with the selection of negative samples and the generation of positive samples. Finally, the risk–return analysis was used to mitigate uncertainty, and an enhanced prediction–area (P–A) plot was proposed to evaluate the performance. The results indicate that AdaBoost is the least affected by the selection of negative samples, followed by XgBoost. The SMOTE not only enhances the performance of AdaBoost and XgBoost algorithms but it also reduces the uncertainty related to the selection of negative samples and the generation of positive samples. In addition, the enhanced P–A plot can simultaneously account for both prediction accuracy and uncertainty, making it a potential tool for model evaluation. According to the results, eight potential areas with high return and low risk have been identified as priority areas for exploration. This research not only introduces a new method for mineral prospectivity mapping and uncertainty evaluation but also provides guidance for mineral exploration in this region.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"6 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prospectivity and Uncertainty Analysis of Tungsten Polymetallogenic Mineral Resources in the Nanling Metallogenic Belt, South China: A Comparative Study of AdaBoost, GBDT, and XgBoost Algorithms\",\"authors\":\"Tongfei Li, Qinglin Xia, Yongpeng Ouyang, Runling Zeng, Qiankun Liu, Taotao Li\",\"doi\":\"10.1007/s11053-024-10321-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Supervised machine learning algorithms are utilized to predict undiscovered mineral resources by analyzing the correlation between geological data and mineral deposits. The scarcity of mineralization and the uncertainty arising from the selection of training samples also the accuracy and generalization of such algorithms. This study employed the adaptive boosting (AdaBoost), gradient boosting decision tree (GBDT), and extreme gradient boosting (XgBoost) algorithms to map the prospectivity of tungsten polymetallic mineral resources in the Nanling metallogenic belt. Firstly, the under-sampling and synthetic minority oversampling technique (SMOTE) methods were used to generate training datasets. Secondly, 50 groups of training datasets were generated using under-sampling, and another 50 groups of training datasets were generated using the SMOTE method. These datasets were used to separately train different boosting algorithms in order to assess the uncertainty associated with the selection of negative samples and the generation of positive samples. Finally, the risk–return analysis was used to mitigate uncertainty, and an enhanced prediction–area (P–A) plot was proposed to evaluate the performance. The results indicate that AdaBoost is the least affected by the selection of negative samples, followed by XgBoost. The SMOTE not only enhances the performance of AdaBoost and XgBoost algorithms but it also reduces the uncertainty related to the selection of negative samples and the generation of positive samples. In addition, the enhanced P–A plot can simultaneously account for both prediction accuracy and uncertainty, making it a potential tool for model evaluation. According to the results, eight potential areas with high return and low risk have been identified as priority areas for exploration. This research not only introduces a new method for mineral prospectivity mapping and uncertainty evaluation but also provides guidance for mineral exploration in this region.</p>\",\"PeriodicalId\":54284,\"journal\":{\"name\":\"Natural Resources Research\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2024-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Resources Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s11053-024-10321-9\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-024-10321-9","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

有监督的机器学习算法通过分析地质数据与矿床之间的相关性来预测未发现的矿产资源。矿化的稀缺性和训练样本选择的不确定性也影响了此类算法的准确性和普适性。本研究采用自适应提升(AdaBoost)、梯度提升决策树(GBDT)和极梯度提升(XgBoost)算法对南岭成矿带钨多金属矿产资源的远景进行了测绘。首先,采用欠采样和合成少数超采样技术(SMOTE)方法生成训练数据集。其次,使用欠采样生成 50 组训练数据集,使用 SMOTE 方法生成另外 50 组训练数据集。这些数据集分别用于训练不同的提升算法,以评估与选择负样本和生成正样本相关的不确定性。最后,使用风险收益分析来减少不确定性,并提出了一个增强的预测面积(P-A)图来评估性能。结果表明,AdaBoost 受负样本选择的影响最小,其次是 XgBoost。SMOTE 不仅提高了 AdaBoost 和 XgBoost 算法的性能,还减少了与负样本选择和正样本生成相关的不确定性。此外,增强的 P-A 图可以同时考虑预测准确性和不确定性,使其成为模型评估的潜在工具。根据研究结果,八个高回报、低风险的潜在领域被确定为优先探索领域。这项研究不仅为矿产远景测绘和不确定性评价引入了一种新方法,还为该地区的矿产勘探提供了指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Prospectivity and Uncertainty Analysis of Tungsten Polymetallogenic Mineral Resources in the Nanling Metallogenic Belt, South China: A Comparative Study of AdaBoost, GBDT, and XgBoost Algorithms

Prospectivity and Uncertainty Analysis of Tungsten Polymetallogenic Mineral Resources in the Nanling Metallogenic Belt, South China: A Comparative Study of AdaBoost, GBDT, and XgBoost Algorithms

Supervised machine learning algorithms are utilized to predict undiscovered mineral resources by analyzing the correlation between geological data and mineral deposits. The scarcity of mineralization and the uncertainty arising from the selection of training samples also the accuracy and generalization of such algorithms. This study employed the adaptive boosting (AdaBoost), gradient boosting decision tree (GBDT), and extreme gradient boosting (XgBoost) algorithms to map the prospectivity of tungsten polymetallic mineral resources in the Nanling metallogenic belt. Firstly, the under-sampling and synthetic minority oversampling technique (SMOTE) methods were used to generate training datasets. Secondly, 50 groups of training datasets were generated using under-sampling, and another 50 groups of training datasets were generated using the SMOTE method. These datasets were used to separately train different boosting algorithms in order to assess the uncertainty associated with the selection of negative samples and the generation of positive samples. Finally, the risk–return analysis was used to mitigate uncertainty, and an enhanced prediction–area (P–A) plot was proposed to evaluate the performance. The results indicate that AdaBoost is the least affected by the selection of negative samples, followed by XgBoost. The SMOTE not only enhances the performance of AdaBoost and XgBoost algorithms but it also reduces the uncertainty related to the selection of negative samples and the generation of positive samples. In addition, the enhanced P–A plot can simultaneously account for both prediction accuracy and uncertainty, making it a potential tool for model evaluation. According to the results, eight potential areas with high return and low risk have been identified as priority areas for exploration. This research not only introduces a new method for mineral prospectivity mapping and uncertainty evaluation but also provides guidance for mineral exploration in this region.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Natural Resources Research
Natural Resources Research Environmental Science-General Environmental Science
CiteScore
11.90
自引率
11.10%
发文量
151
期刊介绍: This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信