A robust soft voting ensemble of the isolation forest model, extended isolation forest model and generalized isolation forest model for multivariate geochemical anomaly recognition

IF 3.2 2区 地球科学 Q1 GEOLOGY
Chenyi Zheng , Yongliang Chen , Xudong Du
{"title":"A robust soft voting ensemble of the isolation forest model, extended isolation forest model and generalized isolation forest model for multivariate geochemical anomaly recognition","authors":"Chenyi Zheng ,&nbsp;Yongliang Chen ,&nbsp;Xudong Du","doi":"10.1016/j.oregeorev.2025.106787","DOIUrl":null,"url":null,"abstract":"<div><div>Rapid and effective recognition of metallogenic anomalies from geochemical exploration data is the key to quickly locate potential mineral prospecting areas. Isolation forest (IF) algorithm, extended isolation forest (EIF) algorithm and generalized isolation forest (GIF) algorithm are three advanced unsupervised learning ensemble techniques that can isolate anomalies rapidly and effectively from high-dimensional data. Previous studies have shown that the three unsupervised learning ensembles have high performance and high efficiency in the recognition of multivariate geochemical anomalies. However, they suffer from lack of robustness because of the randomness in isolation tree construction, including random subsampling with replacement and random selection of splitting threshold, which causes unstable anomaly patterns in complex geochemical settings. To solve this problem, a robust soft voting ensemble (SVE) model was built from the IF model, EIF model and GIF model for the recognition of multivariate geochemical anomalies in the Molidawa area (Inner Mongolia, China). The IF model, EIF model and GIF model were built on the interpolated 1:50,000 stream sediment data and used as the base anomaly recognition models. The soft voting algorithm was then used to build the SVE model based on the three base anomaly recognition models. A comparison of the SVE model with the three base anomaly recognition models shows that the SVE model is more robust than the three base anomaly recognition models. The anomalies recognized by the SVE model contain all the known molybdenum deposits and spatially coincide with the molybdenum mineralization controlling factors such as intermediate-acidic magmatic intrusions and faults. Therefore, in geochemical anomaly recognition, soft voting algorithm is a feasible tool to build a robust anomaly recognition ensemble model from a set of base anomaly recognition models lacking robustness.</div></div>","PeriodicalId":19644,"journal":{"name":"Ore Geology Reviews","volume":"185 ","pages":"Article 106787"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ore Geology Reviews","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169136825003476","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Rapid and effective recognition of metallogenic anomalies from geochemical exploration data is the key to quickly locate potential mineral prospecting areas. Isolation forest (IF) algorithm, extended isolation forest (EIF) algorithm and generalized isolation forest (GIF) algorithm are three advanced unsupervised learning ensemble techniques that can isolate anomalies rapidly and effectively from high-dimensional data. Previous studies have shown that the three unsupervised learning ensembles have high performance and high efficiency in the recognition of multivariate geochemical anomalies. However, they suffer from lack of robustness because of the randomness in isolation tree construction, including random subsampling with replacement and random selection of splitting threshold, which causes unstable anomaly patterns in complex geochemical settings. To solve this problem, a robust soft voting ensemble (SVE) model was built from the IF model, EIF model and GIF model for the recognition of multivariate geochemical anomalies in the Molidawa area (Inner Mongolia, China). The IF model, EIF model and GIF model were built on the interpolated 1:50,000 stream sediment data and used as the base anomaly recognition models. The soft voting algorithm was then used to build the SVE model based on the three base anomaly recognition models. A comparison of the SVE model with the three base anomaly recognition models shows that the SVE model is more robust than the three base anomaly recognition models. The anomalies recognized by the SVE model contain all the known molybdenum deposits and spatially coincide with the molybdenum mineralization controlling factors such as intermediate-acidic magmatic intrusions and faults. Therefore, in geochemical anomaly recognition, soft voting algorithm is a feasible tool to build a robust anomaly recognition ensemble model from a set of base anomaly recognition models lacking robustness.
基于隔离林模型、扩展隔离林模型和广义隔离林模型的多变量地球化学异常识别鲁棒软投票集成
从化探资料中快速有效地识别成矿异常是快速定位找矿潜力区的关键。隔离森林(IF)算法、扩展隔离森林(EIF)算法和广义隔离森林(GIF)算法是三种先进的无监督学习集成技术,可以快速有效地从高维数据中分离异常。已有研究表明,这三种无监督学习系统在多变量地球化学异常识别中具有较高的性能和效率。然而,由于隔离树构造的随机性,包括随机替换子采样和随机选择分裂阈值,导致在复杂地球化学环境下异常模式不稳定,导致隔离树鲁棒性不足。为了解决这一问题,利用IF模型、EIF模型和GIF模型构建了一个鲁棒软投票集合(SVE)模型,用于内蒙古莫利达瓦地区多元地球化学异常的识别。以插值后的1:5万水系沉积物数据为基础,建立IF模型、EIF模型和GIF模型作为基础异常识别模型。在三种基本异常识别模型的基础上,采用软投票算法建立SVE模型。将SVE模型与三种基本异常识别模型进行比较,结果表明SVE模型比三种基本异常识别模型具有更好的鲁棒性。SVE模型识别的异常包含了已知的所有钼矿床,并与中酸性岩浆侵入和断裂等钼矿化控制因素在空间上重合。因此,在地球化学异常识别中,软投票算法是一种可行的工具,可以从一组缺乏鲁棒性的基础异常识别模型中构建鲁棒的异常识别集成模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ore Geology Reviews
Ore Geology Reviews 地学-地质学
CiteScore
6.50
自引率
27.30%
发文量
546
审稿时长
22.9 weeks
期刊介绍: Ore Geology Reviews aims to familiarize all earth scientists with recent advances in a number of interconnected disciplines related to the study of, and search for, ore deposits. The reviews range from brief to longer contributions, but the journal preferentially publishes manuscripts that fill the niche between the commonly shorter journal articles and the comprehensive book coverages, and thus has a special appeal to many authors and readers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信