A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins

IF 2.6 3区 地球科学 Q2 GEOCHEMISTRY & GEOPHYSICS
Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash
{"title":"A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins","authors":"Mahsa Hajihosseinlou ,&nbsp;Abbas Maghsoudi ,&nbsp;Reza Ghezelbash","doi":"10.1016/j.chemer.2024.126094","DOIUrl":null,"url":null,"abstract":"<div><p>The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT Pb<img>Zn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT Pb<img>Zn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.</p></div>","PeriodicalId":55973,"journal":{"name":"Chemie Der Erde-Geochemistry","volume":"84 2","pages":"Article 126094"},"PeriodicalIF":2.6000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemie Der Erde-Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009281924000187","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT PbZn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT PbZn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.

全面评估 OPTICS、GMM 和 K-means 聚类方法在与样本集水盆地相关的地球化学异常检测中的应用
通过数据驱动的聚类来发现与样本集水盆地(SCB)相关的地球化学异常现象的过程包括一个综合框架,用于识别特定研究区域内表现出独特地球化学属性的区域。点排序以识别聚类结构(OPTICS)方法可作为检测 SCB 中地球化学异常的可靠方法。这归功于该方法能够有效地管理不同的聚类密度,自适应地识别聚类数量,对噪声表现出弹性,并对参数表现出最小的敏感性。本研究比较了 OPTICS 聚类算法与两种传统聚类技术(即高斯混合模型 (GMM) 和 K-means 聚类)的结果。下文将采用期望最大化(EM)技术来训练用于聚类的 GMM。此外,还采用了 Akaike 信息准则(AIC)和贝叶斯信息准则(BIC)这两个有效的统计指标来确定属于 GMM 的最佳成分(聚类)数量。值得注意的是,聚类算法的有效性是通过卡林斯基-哈拉巴什(CH)指数和成功率曲线进一步评估的。在识别伊朗西部瓦尔切地区的 MVT 铅锌异常方面,基于密度的聚类方法 OPTICS 被证实比 K-means 和 GMM 更有效。此外,指定的异常点与地质事实显示出地理空间上的对应关系,并且观察到在靠近 MVT 铅锌矿点的地方更容易发现强异常点。这项工作提出了一种基于 OPTICS 的新型异常检测方法,该方法具有卓越的性能和数据建模效率。主要重点是从不确定分布的样本数据中有效区分地球化学异常。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chemie Der Erde-Geochemistry
Chemie Der Erde-Geochemistry 地学-地球化学与地球物理
CiteScore
7.10
自引率
0.00%
发文量
40
审稿时长
3.0 months
期刊介绍: GEOCHEMISTRY was founded as Chemie der Erde 1914 in Jena, and, hence, is one of the oldest journals for geochemistry-related topics. GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed at broad geosciences audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication, and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences. The following topics are covered by the expertise of the members of the editorial board (see below): -cosmochemistry, meteoritics- igneous, metamorphic, and sedimentary petrology- volcanology- low & high temperature geochemistry- experimental - theoretical - field related studies- mineralogy - crystallography- environmental geosciences- archaeometry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信