{"title":"基于集合学习的 UMAP 地球化学异常识别方法","authors":"Qingteng Zhang, Yue Liu, Hao Fang","doi":"10.1016/j.chemer.2024.126157","DOIUrl":null,"url":null,"abstract":"<div><div>Geochemical data are usually high-dimensional data that could contain dozens of elements. Geochemical distribution patterns and anomalies related to mineralization and lithological features are always hidden in these high-dimensional data, which cannot be directly observed from the data. To solve this problem, a manifold learning-based uniform manifold approximation and projection (UMAP) method, was introduced to recognize mineralization-related geochemical anomalies from high-dimensional geochemical data in this study. The UMAP method is a nonlinear dimensionality reduction method, which is suitable for dimensionality reduction and visualization of high-dimensional data. A case study was conducted to demonstrate the advantages of the UMAP method for identifying ion-adsorbed rare-earth-element (REE) mineralization-related anomalies from high-dimensional data in the Nanling region, China. Factor analysis was used to determine ion-adsorbed REE mineralization-related element combination that consists of 10 elements. High-dimensional geochemical data were reduced to two dimensions based on the UMAP method. The results indicated that the UMAP method can effectively characterize the spatial distributions of ion-adsorbed REE mineralization-related anomalies by dimensionality reduction analysis and visualization analysis of high-dimensional geochemical data in the study area. To illustrate the superiority of the UMAP method, a comparative study was conducted between the UMAP and other three manifold learning methods, namely locally linear embedding (LLE), isometric feature mapping (Isomap) and t-distributed stochastic neighbor embedding (t-SNE). The performance of the four manifold learning methods was evaluated by receiver operating characteristic (ROC) curve and prediction-area (P-A) plot, showing that the performance of the UMAP method is superior to that of the LLE, Isomap and t-SNE methods in terms of recognizing ion-adsorbed REE mineralization-related anomalies and the spatial distributions of the REE-bearing geological bodies in the Nanling belt.</div></div>","PeriodicalId":55973,"journal":{"name":"Chemie Der Erde-Geochemistry","volume":"84 4","pages":"Article 126157"},"PeriodicalIF":2.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Manifold learning-based UMAP method for geochemical anomaly identification\",\"authors\":\"Qingteng Zhang, Yue Liu, Hao Fang\",\"doi\":\"10.1016/j.chemer.2024.126157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Geochemical data are usually high-dimensional data that could contain dozens of elements. Geochemical distribution patterns and anomalies related to mineralization and lithological features are always hidden in these high-dimensional data, which cannot be directly observed from the data. To solve this problem, a manifold learning-based uniform manifold approximation and projection (UMAP) method, was introduced to recognize mineralization-related geochemical anomalies from high-dimensional geochemical data in this study. The UMAP method is a nonlinear dimensionality reduction method, which is suitable for dimensionality reduction and visualization of high-dimensional data. A case study was conducted to demonstrate the advantages of the UMAP method for identifying ion-adsorbed rare-earth-element (REE) mineralization-related anomalies from high-dimensional data in the Nanling region, China. Factor analysis was used to determine ion-adsorbed REE mineralization-related element combination that consists of 10 elements. High-dimensional geochemical data were reduced to two dimensions based on the UMAP method. The results indicated that the UMAP method can effectively characterize the spatial distributions of ion-adsorbed REE mineralization-related anomalies by dimensionality reduction analysis and visualization analysis of high-dimensional geochemical data in the study area. To illustrate the superiority of the UMAP method, a comparative study was conducted between the UMAP and other three manifold learning methods, namely locally linear embedding (LLE), isometric feature mapping (Isomap) and t-distributed stochastic neighbor embedding (t-SNE). The performance of the four manifold learning methods was evaluated by receiver operating characteristic (ROC) curve and prediction-area (P-A) plot, showing that the performance of the UMAP method is superior to that of the LLE, Isomap and t-SNE methods in terms of recognizing ion-adsorbed REE mineralization-related anomalies and the spatial distributions of the REE-bearing geological bodies in the Nanling belt.</div></div>\",\"PeriodicalId\":55973,\"journal\":{\"name\":\"Chemie Der Erde-Geochemistry\",\"volume\":\"84 4\",\"pages\":\"Article 126157\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemie Der Erde-Geochemistry\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009281924000825\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemie Der Erde-Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009281924000825","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
Manifold learning-based UMAP method for geochemical anomaly identification
Geochemical data are usually high-dimensional data that could contain dozens of elements. Geochemical distribution patterns and anomalies related to mineralization and lithological features are always hidden in these high-dimensional data, which cannot be directly observed from the data. To solve this problem, a manifold learning-based uniform manifold approximation and projection (UMAP) method, was introduced to recognize mineralization-related geochemical anomalies from high-dimensional geochemical data in this study. The UMAP method is a nonlinear dimensionality reduction method, which is suitable for dimensionality reduction and visualization of high-dimensional data. A case study was conducted to demonstrate the advantages of the UMAP method for identifying ion-adsorbed rare-earth-element (REE) mineralization-related anomalies from high-dimensional data in the Nanling region, China. Factor analysis was used to determine ion-adsorbed REE mineralization-related element combination that consists of 10 elements. High-dimensional geochemical data were reduced to two dimensions based on the UMAP method. The results indicated that the UMAP method can effectively characterize the spatial distributions of ion-adsorbed REE mineralization-related anomalies by dimensionality reduction analysis and visualization analysis of high-dimensional geochemical data in the study area. To illustrate the superiority of the UMAP method, a comparative study was conducted between the UMAP and other three manifold learning methods, namely locally linear embedding (LLE), isometric feature mapping (Isomap) and t-distributed stochastic neighbor embedding (t-SNE). The performance of the four manifold learning methods was evaluated by receiver operating characteristic (ROC) curve and prediction-area (P-A) plot, showing that the performance of the UMAP method is superior to that of the LLE, Isomap and t-SNE methods in terms of recognizing ion-adsorbed REE mineralization-related anomalies and the spatial distributions of the REE-bearing geological bodies in the Nanling belt.
期刊介绍:
GEOCHEMISTRY was founded as Chemie der Erde 1914 in Jena, and, hence, is one of the oldest journals for geochemistry-related topics.
GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed at broad geosciences audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication, and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences.
The following topics are covered by the expertise of the members of the editorial board (see below):
-cosmochemistry, meteoritics-
igneous, metamorphic, and sedimentary petrology-
volcanology-
low & high temperature geochemistry-
experimental - theoretical - field related studies-
mineralogy - crystallography-
environmental geosciences-
archaeometry