Yi-Wei Cai, Kun-Feng Qiu, Maurizio Petrelli, Zhao-Liang Hou, M. Santosh, Hao-Cheng Yu, Ryan T. Armstrong, Jun Deng
{"title":"The application of “transfer learning” in optical microscopy: the petrographic classification of metallic minerals","authors":"Yi-Wei Cai, Kun-Feng Qiu, Maurizio Petrelli, Zhao-Liang Hou, M. Santosh, Hao-Cheng Yu, Ryan T. Armstrong, Jun Deng","doi":"10.2138/am-2023-9092","DOIUrl":null,"url":null,"abstract":"\n Analysis of optical microscopic image data is crucial for the identification and characterization of mineral phases, and thus directly relevant to the subsequent methodology selections of further detailed petrological exploration. Here we present a novel application of Swin Transformer, a deep learning algorithm to classify metal mineral phases such as arsenopyrite, chalcopyrite, gold, pyrite, and stibnite, in images captured by optical microscopy. To speed up the training process and improve the generalization capabilities of the investigated model, we adopt the “transfer learning” paradigm by pretraining the algorithm using a large, general-purpose, image dataset named ImageNet-1k. Further, we compare the performances of the Swin Transformer with those of two well-established Convolutional Neural Networks (CNNs) named MobileNetv2 and ResNet50, respectively. Our results highlight a maximum accuracy of 0.92 for the Swin Transformer, outperforming the CNNs. To provide an interpretation of the trained models, we apply the so-called Class Activation Map (CAM), which points to a strong global feature extraction ability of the Swin Transformer metal mineral classifier that focuses on distinctive (e.g., colors) and microstructural (e.g., edge shapes) features. The results demonstrate that the deep learning approach can accurately extract all available attributes, which reveals the potential to assist in data exploration and provides an opportunity to carry out spatial quantization at a large scale (cm-mm). Simultaneously, boosting the learning processes with pre-trained weights can accurately capture relevant attributes in mineral classification, revealing the potential for application in mineralogy and petrology, as well as enabling its use in resource explorations.","PeriodicalId":7768,"journal":{"name":"American Mineralogist","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Mineralogist","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.2138/am-2023-9092","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Analysis of optical microscopic image data is crucial for the identification and characterization of mineral phases, and thus directly relevant to the subsequent methodology selections of further detailed petrological exploration. Here we present a novel application of Swin Transformer, a deep learning algorithm to classify metal mineral phases such as arsenopyrite, chalcopyrite, gold, pyrite, and stibnite, in images captured by optical microscopy. To speed up the training process and improve the generalization capabilities of the investigated model, we adopt the “transfer learning” paradigm by pretraining the algorithm using a large, general-purpose, image dataset named ImageNet-1k. Further, we compare the performances of the Swin Transformer with those of two well-established Convolutional Neural Networks (CNNs) named MobileNetv2 and ResNet50, respectively. Our results highlight a maximum accuracy of 0.92 for the Swin Transformer, outperforming the CNNs. To provide an interpretation of the trained models, we apply the so-called Class Activation Map (CAM), which points to a strong global feature extraction ability of the Swin Transformer metal mineral classifier that focuses on distinctive (e.g., colors) and microstructural (e.g., edge shapes) features. The results demonstrate that the deep learning approach can accurately extract all available attributes, which reveals the potential to assist in data exploration and provides an opportunity to carry out spatial quantization at a large scale (cm-mm). Simultaneously, boosting the learning processes with pre-trained weights can accurately capture relevant attributes in mineral classification, revealing the potential for application in mineralogy and petrology, as well as enabling its use in resource explorations.
期刊介绍:
American Mineralogist: Journal of Earth and Planetary Materials (Am Min), is the flagship journal of the Mineralogical Society of America (MSA), continuously published since 1916. Am Min is home to some of the most important advances in the Earth Sciences. Our mission is a continuance of this heritage: to provide readers with reports on original scientific research, both fundamental and applied, with far reaching implications and far ranging appeal. Topics of interest cover all aspects of planetary evolution, and biological and atmospheric processes mediated by solid-state phenomena. These include, but are not limited to, mineralogy and crystallography, high- and low-temperature geochemistry, petrology, geofluids, bio-geochemistry, bio-mineralogy, synthetic materials of relevance to the Earth and planetary sciences, and breakthroughs in analytical methods of any of the aforementioned.