Deep learning based identification of rock minerals from un-processed digital microscopic images of undisturbed broken-surfaces

M.A. Dalhat, Sami A. Osman
{"title":"Deep learning based identification of rock minerals from un-processed digital microscopic images of undisturbed broken-surfaces","authors":"M.A. Dalhat,&nbsp;Sami A. Osman","doi":"10.1016/j.aiig.2025.100127","DOIUrl":null,"url":null,"abstract":"<div><div>This study employed convolutional neural networks (CNNs) for the classification of rock minerals based on 3179 RGB-scale original microstructural images of undisturbed broken surfaces. The image dataset covers 40 distinct rock mineral-types. Three CNN architectures (Simple model, SqueezeNet, and Xception) were evaluated to compare their performance and feature extraction capabilities. Gradient-weighted Class Activation Mapping (Grad-CAM) was employed to visualize the features influencing model predictions, providing insights into how each model distinguishes between mineral classes. Key discriminative attributes included texture, grain size, pattern, and color variations. Texture and grain boundaries were identified as the most critical features, as they were strongly activated regions by the best model. Patterns such as banding and chromatic contrasts further enhanced classification accuracy. Performance analysis revealed that the Simple model had limited ability to isolate fine-grained details, producing broad and less specific activations (0.84 test accuracy). SqueezeNet demonstrated improved localization of discriminative features but occasionally missed finer textural details (0.95 test accuracy). The Xception model outperformed the others, achieving the highest classification accuracy (0.98 test accuracy) by exhibiting precise and tightly focused activations, capturing intricate textures and subtle chromatic variations. Its superior performance can be attributed to its deep architecture and efficient depth-wise separable convolutions, which enabled hierarchical and detailed feature extraction. Results underscores the importance of texture, pattern, and chromatic features in accurate mineral classification and highlights the suitability of deep, efficient architectures like Xception for such tasks. These findings demonstrate the potential of CNNs in geoscience research, offering a framework for automated mineral identification in industrial and scientific applications.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100127"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544125000231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study employed convolutional neural networks (CNNs) for the classification of rock minerals based on 3179 RGB-scale original microstructural images of undisturbed broken surfaces. The image dataset covers 40 distinct rock mineral-types. Three CNN architectures (Simple model, SqueezeNet, and Xception) were evaluated to compare their performance and feature extraction capabilities. Gradient-weighted Class Activation Mapping (Grad-CAM) was employed to visualize the features influencing model predictions, providing insights into how each model distinguishes between mineral classes. Key discriminative attributes included texture, grain size, pattern, and color variations. Texture and grain boundaries were identified as the most critical features, as they were strongly activated regions by the best model. Patterns such as banding and chromatic contrasts further enhanced classification accuracy. Performance analysis revealed that the Simple model had limited ability to isolate fine-grained details, producing broad and less specific activations (0.84 test accuracy). SqueezeNet demonstrated improved localization of discriminative features but occasionally missed finer textural details (0.95 test accuracy). The Xception model outperformed the others, achieving the highest classification accuracy (0.98 test accuracy) by exhibiting precise and tightly focused activations, capturing intricate textures and subtle chromatic variations. Its superior performance can be attributed to its deep architecture and efficient depth-wise separable convolutions, which enabled hierarchical and detailed feature extraction. Results underscores the importance of texture, pattern, and chromatic features in accurate mineral classification and highlights the suitability of deep, efficient architectures like Xception for such tasks. These findings demonstrate the potential of CNNs in geoscience research, offering a framework for automated mineral identification in industrial and scientific applications.
基于深度学习的岩石矿物识别,从未受干扰的破碎表面的未处理的数字显微图像
本研究基于3179张rgb尺度原始破碎面显微结构图像,采用卷积神经网络(cnn)对岩石矿物进行分类。图像数据集涵盖了40种不同的岩石矿物类型。我们评估了三种CNN架构(Simple model、SqueezeNet和Xception),比较了它们的性能和特征提取能力。梯度加权类激活映射(Grad-CAM)用于可视化影响模型预测的特征,提供每个模型如何区分矿物类别的见解。关键的鉴别属性包括纹理、粒度、图案和颜色变化。纹理和晶界被认为是最关键的特征,因为它们是被最佳模型强烈激活的区域。带状和彩色对比等模式进一步提高了分类的准确性。性能分析表明,Simple模型隔离细粒度细节的能力有限,产生广泛而不太特定的激活(测试精度为0.84)。SqueezeNet在判别特征的定位上得到了改进,但偶尔会遗漏更精细的纹理细节(测试精度为0.95)。Xception模型优于其他模型,通过展示精确和紧密聚焦的激活,捕获复杂的纹理和微妙的颜色变化,实现了最高的分类精度(0.98测试精度)。其优越的性能可归因于其深层架构和高效的深度可分离卷积,这使得分层和详细的特征提取成为可能。结果强调了纹理、图案和颜色特征在准确矿物分类中的重要性,并强调了像Xception这样的深层、高效架构对此类任务的适用性。这些发现证明了cnn在地球科学研究中的潜力,为工业和科学应用中的自动矿物识别提供了一个框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信