基于TL-FMix-MobileViT模型的锂矿物岩性识别

IF 4.8 2区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY
Jianpeng Jing, Nannan Zhang, Hao Zhang, Shibin Liao, Li Chen, Jinyu Chang, Jintao Tao, Siyuan Li
{"title":"基于TL-FMix-MobileViT模型的锂矿物岩性识别","authors":"Jianpeng Jing, Nannan Zhang, Hao Zhang, Shibin Liao, Li Chen, Jinyu Chang, Jintao Tao, Siyuan Li","doi":"10.1007/s11053-025-10475-0","DOIUrl":null,"url":null,"abstract":"<p>In lithium mineral exploration, rapid and accurate identification of lithium-related rock lithologies is critical. Traditional manual methods are time-consuming and have limited accuracy, whereas some deep learning models, despite offering high precision, suffer from high computational complexity and low inference speeds, limiting their practical application. To address these issues, this study proposes a lightweight deep learning method based on a transfer learning-based Fourier-space mixed sample data augmentation mobile vision transformer (TL-FMix-MobileViT) to efficiently identify six types of lithium-related rock lithologies. Data from Dahongliutan (Xinjiang, China), Portugal, and Spain were used for model training. The model integrates the inverted residual blocks of MobileNetV2, reducing computational cost and accelerating inference with depth-wise separable convolutions, along with a lightweight vision transformer that extracts both local and global features while lowering complexity. Transfer learning with pretrained models reduces the training time and resource usage, while the FMix data augmentation method improves the generalization ability and accelerates convergence. Among three TL-FMix-MobileViT variants (extra-extra small, extra small, and small), the small version performed best, with strong stability and generalization ability, although all variants offer benefits for different scenarios. Compared with seven classic models, TL-FMix-MobileViT achieved the highest classification performance, with over 99% accuracy and reliable inference. Visual comparisons showed that the model effectively captured features at rock boundaries, thereby providing superior classification of mixed rock features compared with other models. This lightweight model provides an efficient and accurate method for lithium-related rock lithology identification, demonstrating its potential for lithium exploration.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"158 9 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lithology Identification of Lithium Minerals Based on TL-FMix-MobileViT Model\",\"authors\":\"Jianpeng Jing, Nannan Zhang, Hao Zhang, Shibin Liao, Li Chen, Jinyu Chang, Jintao Tao, Siyuan Li\",\"doi\":\"10.1007/s11053-025-10475-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In lithium mineral exploration, rapid and accurate identification of lithium-related rock lithologies is critical. Traditional manual methods are time-consuming and have limited accuracy, whereas some deep learning models, despite offering high precision, suffer from high computational complexity and low inference speeds, limiting their practical application. To address these issues, this study proposes a lightweight deep learning method based on a transfer learning-based Fourier-space mixed sample data augmentation mobile vision transformer (TL-FMix-MobileViT) to efficiently identify six types of lithium-related rock lithologies. Data from Dahongliutan (Xinjiang, China), Portugal, and Spain were used for model training. The model integrates the inverted residual blocks of MobileNetV2, reducing computational cost and accelerating inference with depth-wise separable convolutions, along with a lightweight vision transformer that extracts both local and global features while lowering complexity. Transfer learning with pretrained models reduces the training time and resource usage, while the FMix data augmentation method improves the generalization ability and accelerates convergence. Among three TL-FMix-MobileViT variants (extra-extra small, extra small, and small), the small version performed best, with strong stability and generalization ability, although all variants offer benefits for different scenarios. Compared with seven classic models, TL-FMix-MobileViT achieved the highest classification performance, with over 99% accuracy and reliable inference. Visual comparisons showed that the model effectively captured features at rock boundaries, thereby providing superior classification of mixed rock features compared with other models. This lightweight model provides an efficient and accurate method for lithium-related rock lithology identification, demonstrating its potential for lithium exploration.</p>\",\"PeriodicalId\":54284,\"journal\":{\"name\":\"Natural Resources Research\",\"volume\":\"158 9 1\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Resources Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s11053-025-10475-0\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-025-10475-0","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

在锂矿勘探中,快速准确地识别与锂有关的岩石岩性是至关重要的。传统的人工方法耗时长,精度有限,而一些深度学习模型虽然精度高,但计算复杂度高,推理速度慢,限制了其实际应用。为了解决这些问题,本研究提出了一种基于迁移学习的傅里叶空间混合样本数据增强移动视觉转换器(TL-FMix-MobileViT)的轻量级深度学习方法,以有效识别六种与锂相关的岩石岩性。模型训练使用了来自中国新疆大红流滩、葡萄牙和西班牙的数据。该模型集成了MobileNetV2的倒转残差块,降低了计算成本,加速了深度可分离卷积的推理,以及一个轻量级的视觉转换器,在降低复杂性的同时提取局部和全局特征。预训练模型迁移学习减少了训练时间和资源使用,而FMix数据增强方法提高了泛化能力,加快了收敛速度。在三种TL-FMix-MobileViT变体(extra-extra small、extra small和small)中,小版本表现最好,具有较强的稳定性和泛化能力,尽管所有变体都具有不同场景的优势。与7个经典模型相比,TL-FMix-MobileViT的分类性能最高,准确率超过99%,推理可靠。目视对比表明,该模型有效地捕获了岩石边界特征,与其他模型相比,对混合岩石特征进行了更好的分类。这种轻量级模型为锂相关岩石岩性识别提供了一种高效、准确的方法,展示了其在锂勘探中的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Lithology Identification of Lithium Minerals Based on TL-FMix-MobileViT Model

In lithium mineral exploration, rapid and accurate identification of lithium-related rock lithologies is critical. Traditional manual methods are time-consuming and have limited accuracy, whereas some deep learning models, despite offering high precision, suffer from high computational complexity and low inference speeds, limiting their practical application. To address these issues, this study proposes a lightweight deep learning method based on a transfer learning-based Fourier-space mixed sample data augmentation mobile vision transformer (TL-FMix-MobileViT) to efficiently identify six types of lithium-related rock lithologies. Data from Dahongliutan (Xinjiang, China), Portugal, and Spain were used for model training. The model integrates the inverted residual blocks of MobileNetV2, reducing computational cost and accelerating inference with depth-wise separable convolutions, along with a lightweight vision transformer that extracts both local and global features while lowering complexity. Transfer learning with pretrained models reduces the training time and resource usage, while the FMix data augmentation method improves the generalization ability and accelerates convergence. Among three TL-FMix-MobileViT variants (extra-extra small, extra small, and small), the small version performed best, with strong stability and generalization ability, although all variants offer benefits for different scenarios. Compared with seven classic models, TL-FMix-MobileViT achieved the highest classification performance, with over 99% accuracy and reliable inference. Visual comparisons showed that the model effectively captured features at rock boundaries, thereby providing superior classification of mixed rock features compared with other models. This lightweight model provides an efficient and accurate method for lithium-related rock lithology identification, demonstrating its potential for lithium exploration.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Natural Resources Research
Natural Resources Research Environmental Science-General Environmental Science
CiteScore
11.90
自引率
11.10%
发文量
151
期刊介绍: This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信