Jianpeng Jing, Nannan Zhang, Hao Zhang, Shibin Liao, Li Chen, Jinyu Chang, Jintao Tao, Siyuan Li
{"title":"Lithology Identification of Lithium Minerals Based on TL-FMix-MobileViT Model","authors":"Jianpeng Jing, Nannan Zhang, Hao Zhang, Shibin Liao, Li Chen, Jinyu Chang, Jintao Tao, Siyuan Li","doi":"10.1007/s11053-025-10475-0","DOIUrl":null,"url":null,"abstract":"<p>In lithium mineral exploration, rapid and accurate identification of lithium-related rock lithologies is critical. Traditional manual methods are time-consuming and have limited accuracy, whereas some deep learning models, despite offering high precision, suffer from high computational complexity and low inference speeds, limiting their practical application. To address these issues, this study proposes a lightweight deep learning method based on a transfer learning-based Fourier-space mixed sample data augmentation mobile vision transformer (TL-FMix-MobileViT) to efficiently identify six types of lithium-related rock lithologies. Data from Dahongliutan (Xinjiang, China), Portugal, and Spain were used for model training. The model integrates the inverted residual blocks of MobileNetV2, reducing computational cost and accelerating inference with depth-wise separable convolutions, along with a lightweight vision transformer that extracts both local and global features while lowering complexity. Transfer learning with pretrained models reduces the training time and resource usage, while the FMix data augmentation method improves the generalization ability and accelerates convergence. Among three TL-FMix-MobileViT variants (extra-extra small, extra small, and small), the small version performed best, with strong stability and generalization ability, although all variants offer benefits for different scenarios. Compared with seven classic models, TL-FMix-MobileViT achieved the highest classification performance, with over 99% accuracy and reliable inference. Visual comparisons showed that the model effectively captured features at rock boundaries, thereby providing superior classification of mixed rock features compared with other models. This lightweight model provides an efficient and accurate method for lithium-related rock lithology identification, demonstrating its potential for lithium exploration.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"158 9 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-025-10475-0","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In lithium mineral exploration, rapid and accurate identification of lithium-related rock lithologies is critical. Traditional manual methods are time-consuming and have limited accuracy, whereas some deep learning models, despite offering high precision, suffer from high computational complexity and low inference speeds, limiting their practical application. To address these issues, this study proposes a lightweight deep learning method based on a transfer learning-based Fourier-space mixed sample data augmentation mobile vision transformer (TL-FMix-MobileViT) to efficiently identify six types of lithium-related rock lithologies. Data from Dahongliutan (Xinjiang, China), Portugal, and Spain were used for model training. The model integrates the inverted residual blocks of MobileNetV2, reducing computational cost and accelerating inference with depth-wise separable convolutions, along with a lightweight vision transformer that extracts both local and global features while lowering complexity. Transfer learning with pretrained models reduces the training time and resource usage, while the FMix data augmentation method improves the generalization ability and accelerates convergence. Among three TL-FMix-MobileViT variants (extra-extra small, extra small, and small), the small version performed best, with strong stability and generalization ability, although all variants offer benefits for different scenarios. Compared with seven classic models, TL-FMix-MobileViT achieved the highest classification performance, with over 99% accuracy and reliable inference. Visual comparisons showed that the model effectively captured features at rock boundaries, thereby providing superior classification of mixed rock features compared with other models. This lightweight model provides an efficient and accurate method for lithium-related rock lithology identification, demonstrating its potential for lithium exploration.
期刊介绍:
This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.