Maofa Wang , Wenheng Guo , Fengshan Yang , Bingchen Yan , Yanlin Xu , Jun Jiang , Jingjing Huang
{"title":"Rock thin section image classification in low data scenarios using few-shot learning","authors":"Maofa Wang , Wenheng Guo , Fengshan Yang , Bingchen Yan , Yanlin Xu , Jun Jiang , Jingjing Huang","doi":"10.1016/j.cageo.2025.105962","DOIUrl":null,"url":null,"abstract":"<div><div>Microscopic rock thin section image recognition is crucial in rock mineral analysis. Typically, deep learning models are used to automate expert knowledge, but the scarcity of samples in certain categories limits the available training data, affecting the performance of traditional deep learning models. This paper proposes a novel few-shot learning model to address the challenge of classifying rock thin section images under limited sample conditions. Based on advanced few-shot learning processes involving pre-training and meta-training, we first introduce a Cross Attention Feature Fusion (CAFF) module. This module generates new features by combining plane polarized light images (PPL) and cross-polarized light images (XPL) of rock thin sections under a microscope, integrating these with the original features through autonomous learning to obtain more comprehensive features. Secondly, we propose a Feature Selection (FS) module based on the prototypical network (ProtoNet). This module enhances the model’s classification capability by extracting key feature dimensions from two perspectives: intra-class representativeness and inter-class distinctiveness. Finally, using the pre-trained ResNet50 and Swim-Transformer on ImageNet-1000k as the backbone network, simulation experiments were conducted on the Nanjing University Rock Thin Section Teaching Dataset. Under the 5-Way 5-Shot few-shot learning task standard, the proposed ProtoNet-CAFF-FS model achieved an average classification accuracy of 96.70% and 99.16%, outperforming traditional modeling methods and demonstrating the effectiveness of the newly added modules.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"203 ","pages":"Article 105962"},"PeriodicalIF":4.4000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300425001128","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Microscopic rock thin section image recognition is crucial in rock mineral analysis. Typically, deep learning models are used to automate expert knowledge, but the scarcity of samples in certain categories limits the available training data, affecting the performance of traditional deep learning models. This paper proposes a novel few-shot learning model to address the challenge of classifying rock thin section images under limited sample conditions. Based on advanced few-shot learning processes involving pre-training and meta-training, we first introduce a Cross Attention Feature Fusion (CAFF) module. This module generates new features by combining plane polarized light images (PPL) and cross-polarized light images (XPL) of rock thin sections under a microscope, integrating these with the original features through autonomous learning to obtain more comprehensive features. Secondly, we propose a Feature Selection (FS) module based on the prototypical network (ProtoNet). This module enhances the model’s classification capability by extracting key feature dimensions from two perspectives: intra-class representativeness and inter-class distinctiveness. Finally, using the pre-trained ResNet50 and Swim-Transformer on ImageNet-1000k as the backbone network, simulation experiments were conducted on the Nanjing University Rock Thin Section Teaching Dataset. Under the 5-Way 5-Shot few-shot learning task standard, the proposed ProtoNet-CAFF-FS model achieved an average classification accuracy of 96.70% and 99.16%, outperforming traditional modeling methods and demonstrating the effectiveness of the newly added modules.
期刊介绍:
Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.