Foraminiferal image classification based on convolutional neural network considering data augmentation optimization

IF 1.5 4区地球科学 Q2 PALEONTOLOGY

Marine Micropaleontology Pub Date : 2025-06-06 DOI:10.1016/j.marmicro.2025.102476

Jincan Wang , Muhui Zhang , Weiping Zeng , Songzhu Gu , Qingzhong Liang , Shuqin Zhou , Zimeng Gao

{"title":"Foraminiferal image classification based on convolutional neural network considering data augmentation optimization","authors":"Jincan Wang , Muhui Zhang , Weiping Zeng , Songzhu Gu , Qingzhong Liang , Shuqin Zhou , Zimeng Gao","doi":"10.1016/j.marmicro.2025.102476","DOIUrl":null,"url":null,"abstract":"<div><div>Foraminifera are of utmost importance in paleoclimate and marine ecosystem research, with accurate classification being equally vital. Convolutional Neural Networks (CNNs) can realize the automatic classification of foraminiferal images, but they usually rely on data augmentation to address the issue of data scarcity. Despite the widespread use of data augmentation methods, the impacts of various augmentation methods on the classification of foraminifera remain unclear. In this study, we systematically evaluated the effects of different data augmentation methods on the classification performance of CNNs using three publicly available datasets. Experiments based on the ResNet-50 architecture showed that random rotation (RR), random flipping (RF), and random erasing (RE, ratio = 0.2) significantly improved the classification accuracy. The combined model of these three methods achieved accuracies of 89.4 %, 89.7 %, and 95.7 %, and F1 scores of 72.7 %, 72.8 %, and 84.3 % in the three tasks respectively. Compared with the basic model, the accuracy (A) increased by an average of 3.2 %, and the F1 score (F1) increased by an average of 7.1 %. This study confirms that selecting and combining appropriate data augmentation methods can effectively enhance the performance of foraminiferal image classification, with the combination of RR, RF, and RE being the most effective.</div></div>","PeriodicalId":49881,"journal":{"name":"Marine Micropaleontology","volume":"199 ","pages":"Article 102476"},"PeriodicalIF":1.5000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Micropaleontology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377839825000416","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PALEONTOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Foraminifera are of utmost importance in paleoclimate and marine ecosystem research, with accurate classification being equally vital. Convolutional Neural Networks (CNNs) can realize the automatic classification of foraminiferal images, but they usually rely on data augmentation to address the issue of data scarcity. Despite the widespread use of data augmentation methods, the impacts of various augmentation methods on the classification of foraminifera remain unclear. In this study, we systematically evaluated the effects of different data augmentation methods on the classification performance of CNNs using three publicly available datasets. Experiments based on the ResNet-50 architecture showed that random rotation (RR), random flipping (RF), and random erasing (RE, ratio = 0.2) significantly improved the classification accuracy. The combined model of these three methods achieved accuracies of 89.4 %, 89.7 %, and 95.7 %, and F1 scores of 72.7 %, 72.8 %, and 84.3 % in the three tasks respectively. Compared with the basic model, the accuracy (A) increased by an average of 3.2 %, and the F1 score (F1) increased by an average of 7.1 %. This study confirms that selecting and combining appropriate data augmentation methods can effectively enhance the performance of foraminiferal image classification, with the combination of RR, RF, and RE being the most effective.

Abstract Image

查看原文本刊更多论文

考虑数据增强优化的卷积神经网络有孔虫图像分类

有孔虫在古气候和海洋生态系统研究中具有极其重要的意义，准确的分类同样至关重要。卷积神经网络（cnn）可以实现有孔虫图像的自动分类，但通常依赖于数据增强来解决数据稀缺性问题。尽管数据增强方法被广泛使用，但各种增强方法对有孔虫分类的影响尚不清楚。在这项研究中，我们使用三个公开的数据集系统地评估了不同的数据增强方法对cnn分类性能的影响。基于ResNet-50架构的实验表明，随机旋转（RR）、随机翻转（RF）和随机擦除（RE, ratio = 0.2）显著提高了分类精度。三种方法组合模型的准确率分别为89.4%、89.7%和95.7%，F1得分分别为72.7%、72.8%和84.3%。与基本模型相比，准确率(A)平均提高3.2%，F1分数（F1）平均提高7.1%。本研究证实，选择和组合合适的数据增强方法可以有效提高有孔虫图像分类的性能，其中RR、RF和RE的组合效果最好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Marine Micropaleontology 地学-古生物学

CiteScore

3.70

自引率

15.80%

发文量

审稿时长

26.7 weeks

期刊介绍： Marine Micropaleontology is an international journal publishing original, innovative and significant scientific papers in all fields related to marine microfossils, including ecology and paleoecology, biology and paleobiology, paleoceanography and paleoclimatology, environmental monitoring, taphonomy, evolution and molecular phylogeny. The journal strongly encourages the publication of articles in which marine microfossils and/or their chemical composition are used to solve fundamental geological, environmental and biological problems. However, it does not publish purely stratigraphic or taxonomic papers. In Marine Micropaleontology, a special section is dedicated to short papers on new methods and protocols using marine microfossils. We solicit special issues on hot topics in marine micropaleontology and review articles on timely subjects.