Foraminiferal image classification based on convolutional neural network considering data augmentation optimization

IF 1.5 4区 地球科学 Q2 PALEONTOLOGY
Jincan Wang , Muhui Zhang , Weiping Zeng , Songzhu Gu , Qingzhong Liang , Shuqin Zhou , Zimeng Gao
{"title":"Foraminiferal image classification based on convolutional neural network considering data augmentation optimization","authors":"Jincan Wang ,&nbsp;Muhui Zhang ,&nbsp;Weiping Zeng ,&nbsp;Songzhu Gu ,&nbsp;Qingzhong Liang ,&nbsp;Shuqin Zhou ,&nbsp;Zimeng Gao","doi":"10.1016/j.marmicro.2025.102476","DOIUrl":null,"url":null,"abstract":"<div><div>Foraminifera are of utmost importance in paleoclimate and marine ecosystem research, with accurate classification being equally vital. Convolutional Neural Networks (CNNs) can realize the automatic classification of foraminiferal images, but they usually rely on data augmentation to address the issue of data scarcity. Despite the widespread use of data augmentation methods, the impacts of various augmentation methods on the classification of foraminifera remain unclear. In this study, we systematically evaluated the effects of different data augmentation methods on the classification performance of CNNs using three publicly available datasets. Experiments based on the ResNet-50 architecture showed that random rotation (RR), random flipping (RF), and random erasing (RE, ratio = 0.2) significantly improved the classification accuracy. The combined model of these three methods achieved accuracies of 89.4 %, 89.7 %, and 95.7 %, and F1 scores of 72.7 %, 72.8 %, and 84.3 % in the three tasks respectively. Compared with the basic model, the accuracy (A) increased by an average of 3.2 %, and the F1 score (F1) increased by an average of 7.1 %. This study confirms that selecting and combining appropriate data augmentation methods can effectively enhance the performance of foraminiferal image classification, with the combination of RR, RF, and RE being the most effective.</div></div>","PeriodicalId":49881,"journal":{"name":"Marine Micropaleontology","volume":"199 ","pages":"Article 102476"},"PeriodicalIF":1.5000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Micropaleontology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377839825000416","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PALEONTOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Foraminifera are of utmost importance in paleoclimate and marine ecosystem research, with accurate classification being equally vital. Convolutional Neural Networks (CNNs) can realize the automatic classification of foraminiferal images, but they usually rely on data augmentation to address the issue of data scarcity. Despite the widespread use of data augmentation methods, the impacts of various augmentation methods on the classification of foraminifera remain unclear. In this study, we systematically evaluated the effects of different data augmentation methods on the classification performance of CNNs using three publicly available datasets. Experiments based on the ResNet-50 architecture showed that random rotation (RR), random flipping (RF), and random erasing (RE, ratio = 0.2) significantly improved the classification accuracy. The combined model of these three methods achieved accuracies of 89.4 %, 89.7 %, and 95.7 %, and F1 scores of 72.7 %, 72.8 %, and 84.3 % in the three tasks respectively. Compared with the basic model, the accuracy (A) increased by an average of 3.2 %, and the F1 score (F1) increased by an average of 7.1 %. This study confirms that selecting and combining appropriate data augmentation methods can effectively enhance the performance of foraminiferal image classification, with the combination of RR, RF, and RE being the most effective.
考虑数据增强优化的卷积神经网络有孔虫图像分类
有孔虫在古气候和海洋生态系统研究中具有极其重要的意义,准确的分类同样至关重要。卷积神经网络(cnn)可以实现有孔虫图像的自动分类,但通常依赖于数据增强来解决数据稀缺性问题。尽管数据增强方法被广泛使用,但各种增强方法对有孔虫分类的影响尚不清楚。在这项研究中,我们使用三个公开的数据集系统地评估了不同的数据增强方法对cnn分类性能的影响。基于ResNet-50架构的实验表明,随机旋转(RR)、随机翻转(RF)和随机擦除(RE, ratio = 0.2)显著提高了分类精度。三种方法组合模型的准确率分别为89.4%、89.7%和95.7%,F1得分分别为72.7%、72.8%和84.3%。与基本模型相比,准确率(A)平均提高3.2%,F1分数(F1)平均提高7.1%。本研究证实,选择和组合合适的数据增强方法可以有效提高有孔虫图像分类的性能,其中RR、RF和RE的组合效果最好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Marine Micropaleontology
Marine Micropaleontology 地学-古生物学
CiteScore
3.70
自引率
15.80%
发文量
62
审稿时长
26.7 weeks
期刊介绍: Marine Micropaleontology is an international journal publishing original, innovative and significant scientific papers in all fields related to marine microfossils, including ecology and paleoecology, biology and paleobiology, paleoceanography and paleoclimatology, environmental monitoring, taphonomy, evolution and molecular phylogeny. The journal strongly encourages the publication of articles in which marine microfossils and/or their chemical composition are used to solve fundamental geological, environmental and biological problems. However, it does not publish purely stratigraphic or taxonomic papers. In Marine Micropaleontology, a special section is dedicated to short papers on new methods and protocols using marine microfossils. We solicit special issues on hot topics in marine micropaleontology and review articles on timely subjects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信