V. Carlsson, T. Danelian, Pierre Boulet, P. Devienne, Aurelien Laforge, J. Renaudie
{"title":"Artificial intelligence applied to the classification of eight middle Eocene species of the genus Podocyrtis (polycystine radiolaria)","authors":"V. Carlsson, T. Danelian, Pierre Boulet, P. Devienne, Aurelien Laforge, J. Renaudie","doi":"10.5194/jm-41-165-2022","DOIUrl":null,"url":null,"abstract":"Abstract. This study evaluates the application of artificial intelligence (AI) to the automatic\nclassification of radiolarians and uses as an example eight distinct\nmorphospecies of the Eocene radiolarian genus Podocyrtis, which are part of three\ndifferent evolutionary lineages and are useful in biostratigraphy. The\nsamples used in this study were recovered from the equatorial Atlantic (ODP\nLeg 207) and were supplemented with some samples coming from the North\nAtlantic and Indian Oceans. To create an automatic classification tool,\nnumerous images of the investigated species were needed to train a\nMobileNet convolutional neural network entirely coded in Python. Three\ndifferent datasets were obtained. The first one consists of a mixture of\nbroken and complete specimens, some of which sometimes appear blurry. The\nsecond and third datasets were leveled down into two further steps, which\nexcludes broken and blurry specimens while increasing the quality. The\nconvolutional neural network randomly selected 85 % of all specimens for\ntraining, while the remaining 15 % were used for validation. The MobileNet\narchitecture had an overall accuracy of about 91 % for all datasets.\nThree predicational models were thereafter created, which had been trained\non each dataset and worked well for classification of Podocyrtis coming from the\nIndian Ocean (Madingley Rise, ODP Leg 115, Hole 711A) and the western North\nAtlantic Ocean (New Jersey slope, DSDP Leg 95, Hole 612 and Blake Nose, ODP\nLeg 171B, Hole 1051A). These samples also provided clearer images since they\nwere mounted with Canada balsam rather than Norland epoxy. In spite of some\nmorphological differences encountered in different parts of the world's\noceans and differences in image quality, most species could be correctly\nclassified or at least classified with a neighboring species along a\nlineage. Classification improved slightly for some species by cropping\nand/or removing background particles of images which did not segment\nproperly in the image processing. However, depending on cropping or\nbackground removal, the best result came from the predictive model trained on\nthe normal stacked dataset consisting of a mixture of broken and complete\nspecimens.\n","PeriodicalId":54786,"journal":{"name":"Journal of Micropalaeontology","volume":" ","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2022-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Micropalaeontology","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.5194/jm-41-165-2022","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PALEONTOLOGY","Score":null,"Total":0}
引用次数: 3
Abstract
Abstract. This study evaluates the application of artificial intelligence (AI) to the automatic
classification of radiolarians and uses as an example eight distinct
morphospecies of the Eocene radiolarian genus Podocyrtis, which are part of three
different evolutionary lineages and are useful in biostratigraphy. The
samples used in this study were recovered from the equatorial Atlantic (ODP
Leg 207) and were supplemented with some samples coming from the North
Atlantic and Indian Oceans. To create an automatic classification tool,
numerous images of the investigated species were needed to train a
MobileNet convolutional neural network entirely coded in Python. Three
different datasets were obtained. The first one consists of a mixture of
broken and complete specimens, some of which sometimes appear blurry. The
second and third datasets were leveled down into two further steps, which
excludes broken and blurry specimens while increasing the quality. The
convolutional neural network randomly selected 85 % of all specimens for
training, while the remaining 15 % were used for validation. The MobileNet
architecture had an overall accuracy of about 91 % for all datasets.
Three predicational models were thereafter created, which had been trained
on each dataset and worked well for classification of Podocyrtis coming from the
Indian Ocean (Madingley Rise, ODP Leg 115, Hole 711A) and the western North
Atlantic Ocean (New Jersey slope, DSDP Leg 95, Hole 612 and Blake Nose, ODP
Leg 171B, Hole 1051A). These samples also provided clearer images since they
were mounted with Canada balsam rather than Norland epoxy. In spite of some
morphological differences encountered in different parts of the world's
oceans and differences in image quality, most species could be correctly
classified or at least classified with a neighboring species along a
lineage. Classification improved slightly for some species by cropping
and/or removing background particles of images which did not segment
properly in the image processing. However, depending on cropping or
background removal, the best result came from the predictive model trained on
the normal stacked dataset consisting of a mixture of broken and complete
specimens.
期刊介绍:
The Journal of Micropalaeontology (JM) is an established international journal covering all aspects of microfossils and their application to both applied studies and basic research. In particular we welcome submissions relating to microfossils and their application to palaeoceanography, palaeoclimatology, palaeobiology, evolution, taxonomy, environmental change and molecular phylogeny.