利用机器学习对恐龙脚印进行分类

Michael Jones, Jens N Lallensack, Ian Jarman, Peter Falkingham, Ivo Siekmann
{"title":"利用机器学习对恐龙脚印进行分类","authors":"Michael Jones, Jens N Lallensack, Ian Jarman, Peter Falkingham, Ivo Siekmann","doi":"10.1101/2024.07.15.603597","DOIUrl":null,"url":null,"abstract":"Fossilised dinosaur footprints enable us to study the behaviour of individual dinosaurs as well as interactions between dinosaurs of the same or different species. There are two principal groups of three-toed dinosaurs, ornithopods and theropods. Determining if a footprint is from an ornithopod or a theropod is a challenging problem. Based on a data set of over 300 dinosaur footprints we train several machine learning models for classifying footprints as either ornithopods or theropods. The data are provided in the form of 20 landmarks for representing each footprint which are derived from images. Variable selection using logistic forward regression demonstrates that the selected landmarks are at locations that are intuitively expected to be especially informative locations, such as the top or the bottom of a footprint. Most models show good accuracy but the recall of ornithopods, of which fewer samples were contained in the data set, was generally lower than the recall of theropods. The Multi-Layer Perceptron (MLP) stands out as the model which did best at dealing with the class imbalance. Finally, we investigate which footprints were misclassified by the majority of models. We find that some misclassified samples exhibit features that are characteristic of the other class or have a compromised shape, for example, a middle toe that points to the left or the right rather than straight ahead.","PeriodicalId":501477,"journal":{"name":"bioRxiv - Paleontology","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of dinosaur footprints using machine learning\",\"authors\":\"Michael Jones, Jens N Lallensack, Ian Jarman, Peter Falkingham, Ivo Siekmann\",\"doi\":\"10.1101/2024.07.15.603597\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fossilised dinosaur footprints enable us to study the behaviour of individual dinosaurs as well as interactions between dinosaurs of the same or different species. There are two principal groups of three-toed dinosaurs, ornithopods and theropods. Determining if a footprint is from an ornithopod or a theropod is a challenging problem. Based on a data set of over 300 dinosaur footprints we train several machine learning models for classifying footprints as either ornithopods or theropods. The data are provided in the form of 20 landmarks for representing each footprint which are derived from images. Variable selection using logistic forward regression demonstrates that the selected landmarks are at locations that are intuitively expected to be especially informative locations, such as the top or the bottom of a footprint. Most models show good accuracy but the recall of ornithopods, of which fewer samples were contained in the data set, was generally lower than the recall of theropods. The Multi-Layer Perceptron (MLP) stands out as the model which did best at dealing with the class imbalance. Finally, we investigate which footprints were misclassified by the majority of models. We find that some misclassified samples exhibit features that are characteristic of the other class or have a compromised shape, for example, a middle toe that points to the left or the right rather than straight ahead.\",\"PeriodicalId\":501477,\"journal\":{\"name\":\"bioRxiv - Paleontology\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv - Paleontology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.07.15.603597\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Paleontology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.15.603597","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

通过恐龙脚印化石,我们可以研究单个恐龙的行为以及同种或不同种恐龙之间的相互作用。三趾恐龙主要分为两类:鸟脚类和兽脚类。判断一个脚印是鸟脚类恐龙还是兽脚类恐龙的脚印是一个具有挑战性的问题。基于 300 多个恐龙脚印的数据集,我们训练了几个机器学习模型,用于将脚印分类为鸟脚类恐龙或兽脚类恐龙。数据是以 20 个地标的形式提供的,每个地标代表一个从图像中提取的脚印。使用逻辑前向回归进行变量选择表明,所选地标位于直观预期信息量特别大的位置,如脚印的顶部或底部。大多数模型都显示出良好的准确性,但鸟脚类的召回率普遍低于兽脚类,因为数据集中包含的样本较少。多层感知器(MLP)在处理类别不平衡方面表现突出。最后,我们研究了大多数模型对哪些足迹进行了错误分类。我们发现,一些被错误分类的样本表现出了其他类别的特征,或者形状有所偏差,例如,中间的脚趾指向左侧或右侧,而不是正前方。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Classification of dinosaur footprints using machine learning
Fossilised dinosaur footprints enable us to study the behaviour of individual dinosaurs as well as interactions between dinosaurs of the same or different species. There are two principal groups of three-toed dinosaurs, ornithopods and theropods. Determining if a footprint is from an ornithopod or a theropod is a challenging problem. Based on a data set of over 300 dinosaur footprints we train several machine learning models for classifying footprints as either ornithopods or theropods. The data are provided in the form of 20 landmarks for representing each footprint which are derived from images. Variable selection using logistic forward regression demonstrates that the selected landmarks are at locations that are intuitively expected to be especially informative locations, such as the top or the bottom of a footprint. Most models show good accuracy but the recall of ornithopods, of which fewer samples were contained in the data set, was generally lower than the recall of theropods. The Multi-Layer Perceptron (MLP) stands out as the model which did best at dealing with the class imbalance. Finally, we investigate which footprints were misclassified by the majority of models. We find that some misclassified samples exhibit features that are characteristic of the other class or have a compromised shape, for example, a middle toe that points to the left or the right rather than straight ahead.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信