{"title":"改变鳄鱼的可追溯性:识别暹罗鳄的深度度量学习","authors":"Kriengsak Treeprapin, Kantapon Kaewtip, Worapong Singchat, Nattakan Ariyaraphong, Thitipong Panthum, Prateep Duengkae, Yosapong Temsiripong, Kornsorn Srikulnath, Suchin Trirongjitmoah","doi":"10.1016/j.ecoinf.2024.102771","DOIUrl":null,"url":null,"abstract":"This study introduces a novel method for identifying individual Siamese crocodiles (), which is a crucial requirement for conservation and sustainable industry practices. Although deep metric learning (DML) has improved identification model robustness and reduced dependency on large datasets, comprehensive field studies and long-term deployments are lacking. To address this, DML combined with convolutional neural network (CNN) was applied for enhancing accuracy using a limited and imbalanced number of images per class and distinguishing dissimilar scale patterns of the head and ventral regions. Individual crocodiles were identified using the k-nearest neighbor (KNN) and support vector machine (SVM) classifiers based on the extracted features. Data were collected from 30 individuals on a crocodile farm using photographs taken over two consecutive years. Two identification types, Type 1, based on a model trained on images collected over two years; and Type 2, based on a model trained exclusively on images from the first year, were implemented. Type 1 identification, which used a CNN combined with the KNN and SVM classifiers, exhibited an accuracy exceeding 99.75 and 92.93% for the ventral and head regions, respectively. Type 2 identification exhibited a reduced accuracy because of a comparatively smaller amount of learning information; the proposed CNN achieved 83.99% accuracy for ventral identification and 67.14 and 65.61% for head identification with KNN and SVM, respectively. This study underscores the efficacy of DML and CNN for handling small, imbalanced datasets in identifying individual crocodiles, and has significant implications for traceability and conservation initiatives in the crocodile industry.","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transforming crocodile traceability: Deep metric learning for identifying Siamese crocodiles\",\"authors\":\"Kriengsak Treeprapin, Kantapon Kaewtip, Worapong Singchat, Nattakan Ariyaraphong, Thitipong Panthum, Prateep Duengkae, Yosapong Temsiripong, Kornsorn Srikulnath, Suchin Trirongjitmoah\",\"doi\":\"10.1016/j.ecoinf.2024.102771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study introduces a novel method for identifying individual Siamese crocodiles (), which is a crucial requirement for conservation and sustainable industry practices. Although deep metric learning (DML) has improved identification model robustness and reduced dependency on large datasets, comprehensive field studies and long-term deployments are lacking. To address this, DML combined with convolutional neural network (CNN) was applied for enhancing accuracy using a limited and imbalanced number of images per class and distinguishing dissimilar scale patterns of the head and ventral regions. Individual crocodiles were identified using the k-nearest neighbor (KNN) and support vector machine (SVM) classifiers based on the extracted features. Data were collected from 30 individuals on a crocodile farm using photographs taken over two consecutive years. Two identification types, Type 1, based on a model trained on images collected over two years; and Type 2, based on a model trained exclusively on images from the first year, were implemented. Type 1 identification, which used a CNN combined with the KNN and SVM classifiers, exhibited an accuracy exceeding 99.75 and 92.93% for the ventral and head regions, respectively. Type 2 identification exhibited a reduced accuracy because of a comparatively smaller amount of learning information; the proposed CNN achieved 83.99% accuracy for ventral identification and 67.14 and 65.61% for head identification with KNN and SVM, respectively. This study underscores the efficacy of DML and CNN for handling small, imbalanced datasets in identifying individual crocodiles, and has significant implications for traceability and conservation initiatives in the crocodile industry.\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ecoinf.2024.102771\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.ecoinf.2024.102771","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Transforming crocodile traceability: Deep metric learning for identifying Siamese crocodiles
This study introduces a novel method for identifying individual Siamese crocodiles (), which is a crucial requirement for conservation and sustainable industry practices. Although deep metric learning (DML) has improved identification model robustness and reduced dependency on large datasets, comprehensive field studies and long-term deployments are lacking. To address this, DML combined with convolutional neural network (CNN) was applied for enhancing accuracy using a limited and imbalanced number of images per class and distinguishing dissimilar scale patterns of the head and ventral regions. Individual crocodiles were identified using the k-nearest neighbor (KNN) and support vector machine (SVM) classifiers based on the extracted features. Data were collected from 30 individuals on a crocodile farm using photographs taken over two consecutive years. Two identification types, Type 1, based on a model trained on images collected over two years; and Type 2, based on a model trained exclusively on images from the first year, were implemented. Type 1 identification, which used a CNN combined with the KNN and SVM classifiers, exhibited an accuracy exceeding 99.75 and 92.93% for the ventral and head regions, respectively. Type 2 identification exhibited a reduced accuracy because of a comparatively smaller amount of learning information; the proposed CNN achieved 83.99% accuracy for ventral identification and 67.14 and 65.61% for head identification with KNN and SVM, respectively. This study underscores the efficacy of DML and CNN for handling small, imbalanced datasets in identifying individual crocodiles, and has significant implications for traceability and conservation initiatives in the crocodile industry.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.