Loris Nanni , Nicola Maritan , Daniel Fusaro , Sheryl Brahnam , Francesco Boscolo Meneguolo , Maria Sgaravatto
{"title":"Insect identification by combining different neural networks","authors":"Loris Nanni , Nicola Maritan , Daniel Fusaro , Sheryl Brahnam , Francesco Boscolo Meneguolo , Maria Sgaravatto","doi":"10.1016/j.eswa.2025.126935","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Traditional insect species classification relies on taxonomic experts examining unique physical characteristics of specimens, a time-consuming and error-prone process. Machine learning (ML) offers a promising alternative by identifying subtle morphological and genetic differences computationally. However, most existing approaches classify undescribed species as outliers, which limits their utility for biodiversity monitoring.</div></div><div><h3>Objective</h3><div>This study aims to develop an ML method capable of simultaneously classifying described species and grouping undescribed species by genus, thereby advancing the field of automated insect classification.</div></div><div><h3>Method</h3><div>We propose a novel ensemble approach combining neural networks (convolutional and attention-based) and Support Vector Machines (SVM), with both DNA barcoding and insect images as input data. To optimize the neural networks for diverse data types, we transform one-dimensional feature vectors into matrices using wavelet transforms. Additionally, a transformer-based architecture integrates DNA barcoding and image features for enhanced classification accuracy.</div></div><div><h3>Experimental Results</h3><div>Our method was evaluated on a comprehensive dataset containing paired insect images and DNA barcodes for 1,040 species across four insect orders. The results demonstrate superior performance compared to existing methods in classifying described species and grouping undescribed ones by genus.</div></div><div><h3>Conclusion</h3><div>The proposed approach represents a significant advancement in automated insect classification, addressing both described and undescribed species. This method has the potential to revolutionize global biodiversity monitoring. The MATLAB/PyTorch source code and dataset used are available at <span><span>https://github.com/LorisNanni/Insect-identification</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"273 ","pages":"Article 126935"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425005573","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Traditional insect species classification relies on taxonomic experts examining unique physical characteristics of specimens, a time-consuming and error-prone process. Machine learning (ML) offers a promising alternative by identifying subtle morphological and genetic differences computationally. However, most existing approaches classify undescribed species as outliers, which limits their utility for biodiversity monitoring.
Objective
This study aims to develop an ML method capable of simultaneously classifying described species and grouping undescribed species by genus, thereby advancing the field of automated insect classification.
Method
We propose a novel ensemble approach combining neural networks (convolutional and attention-based) and Support Vector Machines (SVM), with both DNA barcoding and insect images as input data. To optimize the neural networks for diverse data types, we transform one-dimensional feature vectors into matrices using wavelet transforms. Additionally, a transformer-based architecture integrates DNA barcoding and image features for enhanced classification accuracy.
Experimental Results
Our method was evaluated on a comprehensive dataset containing paired insect images and DNA barcodes for 1,040 species across four insect orders. The results demonstrate superior performance compared to existing methods in classifying described species and grouping undescribed ones by genus.
Conclusion
The proposed approach represents a significant advancement in automated insect classification, addressing both described and undescribed species. This method has the potential to revolutionize global biodiversity monitoring. The MATLAB/PyTorch source code and dataset used are available at https://github.com/LorisNanni/Insect-identification.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.