Song-Quan Ong , Min-Hui Lim , Kim Bjerge , Francisco Javier Peris-Felipo , Rob Lind , Toke Thomas Høye
{"title":"InsectRoleVision: A computer vision system to study arthropod diversity and functional roles","authors":"Song-Quan Ong , Min-Hui Lim , Kim Bjerge , Francisco Javier Peris-Felipo , Rob Lind , Toke Thomas Høye","doi":"10.1016/j.ecoinf.2026.103701","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Arthropods make up the majority of species on Earth. To study their diversity and ecological roles in ecosystems, bulk sampling is commonly used to collect large numbers of specimens. Processing these samples is labor-intensive and time-consuming, often delaying timely decision-making in ecosystem monitoring. Automated detection systems offer a promising alternative to support sample processing; however, most existing systems still have two major limitations. First, the pipelines for localizing and classifying arthropods in images, whether single- or double-stage, are limited. Second, the prediction results often lack information about functional roles. Therefore, we developed <em>InsectRoleVision</em>, a more expert-centered and reliable system that enables inference of arthropod diversity based on their functional roles.</div></div><div><h3>Methodology</h3><div>To develop the system, an image dataset with taxonomic resolution was designed and created to support conclusions about the functional roles of the animals. Both single-stage and double-stage recognition pipelines were compared. For single-stage detection and the first stage of double-stage detection, four YOLO models and a transformer were evaluated to localize and classify the arthropods in each image. In double-stage detection, the region of interest (RoI) was cropped into individual images after localization and used to compare four classification models: InceptionV3, ResNet, MobileNet, and VGG19. A logic block pipeline was connected to the prediction results to further infer the richness and proportionality of each class or taxon with respect to their functional roles.</div></div><div><h3>Result</h3><div>YOLOv11 was the best-performing model, achieving over 93% mAP, precision, and recall in localizing arthropods in the images. InceptionV3 was the best-performing classifier, achieving 80% precision and recall in classifying more than 43,000 cropped images of arthropods. There was no significant difference between the results of single- and modular double-stage detection strategies. Therefore, the choice between strategies depends on the intended application: single-stage detection provides real-time results and is suitable for real-time detection applications, while double-stage detection allows a human expert to review the detection proposal and refine the classification result. <em>InsectRoleVision</em> has adopted the YOLOv11-InceptionV3 architecture, which is more flexible and human-centered, allowing quick access to both arthropod diversity and ecological roles.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"95 ","pages":"Article 103701"},"PeriodicalIF":7.3000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S157495412600107X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/4/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Arthropods make up the majority of species on Earth. To study their diversity and ecological roles in ecosystems, bulk sampling is commonly used to collect large numbers of specimens. Processing these samples is labor-intensive and time-consuming, often delaying timely decision-making in ecosystem monitoring. Automated detection systems offer a promising alternative to support sample processing; however, most existing systems still have two major limitations. First, the pipelines for localizing and classifying arthropods in images, whether single- or double-stage, are limited. Second, the prediction results often lack information about functional roles. Therefore, we developed InsectRoleVision, a more expert-centered and reliable system that enables inference of arthropod diversity based on their functional roles.
Methodology
To develop the system, an image dataset with taxonomic resolution was designed and created to support conclusions about the functional roles of the animals. Both single-stage and double-stage recognition pipelines were compared. For single-stage detection and the first stage of double-stage detection, four YOLO models and a transformer were evaluated to localize and classify the arthropods in each image. In double-stage detection, the region of interest (RoI) was cropped into individual images after localization and used to compare four classification models: InceptionV3, ResNet, MobileNet, and VGG19. A logic block pipeline was connected to the prediction results to further infer the richness and proportionality of each class or taxon with respect to their functional roles.
Result
YOLOv11 was the best-performing model, achieving over 93% mAP, precision, and recall in localizing arthropods in the images. InceptionV3 was the best-performing classifier, achieving 80% precision and recall in classifying more than 43,000 cropped images of arthropods. There was no significant difference between the results of single- and modular double-stage detection strategies. Therefore, the choice between strategies depends on the intended application: single-stage detection provides real-time results and is suitable for real-time detection applications, while double-stage detection allows a human expert to review the detection proposal and refine the classification result. InsectRoleVision has adopted the YOLOv11-InceptionV3 architecture, which is more flexible and human-centered, allowing quick access to both arthropod diversity and ecological roles.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.