InsectRoleVision: A computer vision system to study arthropod diversity and functional roles

IF 7.3 2区 环境科学与生态学 Q1 ECOLOGY
Ecological Informatics Pub Date : 2026-05-01 Epub Date: 2026-04-11 DOI:10.1016/j.ecoinf.2026.103701
Song-Quan Ong , Min-Hui Lim , Kim Bjerge , Francisco Javier Peris-Felipo , Rob Lind , Toke Thomas Høye
{"title":"InsectRoleVision: A computer vision system to study arthropod diversity and functional roles","authors":"Song-Quan Ong ,&nbsp;Min-Hui Lim ,&nbsp;Kim Bjerge ,&nbsp;Francisco Javier Peris-Felipo ,&nbsp;Rob Lind ,&nbsp;Toke Thomas Høye","doi":"10.1016/j.ecoinf.2026.103701","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Arthropods make up the majority of species on Earth. To study their diversity and ecological roles in ecosystems, bulk sampling is commonly used to collect large numbers of specimens. Processing these samples is labor-intensive and time-consuming, often delaying timely decision-making in ecosystem monitoring. Automated detection systems offer a promising alternative to support sample processing; however, most existing systems still have two major limitations. First, the pipelines for localizing and classifying arthropods in images, whether single- or double-stage, are limited. Second, the prediction results often lack information about functional roles. Therefore, we developed <em>InsectRoleVision</em>, a more expert-centered and reliable system that enables inference of arthropod diversity based on their functional roles.</div></div><div><h3>Methodology</h3><div>To develop the system, an image dataset with taxonomic resolution was designed and created to support conclusions about the functional roles of the animals. Both single-stage and double-stage recognition pipelines were compared. For single-stage detection and the first stage of double-stage detection, four YOLO models and a transformer were evaluated to localize and classify the arthropods in each image. In double-stage detection, the region of interest (RoI) was cropped into individual images after localization and used to compare four classification models: InceptionV3, ResNet, MobileNet, and VGG19. A logic block pipeline was connected to the prediction results to further infer the richness and proportionality of each class or taxon with respect to their functional roles.</div></div><div><h3>Result</h3><div>YOLOv11 was the best-performing model, achieving over 93% mAP, precision, and recall in localizing arthropods in the images. InceptionV3 was the best-performing classifier, achieving 80% precision and recall in classifying more than 43,000 cropped images of arthropods. There was no significant difference between the results of single- and modular double-stage detection strategies. Therefore, the choice between strategies depends on the intended application: single-stage detection provides real-time results and is suitable for real-time detection applications, while double-stage detection allows a human expert to review the detection proposal and refine the classification result. <em>InsectRoleVision</em> has adopted the YOLOv11-InceptionV3 architecture, which is more flexible and human-centered, allowing quick access to both arthropod diversity and ecological roles.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"95 ","pages":"Article 103701"},"PeriodicalIF":7.3000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S157495412600107X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/4/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Arthropods make up the majority of species on Earth. To study their diversity and ecological roles in ecosystems, bulk sampling is commonly used to collect large numbers of specimens. Processing these samples is labor-intensive and time-consuming, often delaying timely decision-making in ecosystem monitoring. Automated detection systems offer a promising alternative to support sample processing; however, most existing systems still have two major limitations. First, the pipelines for localizing and classifying arthropods in images, whether single- or double-stage, are limited. Second, the prediction results often lack information about functional roles. Therefore, we developed InsectRoleVision, a more expert-centered and reliable system that enables inference of arthropod diversity based on their functional roles.

Methodology

To develop the system, an image dataset with taxonomic resolution was designed and created to support conclusions about the functional roles of the animals. Both single-stage and double-stage recognition pipelines were compared. For single-stage detection and the first stage of double-stage detection, four YOLO models and a transformer were evaluated to localize and classify the arthropods in each image. In double-stage detection, the region of interest (RoI) was cropped into individual images after localization and used to compare four classification models: InceptionV3, ResNet, MobileNet, and VGG19. A logic block pipeline was connected to the prediction results to further infer the richness and proportionality of each class or taxon with respect to their functional roles.

Result

YOLOv11 was the best-performing model, achieving over 93% mAP, precision, and recall in localizing arthropods in the images. InceptionV3 was the best-performing classifier, achieving 80% precision and recall in classifying more than 43,000 cropped images of arthropods. There was no significant difference between the results of single- and modular double-stage detection strategies. Therefore, the choice between strategies depends on the intended application: single-stage detection provides real-time results and is suitable for real-time detection applications, while double-stage detection allows a human expert to review the detection proposal and refine the classification result. InsectRoleVision has adopted the YOLOv11-InceptionV3 architecture, which is more flexible and human-centered, allowing quick access to both arthropod diversity and ecological roles.
昆虫视觉:研究节肢动物多样性和功能角色的计算机视觉系统
节肢动物构成了地球上大多数物种。为了研究它们在生态系统中的多样性和生态作用,通常采用大量取样的方法来收集大量标本。处理这些样本是一项费时费力的工作,往往会耽误生态系统监测的及时决策。自动检测系统为支持样品处理提供了一个有前途的替代方案;然而,大多数现有的系统仍然有两个主要的限制。首先,无论是单阶段还是双阶段,用于图像中节肢动物定位和分类的管道都是有限的。其次,预测结果往往缺乏关于功能角色的信息。因此,我们开发了一个更以专家为中心、更可靠的系统,可以根据节肢动物的功能角色来推断它们的多样性。为了开发该系统,设计并创建了具有分类学分辨率的图像数据集,以支持有关动物功能角色的结论。对单级和双级识别管道进行了比较。在单级检测和双级检测的第一阶段,利用4种YOLO模型和一个变压器对每张图像中的节肢动物进行定位和分类。在双阶段检测中,定位后的感兴趣区域(RoI)被裁剪成单个图像,并用于比较四种分类模型:InceptionV3、ResNet、MobileNet和VGG19。通过逻辑块管道与预测结果相连接,进一步推断每个类或分类单元在功能角色方面的丰富度和比例性。结果tyolov11是表现最好的模型,在图像中定位节肢动物的mAP、precision和recall均达到93%以上。InceptionV3是表现最好的分类器,在分类超过43,000个节肢动物的裁剪图像中达到80%的准确率和召回率。单阶段检测策略与模块双阶段检测策略的结果无显著差异。因此,策略之间的选择取决于预期的应用:单阶段检测提供实时结果,适用于实时检测应用,而双阶段检测允许人类专家审查检测提案并改进分类结果。昆虫trolevision采用了YOLOv11-InceptionV3架构,该架构更加灵活,以人为本,可以快速访问节肢动物多样性和生态角色。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书