Multi-angle, cross-domain fusion strategy enhances automated insect identification and hierarchical categorization: a case study on assassin bugs (Hemiptera: Reduviidae).

IF 6.2 2区生物学 Q1 EVOLUTIONARY BIOLOGY

Cladistics Pub Date : 2026-02-11 DOI:10.1111/cla.70029

Xinkai Wang, Huaiyu Liu, Zhuo Chen, Yisheng Zhao, Yingqi Liu, Haoyang Xiong, Yuange Duan, Fang Song, Wanzhi Cai, Xuankun Li, Hu Li

{"title":"Multi-angle, cross-domain fusion strategy enhances automated insect identification and hierarchical categorization: a case study on assassin bugs (Hemiptera: Reduviidae).","authors":"Xinkai Wang, Huaiyu Liu, Zhuo Chen, Yisheng Zhao, Yingqi Liu, Haoyang Xiong, Yuange Duan, Fang Song, Wanzhi Cai, Xuankun Li, Hu Li","doi":"10.1111/cla.70029","DOIUrl":null,"url":null,"abstract":"<p><p>Automated insect identification systems hold significant value for biodiversity monitoring, pest management, citizen science initiatives and systematic studies, particularly in an era of declining expertise in insect taxonomy. However, current deep learning approaches often rely on standardized specimen photos from limited-angles and simplified backgrounds, limiting their generalization and effectiveness in diverse practical scenarios. Here, we address this limitation using assassin bugs (Hemiptera: Reduviidae) as a model system-a highly diverse group with complex morphological variation. We developed a comprehensive, high-quality dataset of 11 915 expert-validated images from 92 species across 48 genera and six subfamilies, integrating three image types: standard, turntable-captured and ecological. Using ConvNeXt-B architecture, we systematically evaluated the performance of classification and hierarchical categorization to higher taxonomic ranks across different training and testing scenarios. Multi-angle fusion increased species recognition accuracy by 5.72% and F1-score by 0.061 and increased the hierarchical categorization of unseen species to correct genera by 13.53% and the F1-score by 0.129. Incorporating ecological images further enhanced model performance by 13% for both tasks. Grad-CAM visualization revealed that multi-angle, cross-domain fusion guides the model to focus on taxonomically diagnostic traits, yielding hierarchical feature representations aligned with classical taxonomy across lab and wild environments. Our results demonstrate that integrating diverse viewing angles and ecological contexts substantially improves model cross-domain adaptation, providing a practical framework for developing reliable automated insect identification tools for diverse practical scenarios. Furthermore, our model's ability to accurately categorize unseen taxa into higher taxonomic ranks holds significant potential for future applications in systematics, including facilitate specimen sorting and generating testable phylogenetic hypotheses.</p>","PeriodicalId":50688,"journal":{"name":"Cladistics","volume":" ","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cladistics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/cla.70029","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Automated insect identification systems hold significant value for biodiversity monitoring, pest management, citizen science initiatives and systematic studies, particularly in an era of declining expertise in insect taxonomy. However, current deep learning approaches often rely on standardized specimen photos from limited-angles and simplified backgrounds, limiting their generalization and effectiveness in diverse practical scenarios. Here, we address this limitation using assassin bugs (Hemiptera: Reduviidae) as a model system-a highly diverse group with complex morphological variation. We developed a comprehensive, high-quality dataset of 11 915 expert-validated images from 92 species across 48 genera and six subfamilies, integrating three image types: standard, turntable-captured and ecological. Using ConvNeXt-B architecture, we systematically evaluated the performance of classification and hierarchical categorization to higher taxonomic ranks across different training and testing scenarios. Multi-angle fusion increased species recognition accuracy by 5.72% and F1-score by 0.061 and increased the hierarchical categorization of unseen species to correct genera by 13.53% and the F1-score by 0.129. Incorporating ecological images further enhanced model performance by 13% for both tasks. Grad-CAM visualization revealed that multi-angle, cross-domain fusion guides the model to focus on taxonomically diagnostic traits, yielding hierarchical feature representations aligned with classical taxonomy across lab and wild environments. Our results demonstrate that integrating diverse viewing angles and ecological contexts substantially improves model cross-domain adaptation, providing a practical framework for developing reliable automated insect identification tools for diverse practical scenarios. Furthermore, our model's ability to accurately categorize unseen taxa into higher taxonomic ranks holds significant potential for future applications in systematics, including facilitate specimen sorting and generating testable phylogenetic hypotheses.

查看原文本刊更多论文

多角度、跨域融合策略增强了昆虫自动识别和分层分类——以半翅目：猎蝽科为例

昆虫自动识别系统在生物多样性监测、害虫管理、公民科学倡议和系统研究方面具有重要价值，特别是在昆虫分类学专业知识下降的时代。然而，目前的深度学习方法往往依赖于有限角度和简化背景的标准化标本照片，限制了它们在各种实际场景中的泛化和有效性。在这里，我们使用刺客蝽（半翅目：Reduviidae）作为模型系统来解决这一限制，刺客蝽是一个高度多样化的群体，具有复杂的形态变化。我们开发了一个全面的、高质量的数据集，包括来自6个亚科48属92个物种的11 915张经过专家验证的图像，整合了三种图像类型：标准、转台捕获和生态。采用ConvNeXt-B架构，系统地评估了不同训练和测试场景下分类和分层分类的性能。多角度融合使物种识别准确率提高了5.72%，f1评分提高了0.061分；将未见种分类为正确属的准确率提高了13.53%，f1评分提高了0.129分。结合生态图像进一步提高了模型在两项任务中的表现，提高了13%。Grad-CAM可视化显示，多角度、跨域融合引导模型专注于分类学诊断特征，产生与实验室和野生环境中的经典分类学一致的分层特征表示。我们的研究结果表明，整合不同视角和生态环境大大提高了模型的跨域适应性，为开发可靠的昆虫自动识别工具提供了一个实用框架。此外，我们的模型能够准确地将未见的分类群划分为更高的分类等级，这在系统分类学的未来应用中具有重要的潜力，包括促进标本分类和产生可测试的系统发育假设。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cladistics 生物-进化生物学

CiteScore

8.60

自引率

5.60%

发文量

期刊介绍： Cladistics publishes high quality research papers on systematics, encouraging debate on all aspects of the field, from philosophy, theory and methodology to empirical studies and applications in biogeography, coevolution, conservation biology, ontogeny, genomics and paleontology. Cladistics is read by scientists working in the research fields of evolution, systematics and integrative biology and enjoys a consistently high position in the ISI® rankings for evolutionary biology.