{"title":"Efficient Attention-Lightweight Deep Learning Architecture Integration for Plant Pest Recognition","authors":"Sivasubramaniam Janarthan;Selvarajah Thuseethan;Charles Joseph;Vigneshwaran Palanisamy;Sutharshan Rajasegarar;John Yearwood","doi":"10.1109/TAFE.2025.3583334","DOIUrl":null,"url":null,"abstract":"Many real-world agricultural applications, such as automatic pest recognition, benefit from lightweight deep learning (DL) architectures due to their reduced computational complexity, enabling deployment on resource-constrained devices. However, this paradigm shift comes at the cost of model performance, significantly limiting its extensive use. Traditional data-centric approaches for improving model performance, such as using large training datasets, are often unsuitable for the agricultural domain due to limited labeled data and high data collection costs. On the other hand, architectural improvements, such as attention mechanisms, have demonstrated the potential to enhance the performance of lightweight DL architectures. However, improper integration can lead to increased complexity and diminished performance. To address this challenge, this study proposes a novel mechanism to systematically determine the optimal integration configuration of popular attention techniques with the MobileNet lightweight DL architecture. The proposed method is evaluated on four variants of two benchmark plant pest datasets (D1<sub>5,869</sub> and D1<sub>500</sub>, D2<sub>1599</sub>, and D2<sub>545</sub>) and the best integration configurations are reported along with their results. The Bottleneck Attention Module (BAM) attention mechanism, integrated into 12 different layers of MobileNetV2 (BAM12), demonstrated superior performance on D1<sub>5869</sub> and D1<sub>500</sub>, and D2<sub>1599</sub> and D2<sub>545</sub>, while integrating BAM into eight layers yielded higher accuracy on D2<sub>1599</sub>. As a result, a comparison with the MobileNet baseline demonstrates that the careful integration of attention mechanisms significantly improves performance.","PeriodicalId":100637,"journal":{"name":"IEEE Transactions on AgriFood Electronics","volume":"3 2","pages":"548-560"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on AgriFood Electronics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11073813/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Many real-world agricultural applications, such as automatic pest recognition, benefit from lightweight deep learning (DL) architectures because their reduced computational complexity enables deployment on resource-constrained devices. However, this paradigm shift comes at the cost of model performance, which significantly limits their widespread adoption. Traditional data-centric approaches to improving model performance, such as training on large datasets, are often unsuitable for the agricultural domain due to limited labeled data and high data collection costs. Architectural improvements such as attention mechanisms, on the other hand, have demonstrated the potential to enhance the performance of lightweight DL architectures; improper integration, however, can increase complexity and diminish performance. To address this challenge, this study proposes a novel mechanism to systematically determine the optimal integration configuration of popular attention techniques with the MobileNet lightweight DL architecture. The proposed method is evaluated on four variants of two benchmark plant pest datasets (D1_5869, D1_500, D2_1599, and D2_545), and the best integration configurations are reported along with their results. The Bottleneck Attention Module (BAM), integrated into 12 different layers of MobileNetV2 (BAM12), demonstrated superior performance on D1_5869, D1_500, and D2_545, while integrating BAM into eight layers yielded higher accuracy on D2_1599. Overall, a comparison with the MobileNet baseline demonstrates that careful integration of attention mechanisms significantly improves performance.
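The abstract does not include code, so the following is only a minimal PyTorch sketch of the kind of integration it describes: a BAM-style attention block (channel plus spatial branches, applied as a residual refinement) attached after selected feature blocks of torchvision's MobileNetV2. The BAM class, the insert_bam helper, and the choice of blocks 2-13 (12 blocks, loosely mirroring the "BAM12" configuration) are illustrative assumptions, not the authors' implementation or their searched configuration.

```python
import torch
import torch.nn as nn
from torchvision.models import mobilenet_v2


class BAM(nn.Module):
    """Simplified Bottleneck Attention Module: channel and spatial branches."""

    def __init__(self, channels: int, reduction: int = 16, dilation: int = 4):
        super().__init__()
        # Channel branch: global average pooling followed by a bottleneck MLP.
        self.channel_att = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
        )
        # Spatial branch: 1x1 reduction, a dilated 3x3 conv, then a single attention map.
        self.spatial_att = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.Conv2d(channels // reduction, channels // reduction,
                      kernel_size=3, padding=dilation, dilation=dilation),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, 1, kernel_size=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Broadcast-add the two branches, squash to [0, 1], and refine the input residually.
        att = torch.sigmoid(self.channel_att(x) + self.spatial_att(x))
        return x * (1.0 + att)


def insert_bam(model: nn.Module, block_indices: list[int]) -> nn.Module:
    """Wrap the chosen MobileNetV2 feature blocks so each is followed by a BAM block."""
    for idx in block_indices:
        block = model.features[idx]
        # torchvision's InvertedResidual blocks expose their output channel count.
        out_channels = block.out_channels
        model.features[idx] = nn.Sequential(block, BAM(out_channels))
    return model


# Hypothetical "BAM12"-like configuration: attach BAM after 12 inverted-residual blocks.
model = mobilenet_v2(num_classes=10)           # class count is a placeholder
model = insert_bam(model, block_indices=list(range(2, 14)))
print(model(torch.randn(1, 3, 224, 224)).shape)  # torch.Size([1, 10])
```

In this sketch, searching over integration configurations (as the paper proposes) would amount to varying block_indices and the attention type, then training and comparing each candidate against the unmodified MobileNetV2 baseline.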