Lightweight citrus leaf disease detection model based on ARMS and cross-domain dynamic attention

IF 5.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Henghui Mo, Linjing Wei
{"title":"Lightweight citrus leaf disease detection model based on ARMS and cross-domain dynamic attention","authors":"Henghui Mo,&nbsp;Linjing Wei","doi":"10.1016/j.jksuci.2024.102133","DOIUrl":null,"url":null,"abstract":"<div><p>In citrus cultivation, Anthracnose, Scab, and Greasy Spot significantly impact yield and quality. Facing challenges in detecting small targets against complex orchard backgrounds with uneven lighting and obstructions, existing models suffer from low detection accuracy. This study introduces the YOLOv8n-CDDA citrus leaf disease detection model. The Cross-Domain Dynamic Attention (CDDA) mechanism deconstructs the backbone network’s input feature maps into sections, dynamically assigning spatial and channel attention weights to reconstruct critical information and capture the variations and weak semantic features of disease textures. The proposed Adaptive Random Mix-Cut Splicing (ARMS) image augmentation technique blends diseased leaf images with healthy citrus backgrounds, enhancing the diversity and number of background targets. To reduce computational and memory consumption, the network is streamlined through channel pruning; to compensate for the loss in accuracy from pruning, a teacher–assistant–student network format is used for knowledge distillation, where the student network learns from soft knowledge to improve disease recognition accuracy. Finally, Grad-CAM++ technology generates heatmaps of the detections, facilitating the visualization of effective features and deepening understanding of the model’s focus areas. Experimental results demonstrate that the YOLOv8n-CDDA model achieves an average accuracy of 90.89% in disease detection, with an average recall rate of 81.12%, and a mean Average Precision (mAP50) of 88.36%. Compared to the original YOLO v8n and current mainstream detection models such as YOLOv5s, SSD, and Faster-RCNN, the improvements in average accuracy are respectively 2.95%, 4.78%, 14.22%, and 21.01%; in average recall, 2.36%, 3.09%, 15.74%, and 23.27%; and in mAP50, 2.38%, 3.13%, 13.45%, and 20.91%. After pruning and distillation for lightweight adaptation, the YOLOv8n-CDDA model has a parameter size of 0.8M, requires 4.2 GFLOPs, weighs 2.0 MB, and operates at 45 fps. Compared to YOLOv8n, this represents a reduction of 2.2M in parameters, 3.9 GFLOPs, and 4 MB in model weight, with an increase of 7 fps in speed. This model exhibits exceptional performance in the complex environment of citrus leaf disease detection, providing robust technical support for citrus growth monitoring studies, and offering insights for disease detection in other crops as well.</p></div>","PeriodicalId":48547,"journal":{"name":"Journal of King Saud University-Computer and Information Sciences","volume":"36 7","pages":"Article 102133"},"PeriodicalIF":5.2000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1319157824002222/pdfft?md5=d186db7e088c79129786ff12b138ee08&pid=1-s2.0-S1319157824002222-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of King Saud University-Computer and Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1319157824002222","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In citrus cultivation, Anthracnose, Scab, and Greasy Spot significantly impact yield and quality. Facing challenges in detecting small targets against complex orchard backgrounds with uneven lighting and obstructions, existing models suffer from low detection accuracy. This study introduces the YOLOv8n-CDDA citrus leaf disease detection model. The Cross-Domain Dynamic Attention (CDDA) mechanism deconstructs the backbone network’s input feature maps into sections, dynamically assigning spatial and channel attention weights to reconstruct critical information and capture the variations and weak semantic features of disease textures. The proposed Adaptive Random Mix-Cut Splicing (ARMS) image augmentation technique blends diseased leaf images with healthy citrus backgrounds, enhancing the diversity and number of background targets. To reduce computational and memory consumption, the network is streamlined through channel pruning; to compensate for the loss in accuracy from pruning, a teacher–assistant–student network format is used for knowledge distillation, where the student network learns from soft knowledge to improve disease recognition accuracy. Finally, Grad-CAM++ technology generates heatmaps of the detections, facilitating the visualization of effective features and deepening understanding of the model’s focus areas. Experimental results demonstrate that the YOLOv8n-CDDA model achieves an average accuracy of 90.89% in disease detection, with an average recall rate of 81.12%, and a mean Average Precision (mAP50) of 88.36%. Compared to the original YOLO v8n and current mainstream detection models such as YOLOv5s, SSD, and Faster-RCNN, the improvements in average accuracy are respectively 2.95%, 4.78%, 14.22%, and 21.01%; in average recall, 2.36%, 3.09%, 15.74%, and 23.27%; and in mAP50, 2.38%, 3.13%, 13.45%, and 20.91%. After pruning and distillation for lightweight adaptation, the YOLOv8n-CDDA model has a parameter size of 0.8M, requires 4.2 GFLOPs, weighs 2.0 MB, and operates at 45 fps. Compared to YOLOv8n, this represents a reduction of 2.2M in parameters, 3.9 GFLOPs, and 4 MB in model weight, with an increase of 7 fps in speed. This model exhibits exceptional performance in the complex environment of citrus leaf disease detection, providing robust technical support for citrus growth monitoring studies, and offering insights for disease detection in other crops as well.

基于 ARMS 和跨域动态注意力的轻量级柑橘叶病检测模型
在柑橘种植中,炭疽病、疮痂病和油脂斑病对产量和质量有很大影响。面对复杂的果园背景、不均匀的光照和障碍物,现有模型在检测小目标时面临挑战,检测精度较低。本研究介绍了 YOLOv8n-CDDA 柑橘叶病检测模型。跨域动态注意力(CDDA)机制将骨干网络的输入特征图解构为多个部分,动态分配空间和通道注意力权重以重构关键信息,并捕捉病害纹理的变化和弱语义特征。所提出的自适应随机混合剪切拼接(ARMS)图像增强技术将病叶图像与健康柑橘背景图像混合在一起,增强了背景目标的多样性和数量。为了减少计算量和内存消耗,通过通道剪枝精简了网络;为了弥补剪枝带来的准确率损失,采用了教师-助手-学生的网络形式进行知识提炼,其中学生网络从软知识中学习,以提高疾病识别准确率。最后,Grad-CAM++ 技术生成了检测的热图,促进了有效特征的可视化,加深了对模型重点领域的理解。实验结果表明,YOLOv8n-CDDA 模型的疾病检测平均准确率为 90.89%,平均召回率为 81.12%,平均精度(mAP50)为 88.36%。与最初的 YOLO v8n 和目前主流的检测模型(如 YOLOv5s、SSD 和 Faster-RCNN)相比,平均准确率分别提高了 2.95%、4.78%、14.22% 和 21.01%;平均召回率分别提高了 2.36%、3.09%、15.74% 和 23.27%;mAP50 分别提高了 2.38%、3.13%、13.45% 和 20.91%。经过剪枝和蒸馏以实现轻量级适配后,YOLOv8n-CDDA 模型的参数大小为 0.8M,需要 4.2 GFLOPs,重 2.0 MB,运行速度为 45 fps。与 YOLOv8n 相比,参数减少了 2.2M,需要 3.9 GFLOPs,模型重量减少了 4 MB,速度提高了 7 fps。该模型在柑橘叶病检测的复杂环境中表现出卓越的性能,为柑橘生长监测研究提供了强大的技术支持,同时也为其他作物的病害检测提供了启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.50
自引率
8.70%
发文量
656
审稿时长
29 days
期刊介绍: In 2022 the Journal of King Saud University - Computer and Information Sciences will become an author paid open access journal. Authors who submit their manuscript after October 31st 2021 will be asked to pay an Article Processing Charge (APC) after acceptance of their paper to make their work immediately, permanently, and freely accessible to all. The Journal of King Saud University Computer and Information Sciences is a refereed, international journal that covers all aspects of both foundations of computer and its practical applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信