Generalized zero-shot pest and disease image classification based on causal gating model

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Shansong Wang , Qingtian Zeng , Guiyuan Yuan , Weijian Ni , Chao Li , Hua Duan , Nengfu Xie , Fengjin Xiao , Xiaofeng Yang
{"title":"Generalized zero-shot pest and disease image classification based on causal gating model","authors":"Shansong Wang ,&nbsp;Qingtian Zeng ,&nbsp;Guiyuan Yuan ,&nbsp;Weijian Ni ,&nbsp;Chao Li ,&nbsp;Hua Duan ,&nbsp;Nengfu Xie ,&nbsp;Fengjin Xiao ,&nbsp;Xiaofeng Yang","doi":"10.1016/j.compag.2024.109827","DOIUrl":null,"url":null,"abstract":"<div><div>Images of scarce agricultural pests and diseases categories are often hard to obtain, and there are frequently no visual examples in web search engines. Therefore, it is challenging to build an image classification model based on supervised learning. To address this issue, Generalized Zero-shot Learning (GZSL) based on Generative Adversarial Networks (GANs) offers an effective solution. However, a major challenge of GZSL is that the model tends to overfit on seen class data, causing unseen classes to be frequently misclassified as seen classes. To meet this challenge, we propose a novel <strong>S</strong>tructural <strong>C</strong>ausal <strong>M</strong>odel-based <strong>B</strong>inary <strong>D</strong>omain <strong>C</strong>lassifier (SCM-BDC) for generalized zero-shot pest and disease image classification. Our method introduces a Structural Causal Model (SCM) to extract causal features from visual features to reduce the impact of non-causal features that blur the distinction between seen and unseen classes. Furthermore, we use an Angular Linear Layer (ALL) to project class-level attributes and causal features onto the unit hypersphere and identify a boundary for each seen class. During the testing phase, if the similarity between the sample and all seen attributes is less than the corresponding threshold, it is classified as an unseen class; otherwise, as a seen class. Finally, we use a seen classifier and an unseen classifier to predict the corresponding samples, respectively. Extensive experiments on APTV99, ADTV68, AWA1, AWA2, CUB, SUN, and FLO demonstrate that the proposed method can significantly improve the performance of GZSL. For the APTV99 and ADTV68 datasets, our method achieves a 5.2 % and 1.4 % improvement in GZSL classification accuracy over state-of-the-art methods.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"230 ","pages":"Article 109827"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924012183","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Images of scarce agricultural pests and diseases categories are often hard to obtain, and there are frequently no visual examples in web search engines. Therefore, it is challenging to build an image classification model based on supervised learning. To address this issue, Generalized Zero-shot Learning (GZSL) based on Generative Adversarial Networks (GANs) offers an effective solution. However, a major challenge of GZSL is that the model tends to overfit on seen class data, causing unseen classes to be frequently misclassified as seen classes. To meet this challenge, we propose a novel Structural Causal Model-based Binary Domain Classifier (SCM-BDC) for generalized zero-shot pest and disease image classification. Our method introduces a Structural Causal Model (SCM) to extract causal features from visual features to reduce the impact of non-causal features that blur the distinction between seen and unseen classes. Furthermore, we use an Angular Linear Layer (ALL) to project class-level attributes and causal features onto the unit hypersphere and identify a boundary for each seen class. During the testing phase, if the similarity between the sample and all seen attributes is less than the corresponding threshold, it is classified as an unseen class; otherwise, as a seen class. Finally, we use a seen classifier and an unseen classifier to predict the corresponding samples, respectively. Extensive experiments on APTV99, ADTV68, AWA1, AWA2, CUB, SUN, and FLO demonstrate that the proposed method can significantly improve the performance of GZSL. For the APTV99 and ADTV68 datasets, our method achieves a 5.2 % and 1.4 % improvement in GZSL classification accuracy over state-of-the-art methods.
基于因果门控模型的广义零射击病虫害图像分类
稀缺的农业病虫害类别的图像通常很难获得,而且在网络搜索引擎中往往没有视觉示例。因此,建立基于监督学习的图像分类模型是一个挑战。为了解决这一问题,基于生成对抗网络(GANs)的广义零射击学习(GZSL)提供了一个有效的解决方案。然而,GZSL的一个主要挑战是模型倾向于过度拟合可见类数据,导致不可见类经常被错误地分类为可见类。为了应对这一挑战,我们提出了一种新的基于结构因果模型的二值域分类器(SCM-BDC),用于广义零射击病虫害图像分类。我们的方法引入了一个结构因果模型(SCM),从视觉特征中提取因果特征,以减少模糊可见类和未见类之间区别的非因果特征的影响。此外,我们使用角线性层(ALL)将类级属性和因果特征投影到单位超球上,并为每个看到的类确定边界。在测试阶段,如果样本与所有可见属性之间的相似度小于相应的阈值,则将其分类为未见类;否则,作为一个已见类。最后,我们分别使用一个可见分类器和一个不可见分类器来预测相应的样本。在APTV99、ADTV68、AWA1、AWA2、CUB、SUN和FLO上的大量实验表明,该方法可以显著提高GZSL的性能。对于APTV99和ADTV68数据集,我们的方法在GZSL分类精度上比最先进的方法提高了5.2%和1.4%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信