Alessandro Rocco Denarda, Francesco Crocetti, Gabriele Costante, Paolo Valigi, Mario Luca Fravolini
{"title":"MangoDetNet: 新型标签效率高的弱监督水果检测框架","authors":"Alessandro Rocco Denarda, Francesco Crocetti, Gabriele Costante, Paolo Valigi, Mario Luca Fravolini","doi":"10.1007/s11119-024-10187-0","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Purpose</h3><p>Fruit detection and counting represent one of the most important steps toward yield estimation and a well-known practice for farmers, on which they base the management of the harvesting, storage, and distribution phases of agricultural products. In the era of precision agriculture, yield estimation, which was previously performed only by human operators, is currently being re-designed through the employment of Artificial Intelligence and Computer Vision techniques. Despite the impressive results that AI has demonstrated in fruit detection systems, they rely on large image datasets, whose availability is still limited if compared to the great number of crop typologies. For this reason, great interest has recently been devoted to weakly supervised algorithms, which can reduce the dataset annotation effort required by using simple image-level labels.</p><h3 data-test=\"abstract-sub-heading\">Method</h3><p>Based on these considerations, this work proposes a new method relying on a sample-efficient weakly supervised approach. The proposed system, named MangoDetNet, is trained through a two-stage curriculum learning approach, first involving an image reconstruction task, and secondly an image binary classification task for heatmap generation. In particular, during the first stage, the network is trained in an unsupervised manner for the image reconstruction task, in order to promote the learning of robust feature extractors that are customized for the fruit scenarios. The second stage of training, instead, is performed to achieve image binary classification, employing presence/absence binary labels. This phase further refines the feature extractor from the previous stage and favors the computation of more refined and precise activation maps.</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>As demonstrated through the experimental campaign, performed on a mango orchard image dataset, MangoDetNet is able to outperform the state-of-the-art weakly supervised approaches, providing an F1 score equal to 0.861, which is on par with those of fully supervised methods, and an F1 score equal to 0.856 when halving the number of labeled samples needed for training.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":null,"pages":null},"PeriodicalIF":5.4000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MangoDetNet: a novel label-efficient weakly supervised fruit detection framework\",\"authors\":\"Alessandro Rocco Denarda, Francesco Crocetti, Gabriele Costante, Paolo Valigi, Mario Luca Fravolini\",\"doi\":\"10.1007/s11119-024-10187-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Purpose</h3><p>Fruit detection and counting represent one of the most important steps toward yield estimation and a well-known practice for farmers, on which they base the management of the harvesting, storage, and distribution phases of agricultural products. In the era of precision agriculture, yield estimation, which was previously performed only by human operators, is currently being re-designed through the employment of Artificial Intelligence and Computer Vision techniques. Despite the impressive results that AI has demonstrated in fruit detection systems, they rely on large image datasets, whose availability is still limited if compared to the great number of crop typologies. For this reason, great interest has recently been devoted to weakly supervised algorithms, which can reduce the dataset annotation effort required by using simple image-level labels.</p><h3 data-test=\\\"abstract-sub-heading\\\">Method</h3><p>Based on these considerations, this work proposes a new method relying on a sample-efficient weakly supervised approach. The proposed system, named MangoDetNet, is trained through a two-stage curriculum learning approach, first involving an image reconstruction task, and secondly an image binary classification task for heatmap generation. In particular, during the first stage, the network is trained in an unsupervised manner for the image reconstruction task, in order to promote the learning of robust feature extractors that are customized for the fruit scenarios. The second stage of training, instead, is performed to achieve image binary classification, employing presence/absence binary labels. This phase further refines the feature extractor from the previous stage and favors the computation of more refined and precise activation maps.</p><h3 data-test=\\\"abstract-sub-heading\\\">Conclusion</h3><p>As demonstrated through the experimental campaign, performed on a mango orchard image dataset, MangoDetNet is able to outperform the state-of-the-art weakly supervised approaches, providing an F1 score equal to 0.861, which is on par with those of fully supervised methods, and an F1 score equal to 0.856 when halving the number of labeled samples needed for training.</p>\",\"PeriodicalId\":20423,\"journal\":{\"name\":\"Precision Agriculture\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2024-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Precision Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1007/s11119-024-10187-0\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Precision Agriculture","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11119-024-10187-0","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
目的水果检测和计数是产量估算最重要的步骤之一,也是农民众所周知的做法,他们据此对农产品的收获、储存和销售阶段进行管理。在精准农业时代,以前只能由人类操作员完成的产量估算工作,目前正在通过人工智能和计算机视觉技术进行重新设计。尽管人工智能在水果检测系统中取得了令人印象深刻的成果,但它们依赖于大型图像数据集,而与大量作物类型相比,这些数据集的可用性仍然有限。基于这个原因,最近人们对弱监督算法产生了浓厚的兴趣,因为这种算法可以通过使用简单的图像级标签来减少所需的数据集注释工作。所提出的系统名为 MangoDetNet,通过两阶段课程学习方法进行训练,第一阶段涉及图像重建任务,第二阶段涉及生成热图的图像二元分类任务。其中,在第一阶段,网络以无监督的方式进行图像重建任务的训练,以促进针对水果场景定制的鲁棒特征提取器的学习。第二阶段的训练则是采用存在/不存在二进制标签,实现图像二进制分类。结论 正如在芒果园图像数据集上进行的实验活动所证明的那样,MangoDetNet 的表现优于最先进的弱监督方法,其 F1 分数为 0.861,与完全监督方法相当,而将训练所需的标记样本数量减半后,其 F1 分数为 0.856。
MangoDetNet: a novel label-efficient weakly supervised fruit detection framework
Purpose
Fruit detection and counting represent one of the most important steps toward yield estimation and a well-known practice for farmers, on which they base the management of the harvesting, storage, and distribution phases of agricultural products. In the era of precision agriculture, yield estimation, which was previously performed only by human operators, is currently being re-designed through the employment of Artificial Intelligence and Computer Vision techniques. Despite the impressive results that AI has demonstrated in fruit detection systems, they rely on large image datasets, whose availability is still limited if compared to the great number of crop typologies. For this reason, great interest has recently been devoted to weakly supervised algorithms, which can reduce the dataset annotation effort required by using simple image-level labels.
Method
Based on these considerations, this work proposes a new method relying on a sample-efficient weakly supervised approach. The proposed system, named MangoDetNet, is trained through a two-stage curriculum learning approach, first involving an image reconstruction task, and secondly an image binary classification task for heatmap generation. In particular, during the first stage, the network is trained in an unsupervised manner for the image reconstruction task, in order to promote the learning of robust feature extractors that are customized for the fruit scenarios. The second stage of training, instead, is performed to achieve image binary classification, employing presence/absence binary labels. This phase further refines the feature extractor from the previous stage and favors the computation of more refined and precise activation maps.
Conclusion
As demonstrated through the experimental campaign, performed on a mango orchard image dataset, MangoDetNet is able to outperform the state-of-the-art weakly supervised approaches, providing an F1 score equal to 0.861, which is on par with those of fully supervised methods, and an F1 score equal to 0.856 when halving the number of labeled samples needed for training.
期刊介绍:
Precision Agriculture promotes the most innovative results coming from the research in the field of precision agriculture. It provides an effective forum for disseminating original and fundamental research and experience in the rapidly advancing area of precision farming.
There are many topics in the field of precision agriculture; therefore, the topics that are addressed include, but are not limited to:
Natural Resources Variability: Soil and landscape variability, digital elevation models, soil mapping, geostatistics, geographic information systems, microclimate, weather forecasting, remote sensing, management units, scale, etc.
Managing Variability: Sampling techniques, site-specific nutrient and crop protection chemical recommendation, crop quality, tillage, seed density, seed variety, yield mapping, remote sensing, record keeping systems, data interpretation and use, crops (corn, wheat, sugar beets, potatoes, peanut, cotton, vegetables, etc.), management scale, etc.
Engineering Technology: Computers, positioning systems, DGPS, machinery, tillage, planting, nutrient and crop protection implements, manure, irrigation, fertigation, yield monitor and mapping, soil physical and chemical characteristic sensors, weed/pest mapping, etc.
Profitability: MEY, net returns, BMPs, optimum recommendations, crop quality, technology cost, sustainability, social impacts, marketing, cooperatives, farm scale, crop type, etc.
Environment: Nutrient, crop protection chemicals, sediments, leaching, runoff, practices, field, watershed, on/off farm, artificial drainage, ground water, surface water, etc.
Technology Transfer: Skill needs, education, training, outreach, methods, surveys, agri-business, producers, distance education, Internet, simulations models, decision support systems, expert systems, on-farm experimentation, partnerships, quality of rural life, etc.