用于苹果园水果检测和定位的改进型 DeepLabv3+ 架构的性能分析

IF 6.3 Q1 AGRICULTURAL ENGINEERING
Prabhakar Maheswari , Purushothaman Raja , Manoj Karkee , Mugundhan Raja , Rahmath Ulla Baig , Kiet Tran Trung , Vinh Truong Hoang
{"title":"用于苹果园水果检测和定位的改进型 DeepLabv3+ 架构的性能分析","authors":"Prabhakar Maheswari ,&nbsp;Purushothaman Raja ,&nbsp;Manoj Karkee ,&nbsp;Mugundhan Raja ,&nbsp;Rahmath Ulla Baig ,&nbsp;Kiet Tran Trung ,&nbsp;Vinh Truong Hoang","doi":"10.1016/j.atech.2024.100729","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy (<span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>) of 98.58 % and the Mean Intersection over Union (<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span>) of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> of 92.12 % and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>and<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 77.5 % and 77.27 %, respectively whereas U-Net resulted a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"10 ","pages":"Article 100729"},"PeriodicalIF":6.3000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance analysis of modified DeepLabv3+ architecture for fruit detection and localization in apple orchards\",\"authors\":\"Prabhakar Maheswari ,&nbsp;Purushothaman Raja ,&nbsp;Manoj Karkee ,&nbsp;Mugundhan Raja ,&nbsp;Rahmath Ulla Baig ,&nbsp;Kiet Tran Trung ,&nbsp;Vinh Truong Hoang\",\"doi\":\"10.1016/j.atech.2024.100729\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy (<span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>) of 98.58 % and the Mean Intersection over Union (<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span>) of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> of 92.12 % and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>and<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 77.5 % and 77.27 %, respectively whereas U-Net resulted a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.</div></div>\",\"PeriodicalId\":74813,\"journal\":{\"name\":\"Smart agricultural technology\",\"volume\":\"10 \",\"pages\":\"Article 100729\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Smart agricultural technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772375524003332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524003332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

深度学习在水果作物生产的各种自动化操作中发挥着重要作用,包括灌溉、营养管理、产量估计和收获。产量估算在水果作物生产中至关重要,因为它能帮助农民优化种植、收获、物流和营销操作。此外,水果检测和定位是开发水果自动收获系统的重要步骤。因此,本研究提出了一种智能系统,利用改进的 DeepLabv3+、基于语义分割的架构进行苹果果实检测和定位。对 DeepLabv3+ 的原始架构进行了微调定制(如修改激活函数、优化技术和损失函数),并对其性能进行了分析。修改后的模型使用 2600 张苹果树图像的训练数据集进行训练。图像分为 80% 的训练图像和 20% 的验证图像。修改后的架构还与 DeepLabv3+ 架构的其他变体进行了比较。训练结束后,该模型在包含 101 张图像的未观察测试数据集上进行了测试。测试结果表明,在不影响推理时间(即 15 毫秒)的情况下,平均准确率(MAcc)为 98.58%,平均联合交叉率(MIoU)为 96.66%。在推理时间为 40 毫秒的相同数据集上,原模型的 MAcc 为 92.12%,MIoU 为 88.94%。为了进一步确定结果,我们将修改后的模型与其他单级检测器进行了比较,包括全卷积网络(FCN)和 U-Net。FCN 的 MAcc 和 MIoU 分别为 77.5 % 和 77.27 %,而 U-Net 的 MAcc 和 MIoU 分别为 83.95 % 和 81.09 %。结果表明,改进后的 DeepLabv3+ 与 ResNet18 能够减轻单级检测器的主要缺点--类不平衡的影响,从而检测出苹果水果。此外,对苹果果实进行更好的检测和定位可帮助机器人系统进行精确采摘。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Performance analysis of modified DeepLabv3+ architecture for fruit detection and localization in apple orchards
Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy (MAcc) of 98.58 % and the Mean Intersection over Union (MIoU) of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a MAcc of 92.12 % and MIoU of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a MAccandMIoU of 77.5 % and 77.27 %, respectively whereas U-Net resulted a MAcc and MIoU of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信