Retinanet_G2S: a multi-scale feature fusion-based network for fruit detection of punna navel oranges in complex field environments

IF 5.4 2区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Hongxing Peng, Hu Chen, Xin Zhang, Huanai Liu, Keyin Chen, Juntao Xiong
{"title":"Retinanet_G2S: a multi-scale feature fusion-based network for fruit detection of punna navel oranges in complex field environments","authors":"Hongxing Peng, Hu Chen, Xin Zhang, Huanai Liu, Keyin Chen, Juntao Xiong","doi":"10.1007/s11119-023-10098-6","DOIUrl":null,"url":null,"abstract":"<p>In the natural environment, the detection and recognition process of Punna navel orange fruit using machine vision systems is affected by many factors, such as complex background, uneven light illumination, occlusions of branches and leaves and large variations in fruit size. To solve these problems of low accuracy in fruit detection and poor robustness of the detection algorithm in the field conditions, a new object detection algorithm, named Retinanet_G2S, was proposed in this paper based on the modified Retinanet network. The images of Punna navel orange were collected with Microsoft Kinect V2 in the uncontrolled environment. Firstly, a new Res2Net-GF network was designed to replace the section of feature extraction in the original Retinanet, which can potentially improve the learning ability of target features of the trunk network. Secondly, a multi-scale cross-regional feature fusion grids network was designed to replace the feature pyramid network module in the original Retinanet, which could enhance the ability of feature information fusion among different scales of the feature pyramid. Finally, the original border regression localization method in Retinanet network was optimized based on the accurate boundary box regression algorithm. The study results showed that, compared with the original Retinanet network, Retinanet_G2S improved mAP, mAP50, mAP75, mAP<sub>S</sub>, mAP<sub>M</sub> and mAP<sub>L</sub> by 3.8%, 1.7%, 5.8%, 2.4%, 2.1% and 5.5%, respectively. Moreover, compared with 7 types of classic object detection models, including SSD, YOLOv3, CenterNet, CornerNet, FCOS, Faster-RCNN and Retinanet, the average increase in mAP of Retinanet_G2S was 9.11%. Overall, Retinanet_G2S showed a promising optimization effect, particularly for the detection of small targets and overlapping fruits.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"19 1","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Precision Agriculture","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11119-023-10098-6","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

In the natural environment, the detection and recognition process of Punna navel orange fruit using machine vision systems is affected by many factors, such as complex background, uneven light illumination, occlusions of branches and leaves and large variations in fruit size. To solve these problems of low accuracy in fruit detection and poor robustness of the detection algorithm in the field conditions, a new object detection algorithm, named Retinanet_G2S, was proposed in this paper based on the modified Retinanet network. The images of Punna navel orange were collected with Microsoft Kinect V2 in the uncontrolled environment. Firstly, a new Res2Net-GF network was designed to replace the section of feature extraction in the original Retinanet, which can potentially improve the learning ability of target features of the trunk network. Secondly, a multi-scale cross-regional feature fusion grids network was designed to replace the feature pyramid network module in the original Retinanet, which could enhance the ability of feature information fusion among different scales of the feature pyramid. Finally, the original border regression localization method in Retinanet network was optimized based on the accurate boundary box regression algorithm. The study results showed that, compared with the original Retinanet network, Retinanet_G2S improved mAP, mAP50, mAP75, mAPS, mAPM and mAPL by 3.8%, 1.7%, 5.8%, 2.4%, 2.1% and 5.5%, respectively. Moreover, compared with 7 types of classic object detection models, including SSD, YOLOv3, CenterNet, CornerNet, FCOS, Faster-RCNN and Retinanet, the average increase in mAP of Retinanet_G2S was 9.11%. Overall, Retinanet_G2S showed a promising optimization effect, particularly for the detection of small targets and overlapping fruits.

Abstract Image

Retinanet_G2S:基于多尺度特征融合的网络,用于在复杂的田间环境中检测番泻叶脐橙果实
在自然环境中,使用机器视觉系统检测和识别蓬莱脐橙果实的过程会受到很多因素的影响,如复杂的背景、不均匀的光照、枝叶的遮挡以及果实大小的巨大变化等。为了解决果实检测精度低、检测算法在现场条件下鲁棒性差等问题,本文基于改进的 Retinanet 网络提出了一种新的物体检测算法,命名为 Retinanet_G2S。本文使用 Microsoft Kinect V2 在非受控环境下采集蓬莱脐橙的图像。首先,设计了一个新的 Res2Net-GF 网络来替代原有 Retinanet 网络中的特征提取部分,从而有可能提高主干网络对目标特征的学习能力。其次,设计了一个多尺度跨区域特征融合网格网络,以替代原有 Retinanet 中的特征金字塔网络模块,从而提高不同尺度特征金字塔之间的特征信息融合能力。最后,基于精确的边界盒回归算法,优化了 Retinanet 网络中原有的边界回归定位方法。研究结果表明,与原始 Retinanet 网络相比,Retinanet_G2S 的 mAP、mAP50、mAP75、mAPS、mAPM 和 mAPL 分别提高了 3.8%、1.7%、5.8%、2.4%、2.1% 和 5.5%。此外,与 SSD、YOLOv3、CenterNet、CornerNet、FCOS、Faster-RCNN 和 Retinanet 等 7 种经典物体检测模型相比,Retinanet_G2S 的 mAP 平均提高了 9.11%。总体而言,Retinanet_G2S 显示出了良好的优化效果,尤其是在检测小目标和重叠果实方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Precision Agriculture
Precision Agriculture 农林科学-农业综合
CiteScore
12.30
自引率
8.10%
发文量
103
审稿时长
>24 weeks
期刊介绍: Precision Agriculture promotes the most innovative results coming from the research in the field of precision agriculture. It provides an effective forum for disseminating original and fundamental research and experience in the rapidly advancing area of precision farming. There are many topics in the field of precision agriculture; therefore, the topics that are addressed include, but are not limited to: Natural Resources Variability: Soil and landscape variability, digital elevation models, soil mapping, geostatistics, geographic information systems, microclimate, weather forecasting, remote sensing, management units, scale, etc. Managing Variability: Sampling techniques, site-specific nutrient and crop protection chemical recommendation, crop quality, tillage, seed density, seed variety, yield mapping, remote sensing, record keeping systems, data interpretation and use, crops (corn, wheat, sugar beets, potatoes, peanut, cotton, vegetables, etc.), management scale, etc. Engineering Technology: Computers, positioning systems, DGPS, machinery, tillage, planting, nutrient and crop protection implements, manure, irrigation, fertigation, yield monitor and mapping, soil physical and chemical characteristic sensors, weed/pest mapping, etc. Profitability: MEY, net returns, BMPs, optimum recommendations, crop quality, technology cost, sustainability, social impacts, marketing, cooperatives, farm scale, crop type, etc. Environment: Nutrient, crop protection chemicals, sediments, leaching, runoff, practices, field, watershed, on/off farm, artificial drainage, ground water, surface water, etc. Technology Transfer: Skill needs, education, training, outreach, methods, surveys, agri-business, producers, distance education, Internet, simulations models, decision support systems, expert systems, on-farm experimentation, partnerships, quality of rural life, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信