Maturity recognition and localisation of broccoli under occlusion based on RGB-D instance segmentation network

IF 4.4 1区 农林科学 Q1 AGRICULTURAL ENGINEERING
Shuo Kang , Jiali Fan , Yongkai Ye , Chenglong Li , Dongdong Du , Jun Wang
{"title":"Maturity recognition and localisation of broccoli under occlusion based on RGB-D instance segmentation network","authors":"Shuo Kang ,&nbsp;Jiali Fan ,&nbsp;Yongkai Ye ,&nbsp;Chenglong Li ,&nbsp;Dongdong Du ,&nbsp;Jun Wang","doi":"10.1016/j.biosystemseng.2025.01.007","DOIUrl":null,"url":null,"abstract":"<div><div>Selective harvesting robots for broccoli face significant challenges in field operations, where occlusions by leaves and stems, varying maturity stages and lighting interferences greatly affect performance. Addressing the need for a robust network capable of maturity recognition and localisation under various occlusion conditions for spherical crops, OccluInst—a single-stage instance segmentation network based on RGB-D and CNN-Transformer architecture was proposed. The solution is to make full use of visible information and crop characteristics. This model builds a dual-branch cross-modal calibration framework to generate instance-aware kernels and segmentation mask features. The proposed Attention Weight Interactive Fusion Module (AWIF) enhances the fusion efficiency of multi-scale RGB and depth features in complex scenarios, while the designed Adaptive Fusion Ratio Module (AFR) filters out noisy depth data and extracts valuable information to achieve feature alignment. Additionally, the developed Material Awareness Module (MA) highlights critical areas, improving feature extraction for irregular, multi-scale targets. The improved circular boundary anchor box accurately localises broccoli under various levels of occlusion. Ablation studies confirm the effectiveness of each module. OccluInst can swiftly and accurately identify the maturity categories and coordinates of broccoli under different occlusion levels. It achieves a mAP<sub>50</sub> of 86.2% and mAR of 83.5%, with an average centre point deviation of 3.68 pixels on images with a resolution of 848 × 480, and a detection speed of 51.4 frames per second, providing a robust visual foundation for selective harvesting robots.</div></div>","PeriodicalId":9173,"journal":{"name":"Biosystems Engineering","volume":"250 ","pages":"Pages 270-284"},"PeriodicalIF":4.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosystems Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1537511025000078","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

Selective harvesting robots for broccoli face significant challenges in field operations, where occlusions by leaves and stems, varying maturity stages and lighting interferences greatly affect performance. Addressing the need for a robust network capable of maturity recognition and localisation under various occlusion conditions for spherical crops, OccluInst—a single-stage instance segmentation network based on RGB-D and CNN-Transformer architecture was proposed. The solution is to make full use of visible information and crop characteristics. This model builds a dual-branch cross-modal calibration framework to generate instance-aware kernels and segmentation mask features. The proposed Attention Weight Interactive Fusion Module (AWIF) enhances the fusion efficiency of multi-scale RGB and depth features in complex scenarios, while the designed Adaptive Fusion Ratio Module (AFR) filters out noisy depth data and extracts valuable information to achieve feature alignment. Additionally, the developed Material Awareness Module (MA) highlights critical areas, improving feature extraction for irregular, multi-scale targets. The improved circular boundary anchor box accurately localises broccoli under various levels of occlusion. Ablation studies confirm the effectiveness of each module. OccluInst can swiftly and accurately identify the maturity categories and coordinates of broccoli under different occlusion levels. It achieves a mAP50 of 86.2% and mAR of 83.5%, with an average centre point deviation of 3.68 pixels on images with a resolution of 848 × 480, and a detection speed of 51.4 frames per second, providing a robust visual foundation for selective harvesting robots.

Abstract Image

求助全文
约1分钟内获得全文 求助全文
来源期刊
Biosystems Engineering
Biosystems Engineering 农林科学-农业工程
CiteScore
10.60
自引率
7.80%
发文量
239
审稿时长
53 days
期刊介绍: Biosystems Engineering publishes research in engineering and the physical sciences that represent advances in understanding or modelling of the performance of biological systems for sustainable developments in land use and the environment, agriculture and amenity, bioproduction processes and the food chain. The subject matter of the journal reflects the wide range and interdisciplinary nature of research in engineering for biological systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信