MRSU2Net：从个体目标到群体目标的群体生菜语义分割新方法

IF 8.9 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2024-10-20 DOI:10.1016/j.compag.2024.109560

Pan Zhang, Daoliang Li

{"title":"MRSU2Net：从个体目标到群体目标的群体生菜语义分割新方法","authors":"Pan Zhang, Daoliang Li","doi":"10.1016/j.compag.2024.109560","DOIUrl":null,"url":null,"abstract":"<div><div>Semantic segmentation methods have played an important role in a wide range of applications, as they contribute to more accurate phenotypic information extraction in the field of plant phenotype. However, the high annotation cost of semantic segmentation datasets remains a major challenge, and most of them are constructed and validated on training and testing datasets with similar scales. Most studies overlook its effectiveness on multi-scale datasets, especially on low resolution datasets. Although some semantic segmentation methods extract and learn multi-scale features from datasets through methods such as multi-scale feature fusion modules and attention mechanisms, the model’s scale down compatibility, i.e. the segmentation reliability of the model on low resolution datasets, has not yet been verified. To address this challenge, this study proposes for the first time a new approach to plant object oriented semantic segmentation, which involves modeling individual target datasets and validating group target datasets. This modeling approach can significantly reduce the annotation cost of datasets to some extent. On this basis, we propose a multi-scale feature fusion module (MSFAF-M) for multi-level feature relationship exploration and a multi receptive field feature fusion module (MRFFF-S) for single-layer feature relationship exploration. By applying MSFAF-M and MRFFF-S to U2Net, an upgraded semantic segmentation method MRSU2Net is proposed, which can fully extract global and local feature information of target objects at multiple scales, and improve the segmentation reliability of semantic segmentation models based on individual target datasets on multi-scale group target datasets. Due to the fact that the construction approach of the semantic segmentation model proposed in this study is different from traditional semantic segmentation methods, we validated the scale down compatibility of MRSU2Net on the target dataset of lettuce populations collected at the seedling stage. When MRSU2Net is applied to group target images with the same resolution (2992 × 2992), the MIoU is 0.9719 and the inference-time is 0.3550. When MRSU2Net is applied to group target images of the same input size (224 × 224), the MIoU can reach 0.7346 and the inference time is 0.0219. The results demonstrate that the segmentation performance of the MRSU2Net constructed in this study is significantly superior to other classic semantic segmentation methods in low resolution images.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"227 ","pages":"Article 109560"},"PeriodicalIF":8.9000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MRSU2Net: A novel method for semantic segmentation of group lettuce from individual Objectives to group Objectives\",\"authors\":\"Pan Zhang, Daoliang Li\",\"doi\":\"10.1016/j.compag.2024.109560\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Semantic segmentation methods have played an important role in a wide range of applications, as they contribute to more accurate phenotypic information extraction in the field of plant phenotype. However, the high annotation cost of semantic segmentation datasets remains a major challenge, and most of them are constructed and validated on training and testing datasets with similar scales. Most studies overlook its effectiveness on multi-scale datasets, especially on low resolution datasets. Although some semantic segmentation methods extract and learn multi-scale features from datasets through methods such as multi-scale feature fusion modules and attention mechanisms, the model’s scale down compatibility, i.e. the segmentation reliability of the model on low resolution datasets, has not yet been verified. To address this challenge, this study proposes for the first time a new approach to plant object oriented semantic segmentation, which involves modeling individual target datasets and validating group target datasets. This modeling approach can significantly reduce the annotation cost of datasets to some extent. On this basis, we propose a multi-scale feature fusion module (MSFAF-M) for multi-level feature relationship exploration and a multi receptive field feature fusion module (MRFFF-S) for single-layer feature relationship exploration. By applying MSFAF-M and MRFFF-S to U2Net, an upgraded semantic segmentation method MRSU2Net is proposed, which can fully extract global and local feature information of target objects at multiple scales, and improve the segmentation reliability of semantic segmentation models based on individual target datasets on multi-scale group target datasets. Due to the fact that the construction approach of the semantic segmentation model proposed in this study is different from traditional semantic segmentation methods, we validated the scale down compatibility of MRSU2Net on the target dataset of lettuce populations collected at the seedling stage. When MRSU2Net is applied to group target images with the same resolution (2992 × 2992), the MIoU is 0.9719 and the inference-time is 0.3550. When MRSU2Net is applied to group target images of the same input size (224 × 224), the MIoU can reach 0.7346 and the inference time is 0.0219. The results demonstrate that the segmentation performance of the MRSU2Net constructed in this study is significantly superior to other classic semantic segmentation methods in low resolution images.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"227 \",\"pages\":\"Article 109560\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2024-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169924009517\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924009517","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

语义分割方法在植物表型领域有助于更准确地提取表型信息，因此在广泛的应用中发挥了重要作用。然而，语义分割数据集的高注释成本仍是一大挑战，而且大多数语义分割方法都是在相似尺度的训练和测试数据集上构建和验证的。大多数研究忽视了其在多尺度数据集上的有效性，尤其是在低分辨率数据集上。虽然一些语义分割方法通过多尺度特征融合模块和注意力机制等方法从数据集中提取和学习多尺度特征，但模型的缩放兼容性，即模型在低分辨率数据集上的分割可靠性尚未得到验证。为应对这一挑战，本研究首次提出了一种新的面向植物对象的语义分割方法，包括对单个目标数据集建模和对群体目标数据集进行验证。这种建模方法可以在一定程度上大大降低数据集的标注成本。在此基础上，我们提出了用于探索多层次特征关系的多尺度特征融合模块（MSFAF-M）和用于探索单层特征关系的多感受野特征融合模块（MRFFF-S）。通过将 MSFAF-M 和 MRFFF-S 应用于 U2Net，提出了一种升级版的语义分割方法 MRSU2Net，该方法能在多个尺度上充分提取目标对象的全局和局部特征信息，提高了基于单个目标数据集的语义分割模型对多尺度群体目标数据集的分割可靠性。由于本研究提出的语义分割模型的构建方法不同于传统的语义分割方法，我们在苗期采集的莴苣种群目标数据集上验证了 MRSU2Net 的缩放兼容性。当 MRSU2Net 应用于相同分辨率（2992 × 2992）的目标图像组时，MIoU 为 0.9719，推理时间为 0.3550。当 MRSU2Net 应用于相同输入尺寸（224 × 224）的组目标图像时，MIoU 可达到 0.7346，推理时间为 0.0219。结果表明，在低分辨率图像中，本研究构建的 MRSU2Net 的分割性能明显优于其他经典语义分割方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MRSU2Net: A novel method for semantic segmentation of group lettuce from individual Objectives to group Objectives

Semantic segmentation methods have played an important role in a wide range of applications, as they contribute to more accurate phenotypic information extraction in the field of plant phenotype. However, the high annotation cost of semantic segmentation datasets remains a major challenge, and most of them are constructed and validated on training and testing datasets with similar scales. Most studies overlook its effectiveness on multi-scale datasets, especially on low resolution datasets. Although some semantic segmentation methods extract and learn multi-scale features from datasets through methods such as multi-scale feature fusion modules and attention mechanisms, the model’s scale down compatibility, i.e. the segmentation reliability of the model on low resolution datasets, has not yet been verified. To address this challenge, this study proposes for the first time a new approach to plant object oriented semantic segmentation, which involves modeling individual target datasets and validating group target datasets. This modeling approach can significantly reduce the annotation cost of datasets to some extent. On this basis, we propose a multi-scale feature fusion module (MSFAF-M) for multi-level feature relationship exploration and a multi receptive field feature fusion module (MRFFF-S) for single-layer feature relationship exploration. By applying MSFAF-M and MRFFF-S to U2Net, an upgraded semantic segmentation method MRSU2Net is proposed, which can fully extract global and local feature information of target objects at multiple scales, and improve the segmentation reliability of semantic segmentation models based on individual target datasets on multi-scale group target datasets. Due to the fact that the construction approach of the semantic segmentation model proposed in this study is different from traditional semantic segmentation methods, we validated the scale down compatibility of MRSU2Net on the target dataset of lettuce populations collected at the seedling stage. When MRSU2Net is applied to group target images with the same resolution (2992 × 2992), the MIoU is 0.9719 and the inference-time is 0.3550. When MRSU2Net is applied to group target images of the same input size (224 × 224), the MIoU can reach 0.7346 and the inference time is 0.0219. The results demonstrate that the segmentation performance of the MRSU2Net constructed in this study is significantly superior to other classic semantic segmentation methods in low resolution images.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.