RT-Net:基于高级深度学习框架的植物表型语义分割网络

Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao
{"title":"RT-Net:基于高级深度学习框架的植物表型语义分割网络","authors":"Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao","doi":"10.1145/3581807.3581831","DOIUrl":null,"url":null,"abstract":"Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms","PeriodicalId":292813,"journal":{"name":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RT-Net: Plant Phenotype Semantic Segmentation Network Based on Advanced Deep Learning Framework\",\"authors\":\"Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao\",\"doi\":\"10.1145/3581807.3581831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms\",\"PeriodicalId\":292813,\"journal\":{\"name\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3581807.3581831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3581807.3581831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

以非接触的方式从植物图像中定量获取植物表型是一项非常具有挑战性的任务,这在很大程度上依赖于植物图像的准确分割。以往的方法主要是利用U-Net网络结构和注意机制来获得相应的植物表型分割结果。但是,U-Net结构和注意机制相对落后,其方法在开源数据集上只能达到98.47%的Dice分数,对于最近的植物表型分割任务来说仍然不够,需要进一步改进以进行更详细的研究。因此,针对现有植物表型语义切分模型切分性能较低的问题,本文提出了一种基于先进深度学习框架的语义切分网络RT-Net。具体来说,该网络主要采用了deeplabv3+的编解码器网络结构,网络的编码部分采用了效率更高的RepVGG作为骨干网络进行局部特征提取。同时,与传统的亚特劳斯空间金字塔池(ASPP)相比,本文设计了基于亚特劳斯空间金字塔池的变压器(Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT模块,通过全局自适应方法提取更多的全局特征信息,获得更密集的植物表型。解码部分对编码部分的输出进行特征融合,然后上采样恢复尺度,最后得到语义分割结果。实验结果表明,该网络在CVPPP14竞赛发布的拟南芥植物数据集上的Dice得分达到99.33%,与其他先进的植物场分割算法相比,具有更好的分割能力
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RT-Net: Plant Phenotype Semantic Segmentation Network Based on Advanced Deep Learning Framework
Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信