Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao
{"title":"RT-Net:基于高级深度学习框架的植物表型语义分割网络","authors":"Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao","doi":"10.1145/3581807.3581831","DOIUrl":null,"url":null,"abstract":"Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms","PeriodicalId":292813,"journal":{"name":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RT-Net: Plant Phenotype Semantic Segmentation Network Based on Advanced Deep Learning Framework\",\"authors\":\"Chengui Fu, Wenbiao Xie, Yin Jin, Kai Zhao, Qiuming Liu, He Xiao\",\"doi\":\"10.1145/3581807.3581831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms\",\"PeriodicalId\":292813,\"journal\":{\"name\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3581807.3581831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3581807.3581831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
以非接触的方式从植物图像中定量获取植物表型是一项非常具有挑战性的任务,这在很大程度上依赖于植物图像的准确分割。以往的方法主要是利用U-Net网络结构和注意机制来获得相应的植物表型分割结果。但是,U-Net结构和注意机制相对落后,其方法在开源数据集上只能达到98.47%的Dice分数,对于最近的植物表型分割任务来说仍然不够,需要进一步改进以进行更详细的研究。因此,针对现有植物表型语义切分模型切分性能较低的问题,本文提出了一种基于先进深度学习框架的语义切分网络RT-Net。具体来说,该网络主要采用了deeplabv3+的编解码器网络结构,网络的编码部分采用了效率更高的RepVGG作为骨干网络进行局部特征提取。同时,与传统的亚特劳斯空间金字塔池(ASPP)相比,本文设计了基于亚特劳斯空间金字塔池的变压器(Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT模块,通过全局自适应方法提取更多的全局特征信息,获得更密集的植物表型。解码部分对编码部分的输出进行特征融合,然后上采样恢复尺度,最后得到语义分割结果。实验结果表明,该网络在CVPPP14竞赛发布的拟南芥植物数据集上的Dice得分达到99.33%,与其他先进的植物场分割算法相比,具有更好的分割能力
RT-Net: Plant Phenotype Semantic Segmentation Network Based on Advanced Deep Learning Framework
Quantitatively deriving plant phenotypes from plant images in a non-contact manner is a very challenging task that relies heavily on the accurate segmentation of plant images. Previous methods mainly used the U-Net network structure and attention mechanism to obtain the corresponding plant phenotype segmentation results. However, the U-Net structure and attention mechanism are relatively outdated, and its method can only achieve a Dice score of 98.47% on the open source dataset, which is still insufficient for the recent plant phenotype segmentation task and needs to be further improved for more detailed research. Therefore, in view of the low segmentation performance of existing plant phenotype semantic segmentation models, this paper proposes a semantic segmentation network RT-Net based on an advanced deep learning framework. Specifically, the network mainly adopts the encoder-decoder network structure of deeplabv3+, and the encoding part of the network adopts the more efficient RepVGG as the backbone network for local feature extraction. At the same time, compared with the traditional Atrous Spatial Pyramid Pooling (ASPP), this paper designs the (Atrous Spatial Pyramid Pooling Based Transformer)ASPPBT module to extract more global feature information through a global adaptive method to obtain denser plant phenotypes. The decoding part performs feature fusion on the output of the encoding part, and then uses upsampling to restore the scale, and finally obtains the semantic segmentation result. The experimental results show that the proposed network has achieved a Dice score of 99.33% on the Arabidopsis plant dataset released by the CVPPP14 competition, and has better segmentation ability compared with other advanced plant field segmentation algorithms