Paddy Fields Segmentation using Fully Convolutional Network with Pyramid Pooling Module

Siti Raihanah Abdani, M. A. Zulkifley, Muhammad Nazir Siham, Nurshafiza Zanal Abiddin, N. A. Aziz
{"title":"Paddy Fields Segmentation using Fully Convolutional Network with Pyramid Pooling Module","authors":"Siti Raihanah Abdani, M. A. Zulkifley, Muhammad Nazir Siham, Nurshafiza Zanal Abiddin, N. A. Aziz","doi":"10.1109/ISTT50966.2020.9279341","DOIUrl":null,"url":null,"abstract":"One of the initiatives by the Malaysian government to reduce foreign dependency on staple food stock is by giving subsidies to the rice farmers. The amount received by them directly correlates with the cultivated paddy areas that include subsistence in fertilizers, seeds, and machinery. Hence, it is important for the Malaysian government to identify the exact areas that have been cultivated so that the subsidies will reach the targeted groups correctly. Currently, the surveying process is done manually by filed observer, which is a costly and tedious process. Hence, a remote sensing approach is proposed for an automated surveying system that semantically segments the satellite images of the paddy fields according to the intended class. A deep learning approach is adopted where a fully convolutional network with spatial pyramid pooling (SPP) module is designed to segment the images into four types of class, which are cultivated areas, uncultivated areas, backgrounds, and others. The encoder backbone of the network is based on VGG16, where the SPP module is comprised of four parallel branches of multiscale feature maps. The up-sample process is done through two layers of transposed convolution, where the output will be resized back according to the input image. The results show that the proposed network with SPP kernel set of 4x4, 5x5, 6x6, and 7x7 returns the best performance with a mean accuracy of 0.9869 and Jaccard index of 0.8326. The model faced its biggest training challenge when the clouds obstructed the surface information, which makes the areas uninformative. In the future, the network can be further improved by adding feed-forward layers and residual skip connections that help in reducing the zero gradient diminishing problem.","PeriodicalId":345344,"journal":{"name":"2020 IEEE 5th International Symposium on Telecommunication Technologies (ISTT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 5th International Symposium on Telecommunication Technologies (ISTT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTT50966.2020.9279341","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

One of the initiatives by the Malaysian government to reduce foreign dependency on staple food stock is by giving subsidies to the rice farmers. The amount received by them directly correlates with the cultivated paddy areas that include subsistence in fertilizers, seeds, and machinery. Hence, it is important for the Malaysian government to identify the exact areas that have been cultivated so that the subsidies will reach the targeted groups correctly. Currently, the surveying process is done manually by filed observer, which is a costly and tedious process. Hence, a remote sensing approach is proposed for an automated surveying system that semantically segments the satellite images of the paddy fields according to the intended class. A deep learning approach is adopted where a fully convolutional network with spatial pyramid pooling (SPP) module is designed to segment the images into four types of class, which are cultivated areas, uncultivated areas, backgrounds, and others. The encoder backbone of the network is based on VGG16, where the SPP module is comprised of four parallel branches of multiscale feature maps. The up-sample process is done through two layers of transposed convolution, where the output will be resized back according to the input image. The results show that the proposed network with SPP kernel set of 4x4, 5x5, 6x6, and 7x7 returns the best performance with a mean accuracy of 0.9869 and Jaccard index of 0.8326. The model faced its biggest training challenge when the clouds obstructed the surface information, which makes the areas uninformative. In the future, the network can be further improved by adding feed-forward layers and residual skip connections that help in reducing the zero gradient diminishing problem.
基于金字塔池化模块的全卷积网络稻田分割
马来西亚政府为减少外国对主食库存的依赖而采取的举措之一是向稻农提供补贴。他们的收入与稻田种植面积直接相关,包括维持生计的肥料、种子和机械。因此,马来西亚政府必须确定种植的确切区域,以便补贴能够正确地到达目标群体。目前,测量过程是由现场观测员手工完成的,这是一个昂贵且繁琐的过程。因此,提出了一种遥感方法,用于根据预期类别对水田卫星图像进行语义分割的自动测量系统。采用深度学习方法,设计具有空间金字塔池(SPP)模块的全卷积网络,将图像划分为四种类型,分别是耕地、荒地、背景和其他。网络的编码器骨干基于VGG16,其中SPP模块由多尺度特征图的四个并行分支组成。上采样过程是通过两层转置卷积完成的,其中输出将根据输入图像调整大小。结果表明,SPP核集为4x4、5x5、6x6和7x7的网络返回的性能最好,平均准确率为0.9869,Jaccard指数为0.8326。当云层遮挡了表面信息时,模型面临着最大的训练挑战,这使得区域不具有信息。在未来,网络可以通过增加前馈层和残余的跳跃连接来进一步改进,这有助于减少零梯度递减问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信