基于半监督语义分割的无人机图像小麦植株计数

Hamza Mukhtar, Muhammad Zeeshan Khan, M. Usman Ghani Khan, T. Saba, R. Latif
{"title":"基于半监督语义分割的无人机图像小麦植株计数","authors":"Hamza Mukhtar, Muhammad Zeeshan Khan, M. Usman Ghani Khan, T. Saba, R. Latif","doi":"10.1109/CAIDA51941.2021.9425252","DOIUrl":null,"url":null,"abstract":"Plant counting in major grain crops like wheat through aerial images still poses a challenge due to the very high infield density of plants and occlusion. Annotation of aerial images for counting through perfect detection or segmentation is extremely difficult due to a large number of extremely small plant instances. In this paper, we present a semi-supervised method based on cross-consistency for the semantic segmentation of field images and an inception-based regression network for plant counting. Through loosely semantic segmentation, tiny plant clusters are extracted from the RGB image and fed to a regression network to get the count. Cross-consistency under the cluster assumption is a powerful semi-supervised training technique to leverage the unlabeled images. In this work, it is observed that regions with lower density are more detectable within hidden representations as compared to inputs. Supervised training of an encoder in a shared fashion and the main decoder is carried out on the RGB images and the corresponding mask. Consistency between the prediction of main and auxiliary decoders is imposed to leverage the unlabeled images. Induction of inception in the regression network benefits in extracting the multi-scale features which are very important because of quite tiny plant instances as compared to the whole image. The proposed plant counting framework achieves very high performance having a standard deviation of 0.94 and a mean of 0.87 of absolute difference in the count given the semi-supervised nature. Our network has performed reasonably well as compared to supervised detection and segmentation-based counting framework. Moreover, labeling for detection or segmentation is a quite tedious task, so our network has the leverage to train the model with few labeled and large numbers of unlabeled images which also provides the advantage to train the system for other crops like rice and maize with few labeled images.","PeriodicalId":272573,"journal":{"name":"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Wheat Plant Counting Using UAV Images Based on Semi-supervised Semantic Segmentation\",\"authors\":\"Hamza Mukhtar, Muhammad Zeeshan Khan, M. Usman Ghani Khan, T. Saba, R. Latif\",\"doi\":\"10.1109/CAIDA51941.2021.9425252\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Plant counting in major grain crops like wheat through aerial images still poses a challenge due to the very high infield density of plants and occlusion. Annotation of aerial images for counting through perfect detection or segmentation is extremely difficult due to a large number of extremely small plant instances. In this paper, we present a semi-supervised method based on cross-consistency for the semantic segmentation of field images and an inception-based regression network for plant counting. Through loosely semantic segmentation, tiny plant clusters are extracted from the RGB image and fed to a regression network to get the count. Cross-consistency under the cluster assumption is a powerful semi-supervised training technique to leverage the unlabeled images. In this work, it is observed that regions with lower density are more detectable within hidden representations as compared to inputs. Supervised training of an encoder in a shared fashion and the main decoder is carried out on the RGB images and the corresponding mask. Consistency between the prediction of main and auxiliary decoders is imposed to leverage the unlabeled images. Induction of inception in the regression network benefits in extracting the multi-scale features which are very important because of quite tiny plant instances as compared to the whole image. The proposed plant counting framework achieves very high performance having a standard deviation of 0.94 and a mean of 0.87 of absolute difference in the count given the semi-supervised nature. Our network has performed reasonably well as compared to supervised detection and segmentation-based counting framework. Moreover, labeling for detection or segmentation is a quite tedious task, so our network has the leverage to train the model with few labeled and large numbers of unlabeled images which also provides the advantage to train the system for other crops like rice and maize with few labeled images.\",\"PeriodicalId\":272573,\"journal\":{\"name\":\"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAIDA51941.2021.9425252\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAIDA51941.2021.9425252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

由于植物的内场密度和遮挡非常高,通过航空图像对小麦等主要粮食作物进行植物计数仍然存在挑战。由于航拍图像中存在大量极小的植物实例,因此通过完美的检测或分割对航拍图像进行标注进行计数是非常困难的。在本文中,我们提出了一种基于交叉一致性的半监督方法用于田间图像的语义分割,并提出了一种基于初始化的回归网络用于植物计数。通过松散语义分割,从RGB图像中提取微小植物簇,并将其输入回归网络进行计数。聚类假设下的交叉一致性是一种有效利用未标记图像的半监督训练技术。在这项工作中,观察到与输入相比,密度较低的区域在隐藏表示中更容易被检测到。在RGB图像和相应的掩码上以共享方式对编码器和主解码器进行监督训练。主解码器和辅助解码器预测之间的一致性是强加的,以利用未标记的图像。在回归网络中引入初始化有利于提取多尺度特征,这是非常重要的,因为与整个图像相比,植物实例非常小。所提出的植物计数框架实现了非常高的性能,在给定半监督性质的情况下,其绝对计数差的标准偏差为0.94,平均值为0.87。与监督检测和基于分割的计数框架相比,我们的网络表现得相当好。此外,标记用于检测或分割是一项相当繁琐的任务,因此我们的网络具有使用少量标记和大量未标记图像训练模型的优势,这也为使用少量标记图像训练其他作物(如水稻和玉米)的系统提供了优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Wheat Plant Counting Using UAV Images Based on Semi-supervised Semantic Segmentation
Plant counting in major grain crops like wheat through aerial images still poses a challenge due to the very high infield density of plants and occlusion. Annotation of aerial images for counting through perfect detection or segmentation is extremely difficult due to a large number of extremely small plant instances. In this paper, we present a semi-supervised method based on cross-consistency for the semantic segmentation of field images and an inception-based regression network for plant counting. Through loosely semantic segmentation, tiny plant clusters are extracted from the RGB image and fed to a regression network to get the count. Cross-consistency under the cluster assumption is a powerful semi-supervised training technique to leverage the unlabeled images. In this work, it is observed that regions with lower density are more detectable within hidden representations as compared to inputs. Supervised training of an encoder in a shared fashion and the main decoder is carried out on the RGB images and the corresponding mask. Consistency between the prediction of main and auxiliary decoders is imposed to leverage the unlabeled images. Induction of inception in the regression network benefits in extracting the multi-scale features which are very important because of quite tiny plant instances as compared to the whole image. The proposed plant counting framework achieves very high performance having a standard deviation of 0.94 and a mean of 0.87 of absolute difference in the count given the semi-supervised nature. Our network has performed reasonably well as compared to supervised detection and segmentation-based counting framework. Moreover, labeling for detection or segmentation is a quite tedious task, so our network has the leverage to train the model with few labeled and large numbers of unlabeled images which also provides the advantage to train the system for other crops like rice and maize with few labeled images.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信