A Semi-self-supervised Learning Approach for Wheat Head Detection using Extremely Small Number of Labeled Samples

Keyhan Najafian, Ali Ghanbari, I. Stavness, Lingling Jin, G. Shirdel, Farhad Maleki
{"title":"A Semi-self-supervised Learning Approach for Wheat Head Detection using Extremely Small Number of Labeled Samples","authors":"Keyhan Najafian, Ali Ghanbari, I. Stavness, Lingling Jin, G. Shirdel, Farhad Maleki","doi":"10.1109/ICCVW54120.2021.00155","DOIUrl":null,"url":null,"abstract":"Most of the success of deep learning is owed to supervised learning, where a large-scale annotated dataset is used for model training. However, developing such datasets is challenging. In this paper, we develop a semi-self-supervised learning approach for wheat head detection. The proposed method utilized a few short video clips and only one annotated image from each video clip of wheat fields to simulate a large computationally annotated dataset used for model building. Considering the domain gap be-tween the simulated and real images, we applied two do-main adaptation steps to alleviate the challenge of distributional shift. The resulting model achieved high performance when applied to real unannotated datasets. When fine-tuned on the dataset from the Global Wheat Head Detection Challenge, the performance was further improved. The model achieved a mean average precision of 0.827, where an over-lap of 50% or more between a predicted bounding box and ground truth was considered as a correct prediction. Al-though the utility of the proposed methodology was shown by applying it to wheat head detection, the proposed method is not limited to this application and could be used for other domains, such as detecting different crop types, alleviating the barrier of lack of large-scale annotated datasets in those domains.","PeriodicalId":226794,"journal":{"name":"2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCVW54120.2021.00155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Most of the success of deep learning is owed to supervised learning, where a large-scale annotated dataset is used for model training. However, developing such datasets is challenging. In this paper, we develop a semi-self-supervised learning approach for wheat head detection. The proposed method utilized a few short video clips and only one annotated image from each video clip of wheat fields to simulate a large computationally annotated dataset used for model building. Considering the domain gap be-tween the simulated and real images, we applied two do-main adaptation steps to alleviate the challenge of distributional shift. The resulting model achieved high performance when applied to real unannotated datasets. When fine-tuned on the dataset from the Global Wheat Head Detection Challenge, the performance was further improved. The model achieved a mean average precision of 0.827, where an over-lap of 50% or more between a predicted bounding box and ground truth was considered as a correct prediction. Al-though the utility of the proposed methodology was shown by applying it to wheat head detection, the proposed method is not limited to this application and could be used for other domains, such as detecting different crop types, alleviating the barrier of lack of large-scale annotated datasets in those domains.
基于极少量标记样本的小麦抽穗检测半自监督学习方法
深度学习的大部分成功归功于监督学习,其中使用大规模带注释的数据集进行模型训练。然而,开发这样的数据集是具有挑战性的。本文提出了一种小麦抽头检测的半自监督学习方法。该方法利用几个短视频片段和每个麦田视频片段仅一个注释图像来模拟用于模型构建的大型计算注释数据集。考虑到模拟图像和真实图像之间的域差距,我们采用了两个主要的自适应步骤来缓解分布转移的挑战。当应用于真实的无注释数据集时,所得到的模型取得了很高的性能。当对来自全球麦穗检测挑战的数据集进行微调时,性能进一步提高。该模型的平均精度为0.827,其中预测的边界框与地面真实值之间50%或更多的重叠被认为是正确的预测。尽管所提出的方法的实用性通过将其应用于麦穗检测来证明,但所提出的方法不仅限于此应用,而且可以用于其他领域,例如检测不同的作物类型,从而缓解了这些领域缺乏大规模注释数据集的障碍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信