A Semi-self-supervised Learning Approach for Wheat Head Detection using Extremely Small Number of Labeled Samples

2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) Pub Date : 2021-10-01 DOI:10.1109/ICCVW54120.2021.00155

Keyhan Najafian, Ali Ghanbari, I. Stavness, Lingling Jin, G. Shirdel, Farhad Maleki

{"title":"A Semi-self-supervised Learning Approach for Wheat Head Detection using Extremely Small Number of Labeled Samples","authors":"Keyhan Najafian, Ali Ghanbari, I. Stavness, Lingling Jin, G. Shirdel, Farhad Maleki","doi":"10.1109/ICCVW54120.2021.00155","DOIUrl":null,"url":null,"abstract":"Most of the success of deep learning is owed to supervised learning, where a large-scale annotated dataset is used for model training. However, developing such datasets is challenging. In this paper, we develop a semi-self-supervised learning approach for wheat head detection. The proposed method utilized a few short video clips and only one annotated image from each video clip of wheat fields to simulate a large computationally annotated dataset used for model building. Considering the domain gap be-tween the simulated and real images, we applied two do-main adaptation steps to alleviate the challenge of distributional shift. The resulting model achieved high performance when applied to real unannotated datasets. When fine-tuned on the dataset from the Global Wheat Head Detection Challenge, the performance was further improved. The model achieved a mean average precision of 0.827, where an over-lap of 50% or more between a predicted bounding box and ground truth was considered as a correct prediction. Al-though the utility of the proposed methodology was shown by applying it to wheat head detection, the proposed method is not limited to this application and could be used for other domains, such as detecting different crop types, alleviating the barrier of lack of large-scale annotated datasets in those domains.","PeriodicalId":226794,"journal":{"name":"2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCVW54120.2021.00155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Most of the success of deep learning is owed to supervised learning, where a large-scale annotated dataset is used for model training. However, developing such datasets is challenging. In this paper, we develop a semi-self-supervised learning approach for wheat head detection. The proposed method utilized a few short video clips and only one annotated image from each video clip of wheat fields to simulate a large computationally annotated dataset used for model building. Considering the domain gap be-tween the simulated and real images, we applied two do-main adaptation steps to alleviate the challenge of distributional shift. The resulting model achieved high performance when applied to real unannotated datasets. When fine-tuned on the dataset from the Global Wheat Head Detection Challenge, the performance was further improved. The model achieved a mean average precision of 0.827, where an over-lap of 50% or more between a predicted bounding box and ground truth was considered as a correct prediction. Al-though the utility of the proposed methodology was shown by applying it to wheat head detection, the proposed method is not limited to this application and could be used for other domains, such as detecting different crop types, alleviating the barrier of lack of large-scale annotated datasets in those domains.

查看原文本刊更多论文

基于极少量标记样本的小麦抽穗检测半自监督学习方法

深度学习的大部分成功归功于监督学习，其中使用大规模带注释的数据集进行模型训练。然而，开发这样的数据集是具有挑战性的。本文提出了一种小麦抽头检测的半自监督学习方法。该方法利用几个短视频片段和每个麦田视频片段仅一个注释图像来模拟用于模型构建的大型计算注释数据集。考虑到模拟图像和真实图像之间的域差距，我们采用了两个主要的自适应步骤来缓解分布转移的挑战。当应用于真实的无注释数据集时，所得到的模型取得了很高的性能。当对来自全球麦穗检测挑战的数据集进行微调时，性能进一步提高。该模型的平均精度为0.827，其中预测的边界框与地面真实值之间50%或更多的重叠被认为是正确的预测。尽管所提出的方法的实用性通过将其应用于麦穗检测来证明，但所提出的方法不仅限于此应用，而且可以用于其他领域，例如检测不同的作物类型，从而缓解了这些领域缺乏大规模注释数据集的障碍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)

自引率

0.00%

发文量