Jigsaw Self-Supervised Visual Representation Learning: An Applied Comparative Analysis Study
Yomna A. Kawashti, D. Khattab, M. Aref
2022 2nd International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), published 2022-05-08
DOI: 10.1109/MIUCC55081.2022.9781725
Citations: 0
Abstract
Self-supervised learning has been gaining momentum in the computer vision community as a promising contender to replace supervised learning. It aims to leverage unlabeled data by training a network on a proxy task and then transferring the learned representations to a downstream task. Jigsaw is one of the proxy tasks used to learn better feature representations in self-supervised learning. In this work, we comparatively evaluated the transferability of jigsaw pretraining using different architectures and a different dataset for jigsaw training. The features extracted from each convolutional block were evaluated on a unified downstream task. The best performance was achieved by the shallower AlexNet architecture, whose second convolutional block yielded the best transferability with a mean average precision of 36.17. We conclude that this behavior could be attributed to the smaller scale of the dataset we used: features extracted from earlier, shallower blocks transferred better to a dataset from a different domain.
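The jigsaw proxy task described in the abstract shuffles the patches of an image and trains a network to predict which permutation was applied. A minimal NumPy sketch of the patch-shuffling step is shown below; it assumes the standard 3x3 grid formulation, and the function name and toy image are illustrative, not taken from the paper:

```python
import numpy as np

def make_jigsaw_example(image, permutation):
    """Split a square image into a 3x3 grid of patches and reorder
    them according to `permutation` (a sequence of indices 0..8).
    In jigsaw pretraining, the index of `permutation` within a fixed
    permutation set serves as the classification label."""
    h, w = image.shape[:2]
    ph, pw = h // 3, w // 3
    # Row-major list of the nine patches.
    patches = [image[r * ph:(r + 1) * ph, c * pw:(c + 1) * pw]
               for r in range(3) for c in range(3)]
    return [patches[i] for i in permutation]

# Toy usage: a 9x9 "image" where each 3x3 patch is filled with its
# own row-major index, so the shuffle is easy to verify by eye.
image = np.kron(np.arange(9).reshape(3, 3), np.ones((3, 3), dtype=int))
permutation = [8, 7, 6, 5, 4, 3, 2, 1, 0]  # one entry of a permutation set
shuffled = make_jigsaw_example(image, permutation)
```

In the full pretext task, each shuffled patch is passed through a shared convolutional backbone and the concatenated features feed a classifier that predicts the permutation index.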