Spatial Transformer Networks for Curriculum Learning
Fatemeh Azimi, J. Nies, Sebastián M. Palacio, Federico Raue, Jörn Hees, A. Dengel
2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
DOI: 10.1109/DICTA56598.2022.10034595
Abstract
Curriculum learning is a bio-inspired training technique widely adopted in machine learning to improve the optimization of neural networks, whether in convergence rate or in final accuracy. Its main idea is to start training with simpler tasks and gradually increase the level of difficulty. A natural question is therefore how to determine or generate these simpler tasks. In this work, we take inspiration from Spatial Transformer Networks (STNs) to form an easy-to-hard curriculum. Since STNs have proven capable of removing clutter from input images and achieving higher accuracy in image classification, we hypothesize that images processed by STNs constitute easier tasks and can be used to build the curriculum. To this end, we study multiple strategies for shaping the training curriculum from data generated by STNs. We run experiments on the cluttered MNIST and Fashion-MNIST datasets; on the former, we obtain an improvement of 3.8 percentage points in classification accuracy over the baseline, indicating that STNs can serve as a tool for generating the easy-to-hard training schedule required for curriculum learning.
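The abstract describes two ingredients: an STN that produces "decluttered" (easier) versions of the input images, and a schedule that moves training from those easy images toward the original hard ones. The following PyTorch sketch illustrates one way this could fit together. It is a minimal illustration, not the authors' implementation: the localization-network architecture, the assumption that the STN is pretrained, and the linear decay schedule are all our own illustrative choices (the paper compares several curriculum strategies).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class STN(nn.Module):
    """Minimal spatial transformer: a localization network predicts a 2x3
    affine matrix, which resamples the input so the object is centered and
    cropped, pushing clutter out of frame. Architecture is illustrative."""

    def __init__(self, in_ch: int = 1):
        super().__init__()
        self.loc = nn.Sequential(
            nn.Conv2d(in_ch, 8, kernel_size=7), nn.MaxPool2d(2), nn.ReLU(),
            nn.Conv2d(8, 10, kernel_size=5), nn.MaxPool2d(2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4),  # fixed 4x4 map regardless of image size
        )
        self.fc = nn.Sequential(
            nn.Linear(10 * 4 * 4, 32), nn.ReLU(), nn.Linear(32, 6)
        )
        # Initialize to the identity transform so the STN starts by passing
        # images through unchanged.
        self.fc[-1].weight.data.zero_()
        self.fc[-1].bias.data.copy_(torch.tensor([1., 0., 0., 0., 1., 0.]))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        theta = self.fc(self.loc(x).flatten(1)).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)


def curriculum_batch(x: torch.Tensor, stn: STN,
                     epoch: int, total_epochs: int) -> torch.Tensor:
    """Easy-to-hard schedule (hypothetical helper): replace each cluttered
    image with its STN-cleaned version with a probability that decays
    linearly over training, so early epochs are dominated by easy images
    and later epochs by the original, cluttered ones."""
    p_easy = max(0.0, 1.0 - epoch / total_epochs)  # illustrative linear decay
    with torch.no_grad():  # the (pretrained) STN only generates data here
        x_easy = stn(x)
    use_easy = (torch.rand(x.size(0), device=x.device) < p_easy).view(-1, 1, 1, 1)
    return torch.where(use_easy, x_easy, x)
```

In this sketch, the STN would first be trained jointly with a classifier on the cluttered data so that its output actually removes clutter; `curriculum_batch` then feeds a fresh classifier a mixture of cleaned and original images that hardens as epochs progress.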