Spatial Transformer Networks for Curriculum Learning

Fatemeh Azimi, J. Nies, Sebastián M. Palacio, Federico Raue, Jörn Hees, A. Dengel
Published in: 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
DOI: 10.1109/DICTA56598.2022.10034595 (https://doi.org/10.1109/DICTA56598.2022.10034595)
Publication date (as listed): 2021-08-22
Citations: 1

Abstract

Curriculum learning is a bio-inspired training technique widely adopted in machine learning to improve the optimization of neural networks, in terms of either convergence rate or final accuracy. The main idea is to start training with simpler tasks and gradually increase the level of difficulty. A natural question, therefore, is how to determine or generate these simpler tasks. In this work, we take inspiration from Spatial Transformer Networks (STNs) to form an easy-to-hard curriculum. Because STNs have been shown to remove clutter from input images and to yield higher accuracy in image classification tasks, we hypothesize that images processed by STNs can be regarded as easier tasks and used for curriculum learning. To this end, we study multiple strategies for shaping the training curriculum using the data generated by STNs. We perform experiments on the cluttered MNIST and Fashion-MNIST datasets; on the former, we obtain an improvement of 3.8 percentage points (pp) in classification accuracy over the baseline, indicating that STNs can serve as a tool for generating the easy-to-hard training schedule required for curriculum learning.
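The easy-to-hard idea can be illustrated with a small sketch. The abstract does not specify which curriculum strategies were studied, so the linear mixing schedule below is purely an assumption for illustration, not the authors' method; `easy_fraction` and `build_epoch` are hypothetical names, and the string placeholders stand in for STN-processed ("easy") and original cluttered ("hard") images.

```python
import random


def easy_fraction(epoch: int, total_epochs: int) -> float:
    """Hypothetical linear schedule (an assumption, not from the paper):
    1.0 at epoch 0, decreasing to 0.0 at the final epoch."""
    if total_epochs <= 1:
        return 0.0
    return max(0.0, 1.0 - epoch / (total_epochs - 1))


def build_epoch(easy, hard, epoch, total_epochs, rng):
    """For each sample index, choose the easy version (e.g. an STN-processed
    image) with probability easy_fraction(...), otherwise the hard version
    (the original cluttered image)."""
    p_easy = easy_fraction(epoch, total_epochs)
    return [e if rng.random() < p_easy else h for e, h in zip(easy, hard)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Placeholder samples; in practice these would be paired image tensors.
    easy = [f"easy_{i}" for i in range(8)]
    hard = [f"hard_{i}" for i in range(8)]
    total_epochs = 5
    for epoch in range(total_epochs):
        samples = build_epoch(easy, hard, epoch, total_epochs, rng)
        print(epoch, easy_fraction(epoch, total_epochs), samples)
```

Under this assumed schedule the first epoch sees only easy samples and the last epoch only hard ones; other schedules (step-wise, or ordering by a difficulty score) would fit the same interface.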