{"title":"Automatic Hyperparameter Tuning in Deep Convolutional Neural Networks Using Asynchronous Reinforcement Learning","authors":"P. Neary","doi":"10.1109/ICCC.2018.00017","DOIUrl":null,"url":null,"abstract":"Major gains have been made in recent years in object recognition due to advances in deep neural networks. One struggle with deep learning, however, revolves around the fact that currently it is unknown what network architecture is best for a given problem. Consequently, different configurations are tried until one is identified that gives acceptable results. This paper proposes an asynchronous reinforcement learning algorithm that finds an optimal network configuration by automatically adjusting parameters for a given problem. It is shown that asynchronous reinforcement learning is able to converge on an optimal solution for the MNIST data set.","PeriodicalId":306012,"journal":{"name":"2018 IEEE International Conference on Cognitive Computing (ICCC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Cognitive Computing (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC.2018.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
Major gains have been made in recent years in object recognition due to advances in deep neural networks. One struggle with deep learning, however, revolves around the fact that currently it is unknown what network architecture is best for a given problem. Consequently, different configurations are tried until one is identified that gives acceptable results. This paper proposes an asynchronous reinforcement learning algorithm that finds an optimal network configuration by automatically adjusting parameters for a given problem. It is shown that asynchronous reinforcement learning is able to converge on an optimal solution for the MNIST data set.