{"title":"基于Flappy bridge玩法的深度强化学习分析","authors":"Zhixuan He","doi":"10.1109/ICICT55905.2022.00025","DOIUrl":null,"url":null,"abstract":"In recent years, machine learning has become a popular area in the IT field. In 2016, a game AI AlphaGo turned out to defeat many Go masters. The technology behind AlphaGo: the popularity of deep reinforcement learning is gradually increasing. This paper aims to use deep reinforcement learning algorithm to train a specific model which can play the Flappy Bird game autonomously. In this paper, two types of optimizers: Adam and RMSProp are used in the deep neural network during the training sessions and the testing results with the models are compared to figure out which model has a better performance when playing Flappy Bird. From the testing results, when the training rounds is insufficient, the model with Adam optimizer performs better than the model with RMSProp optimizer. However, when the training rounds are large enough, the performance of the model with RMSProp optimizer is almost 2 times better than the model with Adam optimizer. After comparison between the two models, this paper finds out that with the increasing of the training rounds, the performance of model with RMSProp optimizer will gradually exceed the model with Adam optimizer.","PeriodicalId":273927,"journal":{"name":"2022 5th International Conference on Information and Computer Technologies (ICICT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis on Deep Reinforcement Learning with Flappy Brid Gameplay\",\"authors\":\"Zhixuan He\",\"doi\":\"10.1109/ICICT55905.2022.00025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, machine learning has become a popular area in the IT field. In 2016, a game AI AlphaGo turned out to defeat many Go masters. The technology behind AlphaGo: the popularity of deep reinforcement learning is gradually increasing. This paper aims to use deep reinforcement learning algorithm to train a specific model which can play the Flappy Bird game autonomously. In this paper, two types of optimizers: Adam and RMSProp are used in the deep neural network during the training sessions and the testing results with the models are compared to figure out which model has a better performance when playing Flappy Bird. From the testing results, when the training rounds is insufficient, the model with Adam optimizer performs better than the model with RMSProp optimizer. However, when the training rounds are large enough, the performance of the model with RMSProp optimizer is almost 2 times better than the model with Adam optimizer. After comparison between the two models, this paper finds out that with the increasing of the training rounds, the performance of model with RMSProp optimizer will gradually exceed the model with Adam optimizer.\",\"PeriodicalId\":273927,\"journal\":{\"name\":\"2022 5th International Conference on Information and Computer Technologies (ICICT)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Information and Computer Technologies (ICICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICT55905.2022.00025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT55905.2022.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis on Deep Reinforcement Learning with Flappy Brid Gameplay
In recent years, machine learning has become a popular area in the IT field. In 2016, a game AI AlphaGo turned out to defeat many Go masters. The technology behind AlphaGo: the popularity of deep reinforcement learning is gradually increasing. This paper aims to use deep reinforcement learning algorithm to train a specific model which can play the Flappy Bird game autonomously. In this paper, two types of optimizers: Adam and RMSProp are used in the deep neural network during the training sessions and the testing results with the models are compared to figure out which model has a better performance when playing Flappy Bird. From the testing results, when the training rounds is insufficient, the model with Adam optimizer performs better than the model with RMSProp optimizer. However, when the training rounds are large enough, the performance of the model with RMSProp optimizer is almost 2 times better than the model with Adam optimizer. After comparison between the two models, this paper finds out that with the increasing of the training rounds, the performance of model with RMSProp optimizer will gradually exceed the model with Adam optimizer.