{"title":"强化学习在传统电子游戏中的表现","authors":"Yuanxi Sun","doi":"10.1109/AIAM54119.2021.00063","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is a model structure that learns and updates the model based on receiving the reward and feedback of certain environments. In recent researches, reinforcement learning has been shown to be very effective in a well- defined or simulated environment to identify optimal strategies for solving given problems. This work test the performance of several Deep Reinforcement Learning model structures on a classic video game, Flappy bird. This project wants to further analyze the performance of different settings of deep Q networks and policy gradient models, to see how different model structures affect the training speed, accuracy, and efficiency. The author has found that the Double DQN model can achieve the best score among all deep learning models, whereas the Q-table model can achieve a better score with a proper feature extraction method, as the environment is rather simple.","PeriodicalId":227320,"journal":{"name":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance of Reinforcement Learning on Traditional Video Games\",\"authors\":\"Yuanxi Sun\",\"doi\":\"10.1109/AIAM54119.2021.00063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning is a model structure that learns and updates the model based on receiving the reward and feedback of certain environments. In recent researches, reinforcement learning has been shown to be very effective in a well- defined or simulated environment to identify optimal strategies for solving given problems. This work test the performance of several Deep Reinforcement Learning model structures on a classic video game, Flappy bird. This project wants to further analyze the performance of different settings of deep Q networks and policy gradient models, to see how different model structures affect the training speed, accuracy, and efficiency. The author has found that the Double DQN model can achieve the best score among all deep learning models, whereas the Q-table model can achieve a better score with a proper feature extraction method, as the environment is rather simple.\",\"PeriodicalId\":227320,\"journal\":{\"name\":\"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIAM54119.2021.00063\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIAM54119.2021.00063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance of Reinforcement Learning on Traditional Video Games
Reinforcement learning is a model structure that learns and updates the model based on receiving the reward and feedback of certain environments. In recent researches, reinforcement learning has been shown to be very effective in a well- defined or simulated environment to identify optimal strategies for solving given problems. This work test the performance of several Deep Reinforcement Learning model structures on a classic video game, Flappy bird. This project wants to further analyze the performance of different settings of deep Q networks and policy gradient models, to see how different model structures affect the training speed, accuracy, and efficiency. The author has found that the Double DQN model can achieve the best score among all deep learning models, whereas the Q-table model can achieve a better score with a proper feature extraction method, as the environment is rather simple.