{"title":"深度强化学习的选择性数据收集方法","authors":"Tao Wang, Haiyang Yang, Zhiyong Tan, Yao Yu","doi":"10.1109/YAC57282.2022.10023607","DOIUrl":null,"url":null,"abstract":"In deep reinforcement learning, reinforcement learning is responsible for interacting with the environment to produce data, and artificial neural networks are responsible for value function fitting. It is observed that artificial neural networks converged differently to different inputs, which, in our analysis, is due to imbalanced data. Therefore, we propose selective data collection to boost the quality of the data by then discarding the excess data. It has been proved experimentally that our method can significantly contribute to the convergence rate of the reinforcement learning algorithm.","PeriodicalId":272227,"journal":{"name":"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Selective Data Collection Method for Deep Reinforcement Learning\",\"authors\":\"Tao Wang, Haiyang Yang, Zhiyong Tan, Yao Yu\",\"doi\":\"10.1109/YAC57282.2022.10023607\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In deep reinforcement learning, reinforcement learning is responsible for interacting with the environment to produce data, and artificial neural networks are responsible for value function fitting. It is observed that artificial neural networks converged differently to different inputs, which, in our analysis, is due to imbalanced data. Therefore, we propose selective data collection to boost the quality of the data by then discarding the excess data. It has been proved experimentally that our method can significantly contribute to the convergence rate of the reinforcement learning algorithm.\",\"PeriodicalId\":272227,\"journal\":{\"name\":\"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/YAC57282.2022.10023607\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/YAC57282.2022.10023607","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Selective Data Collection Method for Deep Reinforcement Learning
In deep reinforcement learning, reinforcement learning is responsible for interacting with the environment to produce data, and artificial neural networks are responsible for value function fitting. It is observed that artificial neural networks converged differently to different inputs, which, in our analysis, is due to imbalanced data. Therefore, we propose selective data collection to boost the quality of the data by then discarding the excess data. It has been proved experimentally that our method can significantly contribute to the convergence rate of the reinforcement learning algorithm.