Chang Xu, Heng Ding, Xuejian Zhang, Cong Wang, Hongji Yang
{"title":"A Data-Efficient Method of Deep Reinforcement Learning for Chinese Chess","authors":"Chang Xu, Heng Ding, Xuejian Zhang, Cong Wang, Hongji Yang","doi":"10.1109/QRS-C57518.2022.00109","DOIUrl":null,"url":null,"abstract":"The computer game is the Drosophila in the field of artificial intelligence. Recently, a series of computer game systems., such as AlphaGo and AlphaGo Zero, defeating the world human champion of Go, has greatly refreshed people's understanding of the creativity of machine. This paper applies the deep reinforcement learning method to the computer Chinese Chess. We are committed to decrease the demand for computing resources heavily from multi-perspectives, such as data augmentation and using more intermediate results as labels. The experiment shows that the level of our program is increased rapidly.","PeriodicalId":183728,"journal":{"name":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QRS-C57518.2022.00109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The computer game is the Drosophila in the field of artificial intelligence. Recently, a series of computer game systems., such as AlphaGo and AlphaGo Zero, defeating the world human champion of Go, has greatly refreshed people's understanding of the creativity of machine. This paper applies the deep reinforcement learning method to the computer Chinese Chess. We are committed to decrease the demand for computing resources heavily from multi-perspectives, such as data augmentation and using more intermediate results as labels. The experiment shows that the level of our program is increased rapidly.