改进了在分布式学习环境下运行的基于神经网络的引水代理的实现

Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos
{"title":"改进了在分布式学习环境下运行的基于神经网络的引水代理的实现","authors":"Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos","doi":"10.1109/IRI.2013.6642481","DOIUrl":null,"url":null,"abstract":"This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.","PeriodicalId":418492,"journal":{"name":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","volume":"214 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Improving the accomplishment of a neural network based agent for draughts that operates in a distributed learning environment\",\"authors\":\"Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos\",\"doi\":\"10.1109/IRI.2013.6642481\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.\",\"PeriodicalId\":418492,\"journal\":{\"name\":\"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)\",\"volume\":\"214 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2013.6642481\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2013.6642481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

本文介绍了系统d - visiondrafts的扩展:一个基于多层感知器神经网络的跳棋玩家代理,它在分布式环境中运行,并且以一种与当前世界冠军奇努克不同的方式,它在没有人类监督的情况下进行学习。网络权值的更新采用时间差分法,采用自玩克隆技术。最佳走法是由并行的阿尔法-贝塔搜索算法选择的,该算法被称为Young Brothers Wait Concept。游戏棋盘状态的表示基于。net featuremap技术(描述草稿游戏固有特征的函数)。本文研究了d - visiondrafts通过插入允许更精确地表示板状态的新特征而获得的改进。此外,作者还展示了新处理器的添加在多大程度上补偿了训练时间的增加,这将是优化棋盘状态表示的一个明显结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improving the accomplishment of a neural network based agent for draughts that operates in a distributed learning environment
This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信