改进了在分布式学习环境下运行的基于神经网络的引水代理的实现

2013 IEEE 14th International Conference on Information Reuse & Integration (IRI) Pub Date : 2013-10-24 DOI:10.1109/IRI.2013.6642481

Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos

{"title":"改进了在分布式学习环境下运行的基于神经网络的引水代理的实现","authors":"Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos","doi":"10.1109/IRI.2013.6642481","DOIUrl":null,"url":null,"abstract":"This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.","PeriodicalId":418492,"journal":{"name":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","volume":"214 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Improving the accomplishment of a neural network based agent for draughts that operates in a distributed learning environment\",\"authors\":\"Lidia Bononi Paiva Tomaz, Rita Maria Silva Julia, Ayres Roberto Araújo Barcelos\",\"doi\":\"10.1109/IRI.2013.6642481\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.\",\"PeriodicalId\":418492,\"journal\":{\"name\":\"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)\",\"volume\":\"214 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2013.6642481\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2013.6642481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

本文介绍了系统d - visiondrafts的扩展:一个基于多层感知器神经网络的跳棋玩家代理，它在分布式环境中运行，并且以一种与当前世界冠军奇努克不同的方式，它在没有人类监督的情况下进行学习。网络权值的更新采用时间差分法，采用自玩克隆技术。最佳走法是由并行的阿尔法-贝塔搜索算法选择的，该算法被称为Young Brothers Wait Concept。游戏棋盘状态的表示基于。net featuremap技术(描述草稿游戏固有特征的函数)。本文研究了d - visiondrafts通过插入允许更精确地表示板状态的新特征而获得的改进。此外，作者还展示了新处理器的添加在多大程度上补偿了训练时间的增加，这将是优化棋盘状态表示的一个明显结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving the accomplishment of a neural network based agent for draughts that operates in a distributed learning environment

This article presents an extension to the system D-VisionDraughts: a draughts player agent based on a MultiLayer Perceptron Neural Network which operates in a distributed environment, and in a manner which distinguishes it from the current world champion Chinook, it learns without human supervision. The network weights are updated by Temporal Differences Methods using self-play with cloning technique. The best move is chosen by the parallel Alpha-Beta search algorithm called Young Brothers Wait Concept. The representation of the game board states is based on the NET-FEATUREMAP techniques (functions describing features inherent to Draughts game). This paper investigates the improvement obtained by D-VisionDraughts through the insertion of new features that allow a more precise representation of the board states. Further, the authors show to what extent the addition of new processors compensates the increase in training time that would be an obvious consequence of the optimization of the board state representation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)

自引率

0.00%

发文量