主体行为策略的遗传编码

Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160) Pub Date : 1998-07-03 DOI:10.1109/ICMAS.1998.699234

Stéphane Calderoni, P. Marcenac, R. Courdier

{"title":"主体行为策略的遗传编码","authors":"Stéphane Calderoni, P. Marcenac, R. Courdier","doi":"10.1109/ICMAS.1998.699234","DOIUrl":null,"url":null,"abstract":"The general framework tackled in this paper is the automatic generation of intelligent collective behaviors using genetic programming and reinforcement teaming. We define a behavior-based system relying on automatic design process using artificial evolution to synthesize high level behaviors for autonomous agents. Behavioral strategies are described by tree-based structures, and manipulated by generic evolving processes. Each strategy is dynamically evaluated during simulation, and is weighted by an adaptation function as a quality factor that reflects its relevance as good solution for the learning task. It is computed using heterogeneous reinforcement techniques associating immediate reinforcements and delayed reinforcements as dynamic progress estimators.","PeriodicalId":244857,"journal":{"name":"Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Genetic encoding of agent behavioral strategy\",\"authors\":\"Stéphane Calderoni, P. Marcenac, R. Courdier\",\"doi\":\"10.1109/ICMAS.1998.699234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The general framework tackled in this paper is the automatic generation of intelligent collective behaviors using genetic programming and reinforcement teaming. We define a behavior-based system relying on automatic design process using artificial evolution to synthesize high level behaviors for autonomous agents. Behavioral strategies are described by tree-based structures, and manipulated by generic evolving processes. Each strategy is dynamically evaluated during simulation, and is weighted by an adaptation function as a quality factor that reflects its relevance as good solution for the learning task. It is computed using heterogeneous reinforcement techniques associating immediate reinforcements and delayed reinforcements as dynamic progress estimators.\",\"PeriodicalId\":244857,\"journal\":{\"name\":\"Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMAS.1998.699234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMAS.1998.699234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文讨论的总体框架是利用遗传规划和强化团队来自动生成智能集体行为。我们定义了一个基于自动设计过程的基于行为的系统，使用人工进化来合成自主代理的高级行为。行为策略由树状结构描述，并由一般进化过程控制。每个策略在模拟过程中被动态评估，并通过一个适应函数作为质量因子加权，反映其作为学习任务的良好解决方案的相关性。采用非均质加固技术，将即时加固和延迟加固作为动态进度估计量进行计算。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Genetic encoding of agent behavioral strategy

The general framework tackled in this paper is the automatic generation of intelligent collective behaviors using genetic programming and reinforcement teaming. We define a behavior-based system relying on automatic design process using artificial evolution to synthesize high level behaviors for autonomous agents. Behavioral strategies are described by tree-based structures, and manipulated by generic evolving processes. Each strategy is dynamically evaluated during simulation, and is weighted by an adaptation function as a quality factor that reflects its relevance as good solution for the learning task. It is computed using heterogeneous reinforcement techniques associating immediate reinforcements and delayed reinforcements as dynamic progress estimators.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160)

自引率

0.00%

发文量