Towards a Generic Methodology for Evaluating MAS Performance

2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems Pub Date : 2007-06-11 DOI:10.1109/KIMAS.2007.369805

C. Dimou, A. Symeonidis, P. Mitkas

{"title":"Towards a Generic Methodology for Evaluating MAS Performance","authors":"C. Dimou, A. Symeonidis, P. Mitkas","doi":"10.1109/KIMAS.2007.369805","DOIUrl":null,"url":null,"abstract":"As agent technology (AT) becomes a well-established engineering field of computing, the need for generalized, standardized methodologies for agent evaluation is imperative. Despite the plethora of available development tools and theories that researchers in agent computing have access to, there is a remarkable lack of general metrics, tools, benchmarks and experimental methods for formal validation and comparison of existing or newly developed systems. It is argued that AT has reached a certain degree of maturity, and it is therefore feasible to move from ad-hoc, domain-specific evaluation methods to standardized, repeatable and easily verifiable procedures. In this paper, we outline a first attempt towards a generic evaluation methodology for MAS performance. Instead of following the research path towards defining more powerful mathematical description tools for determining intelligence and performance metrics, we adopt an engineering point of view to the problem of deploying a methodology that is both implementation and domain independent. The proposed methodology consists of a concise set of steps, novel theoretical representation tools and appropriate software tools that assist evaluators in selecting the appropriate metrics, undertaking measurement and aggregation techniques for the system at hand","PeriodicalId":193808,"journal":{"name":"2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems","volume":"138 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KIMAS.2007.369805","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

As agent technology (AT) becomes a well-established engineering field of computing, the need for generalized, standardized methodologies for agent evaluation is imperative. Despite the plethora of available development tools and theories that researchers in agent computing have access to, there is a remarkable lack of general metrics, tools, benchmarks and experimental methods for formal validation and comparison of existing or newly developed systems. It is argued that AT has reached a certain degree of maturity, and it is therefore feasible to move from ad-hoc, domain-specific evaluation methods to standardized, repeatable and easily verifiable procedures. In this paper, we outline a first attempt towards a generic evaluation methodology for MAS performance. Instead of following the research path towards defining more powerful mathematical description tools for determining intelligence and performance metrics, we adopt an engineering point of view to the problem of deploying a methodology that is both implementation and domain independent. The proposed methodology consists of a concise set of steps, novel theoretical representation tools and appropriate software tools that assist evaluators in selecting the appropriate metrics, undertaking measurement and aggregation techniques for the system at hand

查看原文本刊更多论文

迈向评估MAS绩效的通用方法

随着智能体技术(AT)成为一个成熟的计算工程领域，对智能体评估的一般化、标准化方法的需求势在必行。尽管智能体计算研究人员可以使用大量可用的开发工具和理论，但对于现有或新开发的系统进行正式验证和比较的通用度量、工具、基准和实验方法明显缺乏。有人认为，AT已经达到了一定程度的成熟，因此从专门的、特定领域的评估方法转向标准化的、可重复的和易于验证的程序是可行的。在本文中，我们概述了对MAS性能的通用评估方法的第一次尝试。我们没有遵循定义更强大的数学描述工具来确定智能和性能指标的研究路径，而是采用工程的观点来部署既独立于实现又独立于领域的方法。建议的方法包括一套简明的步骤，新颖的理论表示工具和适当的软件工具，这些工具可以帮助评估人员选择适当的度量标准，为手头的系统进行测量和汇总技术

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems

自引率

0.00%

发文量