Towards a Generic Methodology for Evaluating MAS Performance

C. Dimou, A. Symeonidis, P. Mitkas
Published in: 2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems, 2007-06-11
DOI: https://doi.org/10.1109/KIMAS.2007.369805
Citations: 7

Abstract

As agent technology (AT) becomes a well-established engineering field of computing, the need for generalized, standardized methodologies for agent evaluation is imperative. Despite the plethora of development tools and theories available to researchers in agent computing, there is a remarkable lack of general metrics, tools, benchmarks and experimental methods for formal validation and comparison of existing or newly developed systems. It is argued that AT has reached a certain degree of maturity, and it is therefore feasible to move from ad-hoc, domain-specific evaluation methods to standardized, repeatable and easily verifiable procedures. In this paper, we outline a first attempt towards a generic evaluation methodology for MAS performance. Instead of following the research path towards defining more powerful mathematical description tools for determining intelligence and performance metrics, we adopt an engineering point of view on the problem of deploying a methodology that is both implementation- and domain-independent. The proposed methodology consists of a concise set of steps, novel theoretical representation tools and appropriate software tools that assist evaluators in selecting appropriate metrics and in applying measurement and aggregation techniques to the system at hand.