Autonomous Agents and Multi-Agent Systems最新文献

筛选
英文 中文
Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning 基于多智能体强化学习的隐藏角色随机博弈模糊身份分类
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-11 DOI: 10.1007/s10458-023-09620-x
Shijie Han, Siyuan Li, Bo An, Wei Zhao, Peng Liu
{"title":"Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning","authors":"Shijie Han,&nbsp;Siyuan Li,&nbsp;Bo An,&nbsp;Wei Zhao,&nbsp;Peng Liu","doi":"10.1007/s10458-023-09620-x","DOIUrl":"10.1007/s10458-023-09620-x","url":null,"abstract":"<div><p>Multi-agent reinforcement learning (MARL) is a prevalent learning paradigm for solving stochastic games. In most MARL studies, agents in a game are defined as teammates or enemies beforehand, and the relationships among the agents (i.e., their <i>identities</i>) remain fixed throughout the game. However, in real-world problems, the agent relationships are commonly unknown in advance or dynamically changing. Many multi-party interactions start off by asking: who is on my team? This question arises whether it is the first day at the stock exchange or the kindergarten. Therefore, training policies for such situations in the face of imperfect information and ambiguous <i>identities</i> is an important problem that needs to be addressed. In this work, we develop a novel identity detection reinforcement learning (IDRL) framework that allows an agent to dynamically infer the identities of nearby agents and select an appropriate policy to accomplish the task. In the IDRL framework, a relation network is constructed to deduce the identities of other agents by observing the behaviors of the agents. A danger network is optimized to estimate the risk of false-positive identifications. Beyond that, we propose an intrinsic reward that balances the need to maximize external rewards and accurate identification. After identifying the cooperation-competition pattern among the agents, IDRL applies one of the off-the-shelf MARL methods to learn the policy. To evaluate the proposed method, we conduct experiments on <i>Red-10</i> card-shedding game, and the results show that IDRL achieves superior performance over other state-of-the-art MARL methods. Impressively, the relation network has the par performance to identify the identities of agents with top human players; the danger network reasonably avoids the risk of imperfect identification. The code to reproduce all the reported results is available online at https://github.com/MR-BENjie/IDRL.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43149265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining theory of mind and abductive reasoning in agent-oriented programming 面向agent程序设计中思维理论与溯因推理的结合
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-11 DOI: 10.1007/s10458-023-09613-w
Nieves Montes, Michael Luck, Nardine Osman, Odinaldo Rodrigues, Carles Sierra
{"title":"Combining theory of mind and abductive reasoning in agent-oriented programming","authors":"Nieves Montes,&nbsp;Michael Luck,&nbsp;Nardine Osman,&nbsp;Odinaldo Rodrigues,&nbsp;Carles Sierra","doi":"10.1007/s10458-023-09613-w","DOIUrl":"10.1007/s10458-023-09613-w","url":null,"abstract":"<div><p>This paper presents a novel model, called T<span>om</span>A<span>bd</span>, that endows autonomous agents with Theory of Mind capabilities. T<span>om</span>A<span>bd</span> agents are able to simulate the perspective of the world that their peers have and reason from their perspective. Furthermore, T<span>om</span>A<span>bd</span> agents can reason from the perspective of others down to an <i>arbitrary level of recursion</i>, using Theory of Mind of <span>(n^{text {th}})</span> order. By combining the previous capability with abductive reasoning, T<span>om</span>A<span>bd</span> agents can infer the beliefs that others were relying upon to select their actions, hence putting them in a more informed position when it comes to their own decision-making. We have tested the T<span>om</span>A<span>bd</span> model in the challenging domain of Hanabi, a game characterised by cooperation and imperfect information. Our results show that the abilities granted by the T<span>om</span>A<span>bd</span> model boost the performance of the team along a variety of metrics, including final score, efficiency of communication, and uncertainty reduction.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10458-023-09613-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47381284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel policy-graph approach with natural language and counterfactual abstractions for explaining reinforcement learning agents 用自然语言和反事实抽象的策略图方法解释强化学习代理
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-09 DOI: 10.1007/s10458-023-09615-8
Tongtong Liu, Joe McCalmon, Thai Le, Md Asifur Rahman, Dongwon Lee, Sarra Alqahtani
{"title":"A novel policy-graph approach with natural language and counterfactual abstractions for explaining reinforcement learning agents","authors":"Tongtong Liu,&nbsp;Joe McCalmon,&nbsp;Thai Le,&nbsp;Md Asifur Rahman,&nbsp;Dongwon Lee,&nbsp;Sarra Alqahtani","doi":"10.1007/s10458-023-09615-8","DOIUrl":"10.1007/s10458-023-09615-8","url":null,"abstract":"<div><p>As reinforcement learning (RL) continues to improve and be applied in situations alongside humans, the need to explain the learned behaviors of RL agents to end-users becomes more important. Strategies for explaining the reasoning behind an agent’s policy, called <i>policy-level explanations</i>, can lead to important insights about both the task and the agent’s behaviors. Following this line of research, in this work, we propose a novel approach, named as <span>CAPS</span>, that summarizes an agent’s policy in the form of a directed graph with natural language descriptions. A decision tree based clustering method is utilized to abstract the state space of the task into fewer, condensed states which makes the policy graphs more digestible to end-users. We then use the user-defined predicates to enrich the abstract states with semantic meaning. To introduce counterfactual state explanations to the policy graph, we first identify the critical states in the graph then develop a novel counterfactual explanation method based on action perturbation in those critical states. We generate explanation graphs using <span>CAPS</span> on 5 RL tasks, using both deterministic and stochastic policies. We also evaluate the effectiveness of CAPS on human participants who are not RL experts in two user studies. When provided with our explanation graph, end-users are able to accurately interpret policies of trained RL agents 80% of the time, compared to 10% when provided with the next best baseline and <span>(68.2%)</span> of users demonstrated an increase in their confidence in understanding an agent’s behavior after provided with the counterfactual explanations.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46086354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Full communication memory networks for team-level cooperation learning 团队级合作学习的全沟通记忆网络
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-07 DOI: 10.1007/s10458-023-09617-6
Yutong Wang, Yizhuo Wang, Guillaume Sartoretti
{"title":"Full communication memory networks for team-level cooperation learning","authors":"Yutong Wang,&nbsp;Yizhuo Wang,&nbsp;Guillaume Sartoretti","doi":"10.1007/s10458-023-09617-6","DOIUrl":"10.1007/s10458-023-09617-6","url":null,"abstract":"<div><p>Communication in multi-agent systems is a key driver of team-level cooperation, for instance allowing individual agents to augment their knowledge about the world in partially-observable environments. In this paper, we propose two reinforcement learning-based multi-agent models, namely FCMNet and FCMTran. The two models both allow agents to simultaneously learn a differentiable communication mechanism that connects all agents as well as a common, cooperative policy conditioned upon received information. FCMNet utilizes multiple directional Long Short-Term Memory chains to sequentially transmit and encode the current observation-based messages sent by every other agent at each timestep. FCMTran further relies on the encoder of a modified transformer to simultaneously aggregate multiple self-generated messages sent by all agents at the previous timestep into a single message that is used in the current timestep. Results from evaluating our models on a challenging set of StarCraft II micromanagement tasks with shared rewards show that FCMNet and FCMTran both outperform recent communication-based methods and value decomposition methods in almost all tested StarCraft II micromanagement tasks. We further improve the performance of our models by combining them with value decomposition techniques; there, in particular, we show that FCMTran with value decomposition significantly pushes the state-of-the-art on one of the hardest benchmark tasks without any task-specific tuning. We also investigate the robustness of FCMNet under communication disturbances (i.e., binarized messages, random message loss, and random communication order) in an asymmetric collaborative pathfinding task with individual rewards, demonstrating FMCNet’s potential applicability in real-world robotic tasks.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43223707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deploying vaccine distribution sites for improved accessibility and equity to support pandemic response 部署疫苗分发点,提高可及性和公平性,以支持疫情应对
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-02 DOI: 10.1007/s10458-023-09614-9
George Z. Li, Ann Li, Madhav Marathe, Aravind Srinivasan, Leonidas Tsepenekas, Anil Vullikanti
{"title":"Deploying vaccine distribution sites for improved accessibility and equity to support pandemic response","authors":"George Z. Li,&nbsp;Ann Li,&nbsp;Madhav Marathe,&nbsp;Aravind Srinivasan,&nbsp;Leonidas Tsepenekas,&nbsp;Anil Vullikanti","doi":"10.1007/s10458-023-09614-9","DOIUrl":"10.1007/s10458-023-09614-9","url":null,"abstract":"<div><p>In response to COVID-19, many countries have mandated social distancing and banned large group gatherings in order to slow down the spread of SARS-CoV-2. These social interventions along with vaccines remain the best way forward to reduce the spread of SARS CoV-2. In order to increase vaccine accessibility, states such as Virginia have deployed mobile vaccination centers to distribute vaccines across the state. When choosing where to place these sites, there are two important factors to take into account: accessibility and equity. We formulate a combinatorial problem that captures these factors and then develop efficient algorithms with theoretical guarantees on both of these aspects. Furthermore, we study the inherent hardness of the problem, and demonstrate strong impossibility results. Finally, we run computational experiments on real-world data to show the efficacy of our methods.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10458-023-09614-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50437075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Balancing fairness and efficiency in traffic routing via interpolated traffic assignment 通过插值流量分配平衡流量路由的公平性和效率
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-08-02 DOI: 10.1007/s10458-023-09616-7
Devansh Jalota, Kiril Solovey, Matthew Tsao, Stephen Zoepf, Marco Pavone
{"title":"Balancing fairness and efficiency in traffic routing via interpolated traffic assignment","authors":"Devansh Jalota,&nbsp;Kiril Solovey,&nbsp;Matthew Tsao,&nbsp;Stephen Zoepf,&nbsp;Marco Pavone","doi":"10.1007/s10458-023-09616-7","DOIUrl":"10.1007/s10458-023-09616-7","url":null,"abstract":"<div><p>System optimum (SO) routing, wherein the total travel time of all users is minimized, is a holy grail for transportation authorities. However, SO routing may discriminate against users who incur much larger travel times than others to achieve high system efficiency, i.e., low total travel times. To address the inherent unfairness of SO routing, we study the <span>({beta })</span>-fair SO problem whose goal is to minimize the total travel time while guaranteeing a <span>({beta ge 1})</span> level of unfairness, which specifies the maximum possible ratio between the travel times of different users with shared origins and destinations. To obtain feasible solutions to the <span>({beta })</span>-fair SO problem while achieving high system efficiency, we develop a new convex program, the interpolated traffic assignment problem (I-TAP), which interpolates between a fairness-promoting and an efficiency-promoting traffic-assignment objective. We evaluate the efficacy of I-TAP through theoretical bounds on the total system travel time and level of unfairness in terms of its interpolation parameter, as well as present a numerical comparison between I-TAP and a state-of-the-art algorithm on a range of transportation networks. The numerical results indicate that our approach is faster by several orders of magnitude as compared to the benchmark algorithm, while achieving higher system efficiency for all desirable levels of unfairness. We further leverage the structure of I-TAP to develop two pricing mechanisms to collectively enforce the I-TAP solution in the presence of selfish homogeneous and heterogeneous users, respectively, that independently choose routes to minimize their own travel costs. We mention that this is the first study of pricing in the context of fair routing for general road networks (as opposed to, e.g., parallel road networks).</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47101947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A performance-impact based multi-task distributed scheduling algorithm with task removal inference and deadlock avoidance 一种基于性能影响的多任务分布式调度算法
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-07-18 DOI: 10.1007/s10458-023-09611-y
Jie Li, Runfeng Chen, Chang Wang, Yiting Chen, Yuchong Huang, Xiangke Wang
{"title":"A performance-impact based multi-task distributed scheduling algorithm with task removal inference and deadlock avoidance","authors":"Jie Li,&nbsp;Runfeng Chen,&nbsp;Chang Wang,&nbsp;Yiting Chen,&nbsp;Yuchong Huang,&nbsp;Xiangke Wang","doi":"10.1007/s10458-023-09611-y","DOIUrl":"10.1007/s10458-023-09611-y","url":null,"abstract":"<div><p>Multi-task distributed scheduling (MTDS) remains a challenging problem for multi-agent systems used for uncertain and dynamic real-world tasks such as search-and-rescue. The Performance Impact (PI) algorithm is an excellent solution for MTDS, but it suffers from the problem of non-convergence that it may fall into an infinite cycle of exchanging the same task. In this paper, we improve the PI algorithm through the integration of a task removal inference strategy and a deadlock avoidance mechanism. Specifically, the task removal inference strategy results in better exploration performance than the original PI, improving the suboptimal solutions caused by the heuristics for local task selection as done in PI. In addition, we design a deadlock avoidance mechanism that limits the number of times of removing the same task and isolating consecutive inclusions of the same task. Therefore, it guarantees the convergence of the MTDS algorithm. We demonstrate the advantage of the proposed algorithm over the original PI algorithm through Monte Carlo simulation of the search-and-rescue task. The results show that the proposed algorithm can obtain a lower average time cost and the highest total allocation number.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10458-023-09611-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46459090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Non-chaotic limit sets in multi-agent learning 多智能体学习中的非混沌极限集
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-07-13 DOI: 10.1007/s10458-023-09612-x
Aleksander Czechowski, Georgios Piliouras
{"title":"Non-chaotic limit sets in multi-agent learning","authors":"Aleksander Czechowski,&nbsp;Georgios Piliouras","doi":"10.1007/s10458-023-09612-x","DOIUrl":"10.1007/s10458-023-09612-x","url":null,"abstract":"<div><p>Non-convergence is an inherent aspect of adaptive multi-agent systems, and even basic learning models, such as the replicator dynamics, are not guaranteed to equilibriate. Limit cycles, and even more complicated chaotic sets are in fact possible even in rather simple games, including variants of the Rock-Paper-Scissors game. A key challenge of multi-agent learning theory lies in characterization of these limit sets, based on qualitative features of the underlying game. Although chaotic behavior in learning dynamics can be precluded by the celebrated Poincaré–Bendixson theorem, it is only applicable directly to low-dimensional settings. In this work, we attempt to find other characteristics of a game that can force regularity in the limit sets of learning. We show that behavior consistent with the Poincaré–Bendixson theorem (limit cycles, but no chaotic attractor) follows purely from the topological structure of interactions, even for high-dimensional settings with an arbitrary number of players, and arbitrary payoff matrices. We prove our result for a wide class of follow-the-regularized leader (FoReL) dynamics, which generalize replicator dynamics, for binary games characterized interaction graphs where the payoffs of each player are only affected by one other player (i.e., interaction graphs of indegree one). Moreover, for cyclic games we provide further insight into the planar structure of limit sets, and in particular limit cycles. We propose simple conditions under which learning comes with efficiency guarantees, implying that FoReL learning achieves time-averaged sum of payoffs at least as good as that of a Nash equilibrium, thereby connecting the topology of the dynamics to social-welfare analysis.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45822746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Parameterized complexity of multiwinner determination: more effort towards fixed-parameter tractability 多赢家决策的参数化复杂性:更多的精力放在固定参数的可追溯性上
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-06-30 DOI: 10.1007/s10458-023-09610-z
Yongjie Yang, Jianxin Wang
{"title":"Parameterized complexity of multiwinner determination: more effort towards fixed-parameter tractability","authors":"Yongjie Yang,&nbsp;Jianxin Wang","doi":"10.1007/s10458-023-09610-z","DOIUrl":"10.1007/s10458-023-09610-z","url":null,"abstract":"<div><p>We study the parameterized complexity of winner determination problems for three prevalent <i>k</i>-committee selection rules, namely the minimax approval voting (MAV), the proportional approval voting (PAV), and the Chamberlin–Courant’s approval voting (CCAV). It is known that these problems are computationally hard. Although they have been studied from the parameterized complexity point of view with respect to several natural parameters, many of them turned out to be <span>W[1]</span>-hard or <span>W[2]</span>-hard. Aiming at obtaining plentiful fixed-parameter algorithms, we revisit these problems by considering more natural single parameters, combined parameters, and structural parameters.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10458-023-09610-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45276635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Symbolic knowledge injection meets intelligent agents: QoS metrics and experiments 符号知识注入与智能代理:QoS度量和实验
IF 1.9 3区 计算机科学
Autonomous Agents and Multi-Agent Systems Pub Date : 2023-06-23 DOI: 10.1007/s10458-023-09609-6
Andrea Agiollo, Andrea Rafanelli, Matteo Magnini, Giovanni Ciatto, Andrea Omicini
{"title":"Symbolic knowledge injection meets intelligent agents: QoS metrics and experiments","authors":"Andrea Agiollo,&nbsp;Andrea Rafanelli,&nbsp;Matteo Magnini,&nbsp;Giovanni Ciatto,&nbsp;Andrea Omicini","doi":"10.1007/s10458-023-09609-6","DOIUrl":"10.1007/s10458-023-09609-6","url":null,"abstract":"<div><p>Bridging intelligent symbolic agents and sub-symbolic predictors is a long-standing research goal in AI. Among the recent integration efforts, symbolic knowledge injection (SKI) proposes algorithms aimed at steering sub-symbolic predictors’ learning towards compliance w.r.t. pre-existing symbolic knowledge bases. However, state-of-the-art contributions about SKI mostly tackle injection from a foundational perspective, often focussing solely on improving the predictive performance of the sub-symbolic predictors undergoing injection. Technical contributions, in turn, are tailored on individual methods/experiments and therefore poorly interoperable with agent technologies as well as among each others. Intelligent agents may exploit SKI to serve many purposes other than predictive performance alone—provided that, of course, adequate technological support exists: for instance, SKI may allow agents to tune computational, energetic, or data requirements of sub-symbolic predictors. Given that different algorithms may exist to serve all those many purposes, some criteria for <i>algorithm selection</i> as well as a suitable <i>technology</i> should be available to let agents dynamically select and exploit the most suitable algorithm for the problem at hand. Along this line, in this work we design a set of <i>quality-of-service</i> (QoS) <i>metrics</i> for SKI, and a <i>general-purpose software API</i> to enable their application to various SKI algorithms—namely, platform for symbolic knowledge injection (PSyKI). We provide an abstract formulation of four QoS metrics for SKI, and describe the design of PSyKI according to a software engineering perspective. Then we discuss how our QoS metrics are supported by PSyKI. Finally, we demonstrate the effectiveness of both our QoS metrics and PSyKI via a number of experiments, where SKI is both applied and assessed via our proposed API. Our empirical analysis demonstrates both the soundness of our proposed metrics and the versatility of PSyKI as the first software tool supporting the application, interchange, and numerical assessment of SKI techniques. To the best of our knowledge, our proposals represent the first attempt to introduce QoS metrics for SKI, and the software tools enabling their <i>practical</i> exploitation for both human and computational agents. In particular, our contributions could be exploited to automate and/or compare the manifold SKI algorithms from the state of the art. Hence moving a concrete step forward the engineering of efficient, robust, and trustworthy software applications that integrate symbolic agents and sub-symbolic predictors.</p></div>","PeriodicalId":55586,"journal":{"name":"Autonomous Agents and Multi-Agent Systems","volume":"37 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10458-023-09609-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48725447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信