In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search

arXiv - CS - Artificial Intelligence Pub Date : 2024-09-05 DOI:arxiv-2409.03260

Emir Demirović, Christian Schilling, Anna Lukina

{"title":"In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search","authors":"Emir Demirović, Christian Schilling, Anna Lukina","doi":"arxiv-2409.03260","DOIUrl":null,"url":null,"abstract":"Decision trees, owing to their interpretability, are attractive as control\npolicies for (dynamical) systems. Unfortunately, constructing, or synthesising,\nsuch policies is a challenging task. Previous approaches do so by imitating a\nneural-network policy, approximating a tabular policy obtained via formal\nsynthesis, employing reinforcement learning, or modelling the problem as a\nmixed-integer linear program. However, these works may require access to a\nhard-to-obtain accurate policy or a formal model of the environment (within\nreach of formal synthesis), and may not provide guarantees on the quality or\nsize of the final tree policy. In contrast, we present an approach to\nsynthesise optimal decision-tree policies given a black-box environment and\nspecification, and a discretisation of the tree predicates, where optimality is\ndefined with respect to the number of steps to achieve the goal. Our approach\nis a specialised search algorithm which systematically explores the\n(exponentially large) space of decision trees under the given discretisation.\nThe key component is a novel pruning mechanism that significantly reduces the\nsearch space. Our approach represents a conceptually novel way of synthesising\nsmall decision-tree policies with optimality guarantees even for black-box\nenvironments with black-box specifications.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"54 2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Decision trees, owing to their interpretability, are attractive as control policies for (dynamical) systems. Unfortunately, constructing, or synthesising, such policies is a challenging task. Previous approaches do so by imitating a neural-network policy, approximating a tabular policy obtained via formal synthesis, employing reinforcement learning, or modelling the problem as a mixed-integer linear program. However, these works may require access to a hard-to-obtain accurate policy or a formal model of the environment (within reach of formal synthesis), and may not provide guarantees on the quality or size of the final tree policy. In contrast, we present an approach to synthesise optimal decision-tree policies given a black-box environment and specification, and a discretisation of the tree predicates, where optimality is defined with respect to the number of steps to achieve the goal. Our approach is a specialised search algorithm which systematically explores the (exponentially large) space of decision trees under the given discretisation. The key component is a novel pruning mechanism that significantly reduces the search space. Our approach represents a conceptually novel way of synthesising small decision-tree policies with optimality guarantees even for black-box environments with black-box specifications.

查看原文本刊更多论文

寻找树：通过搜索为黑盒系统合成决策树策略

决策树由于其可解释性，作为（动态）系统的控制策略很有吸引力。遗憾的是，构建或合成此类策略是一项极具挑战性的任务。以往的方法包括模仿神经网络策略、近似通过形式合成获得的表格策略、采用强化学习或将问题建模为混合整数线性程序。不过，这些工作可能需要获取难以获得的精确策略或环境的正式模型（在正式合成的范围内），而且可能无法保证最终树状策略的质量或大小。与此相反，我们提出了一种合成最优决策树策略的方法，该方法给定了一个黑箱环境和规范，以及树谓词的离散化，其中最优性是根据实现目标的步骤数来定义的。我们的方法是一种专门的搜索算法，可以在给定的离散化条件下系统地探索决策树的（指数级大）空间。我们的方法代表了一种概念新颖的方法，可以合成即使是具有黑盒规范的黑盒环境也能保证最优性的小型决策树策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Artificial Intelligence

自引率

0.00%

发文量