{"title":"In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search","authors":"Emir Demirović, Christian Schilling, Anna Lukina","doi":"arxiv-2409.03260","DOIUrl":null,"url":null,"abstract":"Decision trees, owing to their interpretability, are attractive as control\npolicies for (dynamical) systems. Unfortunately, constructing, or synthesising,\nsuch policies is a challenging task. Previous approaches do so by imitating a\nneural-network policy, approximating a tabular policy obtained via formal\nsynthesis, employing reinforcement learning, or modelling the problem as a\nmixed-integer linear program. However, these works may require access to a\nhard-to-obtain accurate policy or a formal model of the environment (within\nreach of formal synthesis), and may not provide guarantees on the quality or\nsize of the final tree policy. In contrast, we present an approach to\nsynthesise optimal decision-tree policies given a black-box environment and\nspecification, and a discretisation of the tree predicates, where optimality is\ndefined with respect to the number of steps to achieve the goal. Our approach\nis a specialised search algorithm which systematically explores the\n(exponentially large) space of decision trees under the given discretisation.\nThe key component is a novel pruning mechanism that significantly reduces the\nsearch space. Our approach represents a conceptually novel way of synthesising\nsmall decision-tree policies with optimality guarantees even for black-box\nenvironments with black-box specifications.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"54 2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Decision trees, owing to their interpretability, are attractive as control
policies for (dynamical) systems. Unfortunately, constructing, or synthesising,
such policies is a challenging task. Previous approaches do so by imitating a
neural-network policy, approximating a tabular policy obtained via formal
synthesis, employing reinforcement learning, or modelling the problem as a
mixed-integer linear program. However, these works may require access to a
hard-to-obtain accurate policy or a formal model of the environment (within
reach of formal synthesis), and may not provide guarantees on the quality or
size of the final tree policy. In contrast, we present an approach to
synthesise optimal decision-tree policies given a black-box environment and
specification, and a discretisation of the tree predicates, where optimality is
defined with respect to the number of steps to achieve the goal. Our approach
is a specialised search algorithm which systematically explores the
(exponentially large) space of decision trees under the given discretisation.
The key component is a novel pruning mechanism that significantly reduces the
search space. Our approach represents a conceptually novel way of synthesising
small decision-tree policies with optimality guarantees even for black-box
environments with black-box specifications.