Léo Saulières, Martin C. Cooper, Florence Dupin de Saint Cyr
{"title":"Backward explanations via redefinition of predicates","authors":"Léo Saulières, Martin C. Cooper, Florence Dupin de Saint Cyr","doi":"arxiv-2408.02606","DOIUrl":null,"url":null,"abstract":"History eXplanation based on Predicates (HXP), studies the behavior of a\nReinforcement Learning (RL) agent in a sequence of agent's interactions with\nthe environment (a history), through the prism of an arbitrary predicate. To\nthis end, an action importance score is computed for each action in the\nhistory. The explanation consists in displaying the most important actions to\nthe user. As the calculation of an action's importance is #W[1]-hard, it is\nnecessary for long histories to approximate the scores, at the expense of their\nquality. We therefore propose a new HXP method, called Backward-HXP, to provide\nexplanations for these histories without having to approximate scores.\nExperiments show the ability of B-HXP to summarise long histories.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computational Complexity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
History eXplanation based on Predicates (HXP), studies the behavior of a
Reinforcement Learning (RL) agent in a sequence of agent's interactions with
the environment (a history), through the prism of an arbitrary predicate. To
this end, an action importance score is computed for each action in the
history. The explanation consists in displaying the most important actions to
the user. As the calculation of an action's importance is #W[1]-hard, it is
necessary for long histories to approximate the scores, at the expense of their
quality. We therefore propose a new HXP method, called Backward-HXP, to provide
explanations for these histories without having to approximate scores.
Experiments show the ability of B-HXP to summarise long histories.