{"title":"Extracting Decision Tree from Trained Deep Reinforcement Learning in Traffic Signal Control","authors":"Yuanyang Zhu, Xiao Yin, Ruyu Li, Chunlin Chen","doi":"10.1109/ICCSI53130.2021.9736263","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) has achieved promising results on traffic signal control systems. However, due to the complexity of the decisions of deep neural networks, it is a great challenge to explain and visualize the policy of reinforcement learning (RL) agents. The decision tree can provide useful information for experts responsible for making reliable decisions. In this paper, we employ decision trees to extract models with readable interpretations from expert policy achieved by DRL methods. We evaluate our methods via a single-intersection traffic signal control task on the simulation platform of Urban MObility (SUMO). The experimental results demonstrate that the extracted decision trees can be used to understand the learning process and the learned optimal policy of the DRL methods.","PeriodicalId":175851,"journal":{"name":"2021 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI53130.2021.9736263","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Deep reinforcement learning (DRL) has achieved promising results on traffic signal control systems. However, due to the complexity of the decisions of deep neural networks, it is a great challenge to explain and visualize the policy of reinforcement learning (RL) agents. The decision tree can provide useful information for experts responsible for making reliable decisions. In this paper, we employ decision trees to extract models with readable interpretations from expert policy achieved by DRL methods. We evaluate our methods via a single-intersection traffic signal control task on the simulation platform of Urban MObility (SUMO). The experimental results demonstrate that the extracted decision trees can be used to understand the learning process and the learned optimal policy of the DRL methods.