{"title":"Evaluating trustworthiness of decision tree learning algorithms based on equivalence checking","authors":"Omer Nguena Timo, Tianqi Xiao, Florent Avellaneda, Yasir Malik, Stefan Bruda","doi":"10.1007/s43681-023-00415-0","DOIUrl":null,"url":null,"abstract":"<div><p>Learning algorithms and their implementations are used as black-boxes to produce decision trees, e.g., for realizing critical classification tasks. A low confidence in (the learning ability of) the algorithms increases the mistrust of the produced decision trees, which leads to costly test and validation activities and to the waste of the learning time in case the decision trees are likely to be faulty due to the inability to learn. Methods for evaluating trustworthiness of the algorithms are needed especially when the testing of the learned decision trees is also challenging. We propose a novel oracle-centered approach to the evaluation. It consists of generating deterministic or noise-free datasets from reference trees playing the role of oracles, producing learned trees with existing (implementations of) learning algorithms, and determining the degree of equivalence (DOE) of the learned trees by comparing them with the oracles. We evaluate (six implementations of) five decision tree learning algorithms based on the proposed approach.</p></div>","PeriodicalId":72137,"journal":{"name":"AI and ethics","volume":"4 1","pages":"37 - 46"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI and ethics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s43681-023-00415-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Learning algorithms and their implementations are used as black-boxes to produce decision trees, e.g., for realizing critical classification tasks. A low confidence in (the learning ability of) the algorithms increases the mistrust of the produced decision trees, which leads to costly test and validation activities and to the waste of the learning time in case the decision trees are likely to be faulty due to the inability to learn. Methods for evaluating trustworthiness of the algorithms are needed especially when the testing of the learned decision trees is also challenging. We propose a novel oracle-centered approach to the evaluation. It consists of generating deterministic or noise-free datasets from reference trees playing the role of oracles, producing learned trees with existing (implementations of) learning algorithms, and determining the degree of equivalence (DOE) of the learned trees by comparing them with the oracles. We evaluate (six implementations of) five decision tree learning algorithms based on the proposed approach.