Narayana Darapaneni, Pramod Srinivas, K. Reddy, A. Paduri, Lakshmikanth Kanugovi, Pavithra J, S. B. G., B. S
{"title":"Tree Based Models: A Comparative And Explainable Study For Credit Default Classification","authors":"Narayana Darapaneni, Pramod Srinivas, K. Reddy, A. Paduri, Lakshmikanth Kanugovi, Pavithra J, S. B. G., B. S","doi":"10.1109/UPCON56432.2022.9986411","DOIUrl":null,"url":null,"abstract":"Most of the real-world problems can be solved with the help of classification. Some problems that might feel to be regression can also be solved using classification by binning the values. But due to the presence of a large number of classification algorithms, it becomes difficult to choose one particular model. SVC, KNN, and Naive Bayes are some of the traditional models available which do the task quite efficiently, but when it comes to the explainability of the outcome these models fail to address them. This is when the real use of tree-based models comes in handy as we can visualize the entire flow of conditions that leads to a particular outcome. In this paper, we have tried to compare the performance and explainability of the Decision tree, Random Forest, AdaBoost, GBoost, and XGBoost for this we have considered the credit default dataset from Kaggle.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering 
(UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986411","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cited by: 1
Abstract
Many real-world problems can be framed as classification tasks; even problems that appear to call for regression can be handled by binning the target values into classes. With so many classification algorithms available, however, choosing one particular model is difficult. Traditional models such as SVC, KNN, and Naive Bayes perform the task quite efficiently, but they offer little explainability for their outcomes. This is where tree-based models come in handy: the entire flow of conditions that leads to a particular outcome can be visualized. In this paper, we compare the performance and explainability of Decision Tree, Random Forest, AdaBoost, GBoost (gradient boosting), and XGBoost on a credit default dataset from Kaggle.
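The comparison described above can be illustrated with a minimal sketch. This is not the authors' code: it uses scikit-learn implementations of the tree-based models named in the abstract (XGBoost is omitted to keep the example to one library) on a synthetic binary dataset standing in for the Kaggle credit default data, and shows how a single decision tree exposes its flow of conditions as readable rules.

```python
# Hypothetical sketch, not the paper's pipeline: synthetic data replaces
# the Kaggle credit default dataset, and hyperparameters are illustrative.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text
from sklearn.ensemble import (
    RandomForestClassifier,
    AdaBoostClassifier,
    GradientBoostingClassifier,
)
from sklearn.metrics import accuracy_score

# Synthetic stand-in for a credit default table (binary target).
X, y = make_classification(n_samples=1000, n_features=8, random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.25, random_state=42
)

models = {
    "DecisionTree": DecisionTreeClassifier(max_depth=4, random_state=42),
    "RandomForest": RandomForestClassifier(n_estimators=100, random_state=42),
    "AdaBoost": AdaBoostClassifier(random_state=42),
    "GradientBoosting": GradientBoostingClassifier(random_state=42),
}

scores = {}
for name, model in models.items():
    model.fit(X_tr, y_tr)
    scores[name] = accuracy_score(y_te, model.predict(X_te))
    print(f"{name}: test accuracy = {scores[name]:.3f}")

# Explainability: a single tree can be rendered as a flow of
# if/else conditions leading to each predicted class.
print(export_text(models["DecisionTree"], max_depth=2))
```

The `export_text` dump is what makes the single decision tree the most directly explainable of the four: each path from root to leaf is one human-readable rule, whereas the ensembles trade that transparency for accuracy.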