Lu Wang, Zecheng Yu, Jingling Ma, Xiaofang Chen, Chong Wu
{"title":"A Two-Stage Interpretable Model to Explain Classifier in Credit Risk Prediction","authors":"Lu Wang, Zecheng Yu, Jingling Ma, Xiaofang Chen, Chong Wu","doi":"10.1002/for.3288","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the financial sector, credit risk represents a critical issue, and accurate prediction is essential for mitigating financial risk and ensuring economic stability. Although artificial intelligence methods can achieve satisfactory accuracy, explaining their predictive results poses a significant challenge, thereby prompting research on interpretability. Current research primarily focuses on individual interpretability methods and seldom investigates the combined application of multiple approaches. To address the limitations of existing research, this study proposes a two-stage interpretability model that integrates SHAP and counterfactual explanations. In the first stage, SHAP is employed to analyze feature importance, categorizing features into subsets according to their positive or negative impact on predicted outcomes. In the second stage, a genetic algorithm generates counterfactual explanations by considering feature importance and applying perturbations in various directions based on predefined subsets, thereby accurately identifying counterfactual samples that can modify predicted outcomes. We conducted experiments on the German credit datasets, HMEQ datasets, and the Taiwan Default of Credit Card Clients dataset using SVM, XGB, MLP, and LSTM as base classifiers, respectively. The experimental results indicate that the frequency of feature changes in the counterfactual explanations generated closely aligns with the feature importance derived from the SHAP method. Under the evaluation metrics of effectiveness and sparsity, the performance demonstrates improvements over both basic counterfactual explanation methods and prototype-based counterfactuals. Furthermore, this study offers recommendations based on features derived from SHAP analysis results and counterfactual explanations to reduce the risk of classification as a default.</p>\n </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 7","pages":"2132-2150"},"PeriodicalIF":2.7000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/for.3288","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
Citations: 0
Abstract
In the financial sector, credit risk represents a critical issue, and accurate prediction is essential for mitigating financial risk and ensuring economic stability. Although artificial intelligence methods can achieve satisfactory accuracy, explaining their predictions remains a significant challenge, which has prompted research on interpretability. Current research focuses primarily on individual interpretability methods and seldom investigates the combined application of multiple approaches. To address this limitation, this study proposes a two-stage interpretability model that integrates SHAP and counterfactual explanations. In the first stage, SHAP is employed to analyze feature importance, categorizing features into subsets according to their positive or negative impact on the predicted outcome. In the second stage, a genetic algorithm generates counterfactual explanations by taking feature importance into account and applying perturbations in different directions to the predefined subsets, thereby identifying counterfactual samples that change the predicted outcome. We conducted experiments on the German credit dataset, the HMEQ dataset, and the Taiwan Default of Credit Card Clients dataset, using SVM, XGB, MLP, and LSTM as base classifiers. The experimental results indicate that the frequency of feature changes in the generated counterfactual explanations aligns closely with the feature importance derived from SHAP. Under the evaluation metrics of effectiveness and sparsity, the proposed method improves on both basic counterfactual explanation methods and prototype-based counterfactuals. Furthermore, this study offers recommendations, based on the SHAP analysis results and the counterfactual explanations, to reduce the risk of being classified as a default.
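The abstract outlines a two-stage pipeline: SHAP attributions first split features by the sign of their contribution to the predicted default probability, and a genetic algorithm then searches for sparse counterfactuals by perturbing each feature only in the direction that opposes its SHAP sign. The Python sketch below illustrates that idea on synthetic data; it is not the authors' implementation, and the choice of classifier (xgboost), the GA settings, and the sparsity penalty are illustrative assumptions.

```python
# Minimal sketch (assumptions, not the paper's code): SHAP-guided,
# direction-constrained counterfactual search with a tiny genetic algorithm.
import numpy as np
import shap
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

rng = np.random.default_rng(0)

# Synthetic stand-in data; the paper uses German credit, HMEQ, and Taiwan credit-card data.
X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model = XGBClassifier(n_estimators=100, max_depth=3).fit(X_tr, y_tr)

# Stage 1: SHAP values for one instance; split features by the sign of their contribution.
x0 = X_te[0]
explainer = shap.TreeExplainer(model)
phi = explainer.shap_values(x0.reshape(1, -1))[0]    # per-feature log-odds contributions
push_up = np.where(phi > 0)[0]                       # features pushing toward class 1
push_down = np.where(phi <= 0)[0]                    # features pushing toward class 0
target = 1 - model.predict(x0.reshape(1, -1))[0]     # we want to flip the prediction

# Stage 2: genetic search. Features in push_up are only moved downward and vice versa;
# fitness rewards reaching the target class with few, small changes (sparsity).
scale = X_tr.std(axis=0)

def fitness(x):
    p = model.predict_proba(x.reshape(1, -1))[0, target]
    n_changed = np.sum(~np.isclose(x, x0))
    return p - 0.05 * n_changed                      # illustrative sparsity penalty

def mutate(x):
    x = x.copy()
    j = rng.integers(x.size)
    step = abs(rng.normal(0, 0.5)) * scale[j]
    x[j] += -step if j in push_up else step          # perturbation direction from SHAP sign
    return x

pop = [mutate(x0) for _ in range(50)]
for _ in range(100):
    pop.sort(key=fitness, reverse=True)
    parents = pop[:10]
    children = []
    for _ in range(40):
        a, b = rng.choice(len(parents), 2, replace=False)
        mask = rng.random(x0.size) < 0.5             # uniform crossover
        children.append(mutate(np.where(mask, parents[a], parents[b])))
    pop = parents + children

best = max(pop, key=fitness)
print("prediction flipped:", model.predict(best.reshape(1, -1))[0] == target)
print("changed feature indices:", np.flatnonzero(~np.isclose(best, x0)))
```

The sparsity term in the fitness function is a simple proxy for the sparsity metric mentioned in the abstract; a faithful reproduction would follow the perturbation subsets, crossover scheme, and evaluation metrics defined in the paper itself.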
About the Journal:
The Journal of Forecasting is an international journal that publishes refereed papers on forecasting. It is multidisciplinary, welcoming papers on any aspect of forecasting: theoretical, practical, computational, and methodological. A broad interpretation of the topic is taken, and approaches from subject areas such as statistics, economics, psychology, systems engineering, and the social sciences are all encouraged. Furthermore, the Journal welcomes a wide diversity of applications in fields such as business, government, technology, and the environment. Of particular interest are papers dealing with modelling issues and the relationship of forecasting systems to decision-making processes.