{"title":"利用深度学习和SMOTE-ENN重采样增强信用风险预测","authors":"Idowu Aruleba, Yanxia Sun","doi":"10.1016/j.mlwa.2025.100692","DOIUrl":null,"url":null,"abstract":"<div><div>Credit risk prediction is a vital task in financial services, ensuring that institutions can manage their lending risks effectively. This study investigates the effectiveness of deep learning (DL) models for credit risk prediction, with a focus on addressing the challenge of class imbalance and the black box nature of these models using the Synthetic Minority Over-sampling Technique - Edited Nearest Neighbor (SMOTE-ENN) resampling method and Shapley Additive Explanations (SHAP), respectively. The study compares the performance of various DL architectures, including Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), Gated Recurrent Units (GRU), and Graph Neural Networks (GNN), on two real-world datasets: the Australian and German credit datasets. The findings reveal that the GRU model, enhanced with SMOTE-ENN resampling, outperforms other models in terms of accuracy, sensitivity, and specificity. The superior performance of the GRU-SMOTE-ENN model demonstrates its potential as a robust deep learning technique for financial institutions to enhance credit risk assessment. Additionally, the study demonstrates how the integration of SHAP values significantly improves the interpretability of deep learning models, making them more transparent and trustworthy for stakeholders.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"21 ","pages":"Article 100692"},"PeriodicalIF":4.9000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced credit risk prediction using deep learning and SMOTE-ENN resampling\",\"authors\":\"Idowu Aruleba, Yanxia Sun\",\"doi\":\"10.1016/j.mlwa.2025.100692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Credit risk prediction is a vital task in financial services, ensuring that institutions can manage their lending risks effectively. This study investigates the effectiveness of deep learning (DL) models for credit risk prediction, with a focus on addressing the challenge of class imbalance and the black box nature of these models using the Synthetic Minority Over-sampling Technique - Edited Nearest Neighbor (SMOTE-ENN) resampling method and Shapley Additive Explanations (SHAP), respectively. The study compares the performance of various DL architectures, including Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), Gated Recurrent Units (GRU), and Graph Neural Networks (GNN), on two real-world datasets: the Australian and German credit datasets. The findings reveal that the GRU model, enhanced with SMOTE-ENN resampling, outperforms other models in terms of accuracy, sensitivity, and specificity. The superior performance of the GRU-SMOTE-ENN model demonstrates its potential as a robust deep learning technique for financial institutions to enhance credit risk assessment. Additionally, the study demonstrates how the integration of SHAP values significantly improves the interpretability of deep learning models, making them more transparent and trustworthy for stakeholders.</div></div>\",\"PeriodicalId\":74093,\"journal\":{\"name\":\"Machine learning with applications\",\"volume\":\"21 \",\"pages\":\"Article 100692\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Machine learning with applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666827025000751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhanced credit risk prediction using deep learning and SMOTE-ENN resampling
Credit risk prediction is a vital task in financial services, ensuring that institutions can manage their lending risks effectively. This study investigates the effectiveness of deep learning (DL) models for credit risk prediction, with a focus on addressing the challenge of class imbalance and the black box nature of these models using the Synthetic Minority Over-sampling Technique - Edited Nearest Neighbor (SMOTE-ENN) resampling method and Shapley Additive Explanations (SHAP), respectively. The study compares the performance of various DL architectures, including Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), Gated Recurrent Units (GRU), and Graph Neural Networks (GNN), on two real-world datasets: the Australian and German credit datasets. The findings reveal that the GRU model, enhanced with SMOTE-ENN resampling, outperforms other models in terms of accuracy, sensitivity, and specificity. The superior performance of the GRU-SMOTE-ENN model demonstrates its potential as a robust deep learning technique for financial institutions to enhance credit risk assessment. Additionally, the study demonstrates how the integration of SHAP values significantly improves the interpretability of deep learning models, making them more transparent and trustworthy for stakeholders.