{"title":"Default Prediction Framework With Optimal Feature Set and Matching Ratio","authors":"Guotai Chi, Fengshan Bai, Hongping Tan, Ying Zhou","doi":"10.1002/for.3284","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>We propose a default prediction framework that incorporates imbalance handling and feature selection. For imbalance handling, we determine the optimal ratio of non-default to default firms by minimizing the Type-II error of the majority voting deep fully connected network (MV-DFCN) model. For feature selection, we design a two-stage process that first eliminates highly correlated and redundant features, and then refines the feature set using backward selection. Experimental results show that the DFCN model within the proposed framework outperforms baseline models in terms of G-Mean and AUC and achieves the lowest Type-II error rate. Furthermore, the framework outperforms eight baseline combinations of imbalance handling and feature selection strategies. Additionally, SHAP values are used to assess feature contributions, and nine features with statistically significant impacts are identified.</p>\n </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 7","pages":"2067-2088"},"PeriodicalIF":2.7000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/for.3284","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a default prediction framework that incorporates imbalance handling and feature selection. For imbalance handling, we determine the optimal ratio of non-default to default firms by minimizing the Type-II error of the majority voting deep fully connected network (MV-DFCN) model. For feature selection, we design a two-stage process that first eliminates highly correlated and redundant features, and then refines the feature set using backward selection. Experimental results show that the DFCN model within the proposed framework outperforms baseline models in terms of G-Mean and AUC and achieves the lowest Type-II error rate. Furthermore, the framework outperforms eight baseline combinations of imbalance handling and feature selection strategies. Additionally, SHAP values are used to assess feature contributions, and nine features with statistically significant impacts are identified.
期刊介绍:
The Journal of Forecasting is an international journal that publishes refereed papers on forecasting. It is multidisciplinary, welcoming papers dealing with any aspect of forecasting: theoretical, practical, computational and methodological. A broad interpretation of the topic is taken with approaches from various subject areas, such as statistics, economics, psychology, systems engineering and social sciences, all encouraged. Furthermore, the Journal welcomes a wide diversity of applications in such fields as business, government, technology and the environment. Of particular interest are papers dealing with modelling issues and the relationship of forecasting systems to decision-making processes.