AdaFNDFS: An AdaBoost Ensemble Model with Fast Nondominated Feature Selection for Predicting Enterprise Credit Risk in the Supply Chain

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Intelligent Systems Pub Date : 2024-10-29 DOI:10.1155/2024/5529847

Gang Yao, Xiaojian Hu, Pingfan Song, Taiyun Zhou, Yue Zhang, Ammar Yasir, Suizhi Luo

{"title":"AdaFNDFS: An AdaBoost Ensemble Model with Fast Nondominated Feature Selection for Predicting Enterprise Credit Risk in the Supply Chain","authors":"Gang Yao, Xiaojian Hu, Pingfan Song, Taiyun Zhou, Yue Zhang, Ammar Yasir, Suizhi Luo","doi":"10.1155/2024/5529847","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Early warnings of enterprise credit risk based on supply chain scenarios are helpful for preventing enterprise credit deterioration and resolving systemic risk. Enterprise credit risk data in the supply chain are characterized by higher-dimension information and class imbalance. The class imbalance influences the feature selection effect, and the feature subset is closely related to the predictive performance of subsequent learning algorithms. Therefore, ensuring the adaptivity of feature selection and the subsequent class imbalance–oriented classification model is a key issue. We propose an AdaBoost ensemble model with fast nondominated feature selection (AdaFNDFS). AdaFNDFS uses the FNDFS method in the AdaBoost algorithm to iteratively select features and uses the classifier to evaluate the performance of feature subsets to train the class imbalance–oriented classifier and the best-matched feature subset, ensuring the adaptivity of feature selection and subsequent classifiers. The further use of the differential sampling rate (DSR) method enables AdaFNDFS to integrate more training models with different knowledge and to obtain higher accuracy and better generalization ability for prediction tasks facing high-dimensional information and class imbalance. A test using credit risk data from Chinese listed enterprises containing supply chain information demonstrates that the prediction scoring indicators, such as AUC, KS, AP, and accuracy, of the AdaFNDFS are better than those of basic models such as LR, LDA, DT, and SVM and multiple hybrid models that use SMOTE, feature selection, and ensemble methods. AdaFNDFS outperforms the basic models by at least 0.0073 (0.0344, 0.0349, and 0.0071) in terms of the AUC (KS, AP, and accuracy). AdaFNDFS has outstanding advantages in predicting enterprise credit risk in the supply chain and can support interested decision-makers.</p>\n </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2024 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/2024/5529847","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/2024/5529847","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Early warnings of enterprise credit risk based on supply chain scenarios are helpful for preventing enterprise credit deterioration and resolving systemic risk. Enterprise credit risk data in the supply chain are characterized by higher-dimension information and class imbalance. The class imbalance influences the feature selection effect, and the feature subset is closely related to the predictive performance of subsequent learning algorithms. Therefore, ensuring the adaptivity of feature selection and the subsequent class imbalance–oriented classification model is a key issue. We propose an AdaBoost ensemble model with fast nondominated feature selection (AdaFNDFS). AdaFNDFS uses the FNDFS method in the AdaBoost algorithm to iteratively select features and uses the classifier to evaluate the performance of feature subsets to train the class imbalance–oriented classifier and the best-matched feature subset, ensuring the adaptivity of feature selection and subsequent classifiers. The further use of the differential sampling rate (DSR) method enables AdaFNDFS to integrate more training models with different knowledge and to obtain higher accuracy and better generalization ability for prediction tasks facing high-dimensional information and class imbalance. A test using credit risk data from Chinese listed enterprises containing supply chain information demonstrates that the prediction scoring indicators, such as AUC, KS, AP, and accuracy, of the AdaFNDFS are better than those of basic models such as LR, LDA, DT, and SVM and multiple hybrid models that use SMOTE, feature selection, and ensemble methods. AdaFNDFS outperforms the basic models by at least 0.0073 (0.0344, 0.0349, and 0.0071) in terms of the AUC (KS, AP, and accuracy). AdaFNDFS has outstanding advantages in predicting enterprise credit risk in the supply chain and can support interested decision-makers.

Abstract Image

查看原文本刊更多论文

AdaFNDFS：利用快速非支配特征选择的 AdaBoost 集合模型预测供应链中的企业信贷风险

基于供应链场景的企业信用风险预警有助于防止企业信用风险恶化，化解系统性风险。供应链中的企业信用风险数据具有信息维度高、类不平衡的特点。类不平衡影响特征选择效果，而特征子集与后续学习算法的预测性能密切相关。因此，确保特征选择和后续面向类不平衡的分类模型的适应性是一个关键问题。我们提出了一种具有快速非支配特征选择的 AdaBoost 集合模型（AdaFNDFS）。AdaFNDFS 使用 AdaBoost 算法中的 FNDFS 方法迭代选择特征，并使用分类器评估特征子集的性能，以训练面向类不平衡的分类器和最佳匹配的特征子集，从而确保特征选择和后续分类器的自适应性。差分采样率（DSR）方法的进一步使用，使得AdaFNDFS能够集成更多不同知识的训练模型，在面对高维信息和类不平衡的预测任务时获得更高的精度和更好的泛化能力。利用包含供应链信息的中国上市企业信用风险数据进行的测试表明，AdaFNDFS的AUC、KS、AP和准确率等预测评分指标均优于LR、LDA、DT和SVM等基本模型以及使用SMOTE、特征选择和集合方法的多种混合模型。在 AUC（KS、AP 和准确度）方面，AdaFNDFS 至少比基本模型高出 0.0073（0.0344、0.0349 和 0.0071）。AdaFNDFS 在预测供应链中的企业信贷风险方面具有突出优势，可为相关决策者提供支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Intelligent Systems 工程技术-计算机：人工智能

CiteScore

11.30

自引率

14.30%

发文量

304

审稿时长

9 months

期刊介绍： The International Journal of Intelligent Systems serves as a forum for individuals interested in tapping into the vast theories based on intelligent systems construction. With its peer-reviewed format, the journal explores several fascinating editorials written by today''s experts in the field. Because new developments are being introduced each day, there''s much to be learned — examination, analysis creation, information retrieval, man–computer interactions, and more. The International Journal of Intelligent Systems uses charts and illustrations to demonstrate these ground-breaking issues, and encourages readers to share their thoughts and experiences.