{"title":"Optimal Prediction Model of Default Probability Based on Multiple Machine Learning Methods","authors":"Zhanjiang Li, Xueting Ren, Hua Tao","doi":"10.3103/S0146411625700105","DOIUrl":null,"url":null,"abstract":"<p>The prediction of the probability of default can help banks and other financial institutions to effectively identify and assess the potential default risk associated with family farms, thereby reducing losses due to bad debts. Although many methods are available for constructing models for the probability of default, the choice of optimal models is still inconclusive. Taking the survey data of 722 family farms in China Inner Mongolia as the empirical objects, 4 machine learning methods, including binary classification logistic regression, decision tree CART algorithm, random forest, and kernel support vector machine, were used to construct the default probability prediction model for family farms. By comparing and analyzing the four models, we found a better default probability prediction model to help financial institutions better audit the qualifications of family farms and reduce borrowing risks. The results showed that (1) the three models except logistic regression had strong prediction ability, which was higher than 90%, and the classification effect was good; and (2) the random forest model had the best prediction effect, the decision tree was the second, and the logistic regression was the worst.</p>","PeriodicalId":46238,"journal":{"name":"AUTOMATIC CONTROL AND COMPUTER SCIENCES","volume":"59 1","pages":"116 - 125"},"PeriodicalIF":0.5000,"publicationDate":"2025-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC CONTROL AND COMPUTER SCIENCES","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0146411625700105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The prediction of the probability of default can help banks and other financial institutions to effectively identify and assess the potential default risk associated with family farms, thereby reducing losses due to bad debts. Although many methods are available for constructing models for the probability of default, the choice of optimal models is still inconclusive. Taking the survey data of 722 family farms in China Inner Mongolia as the empirical objects, 4 machine learning methods, including binary classification logistic regression, decision tree CART algorithm, random forest, and kernel support vector machine, were used to construct the default probability prediction model for family farms. By comparing and analyzing the four models, we found a better default probability prediction model to help financial institutions better audit the qualifications of family farms and reduce borrowing risks. The results showed that (1) the three models except logistic regression had strong prediction ability, which was higher than 90%, and the classification effect was good; and (2) the random forest model had the best prediction effect, the decision tree was the second, and the logistic regression was the worst.
期刊介绍:
Automatic Control and Computer Sciences is a peer reviewed journal that publishes articles on• Control systems, cyber-physical system, real-time systems, robotics, smart sensors, embedded intelligence • Network information technologies, information security, statistical methods of data processing, distributed artificial intelligence, complex systems modeling, knowledge representation, processing and management • Signal and image processing, machine learning, machine perception, computer vision