Machine Learning Based Risk Management of Credit Sales in Small and Mid-Size Business
Dr. Manjula Shastri, Dr. Surajit Das, Akansh Garg, Mr. Gourab Dutta, Ms. Aneeqa, Dr. Abhishek Tripathi
Journal of Informatics Education and Research, Volume 18 S2, published 2024-05-01
DOI: 10.52783/jier.v4i2.842 (https://doi.org/10.52783/jier.v4i2.842)
Abstract
This study applies machine learning (ML) algorithms to credit risk prediction and management in small and mid-size businesses (SMBs). The analysis draws on comprehensive data sets comprising historical credit sales transactions, customer demographics, and economic indicators. Four ML algorithms were assessed: logistic regression, decision trees, random forest, and gradient boosting. Gradient boosting yielded the best results, with an accuracy of 90%, precision of 89%, recall of 91%, F1-score of 90%, and an area under the receiver operating characteristic curve (AUC-ROC) of 0.95. Logistic regression was also highly competitive, with accuracy above 85% and an AUC-ROC of 0.91. The findings indicate that credit history, income level, and client age are the most critical features in credit risk analysis for SMBs.
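To make the reported metrics concrete, the following minimal Python sketch shows how accuracy, precision, recall, and F1-score are derived from a binary confusion matrix, using their standard definitions. The class and function names, the choice of "risky customer" as the positive class, and the confusion counts are all hypothetical and chosen only for illustration. They are not the study's data or code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts.

    Assumption: the positive class is "risky customer"; the abstract
    does not state which class is treated as positive.
    """
    tp: int  # risky customers correctly flagged
    fp: int  # safe customers incorrectly flagged
    fn: int  # risky customers missed
    tn: int  # safe customers correctly cleared


def accuracy(c: ConfusionCounts) -> float:
    """Share of all predictions that are correct."""
    total = c.tp + c.fp + c.fn + c.tn
    return (c.tp + c.tn) / total if total else 0.0


def precision(c: ConfusionCounts) -> float:
    """Share of flagged customers that are actually risky."""
    flagged = c.tp + c.fp
    return c.tp / flagged if flagged else 0.0


def recall(c: ConfusionCounts) -> float:
    """Share of actually risky customers that are flagged."""
    risky = c.tp + c.fn
    return c.tp / risky if risky else 0.0


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision p and recall r."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Hypothetical counts for 200 customers, picked so the metrics land
# near the values reported for gradient boosting (NOT the study's data).
example = ConfusionCounts(tp=91, fp=11, fn=9, tn=89)

if __name__ == "__main__":
    p, r = precision(example), recall(example)
    print(f"accuracy : {accuracy(example):.3f}")
    print(f"precision: {p:.3f}")
    print(f"recall   : {r:.3f}")
    print(f"F1-score : {f1_score(p, r):.3f}")
    # Consistency check on the reported precision (0.89) and recall (0.91).
    print(f"F1 from reported values: {f1_score(0.89, 0.91):.3f}")
```

Plugging the reported precision (0.89) and recall (0.91) into the F1 formula gives about 0.90, which agrees with the F1-score stated in the abstract. AUC-ROC is not covered here because it requires the model's predicted scores and cannot be computed from a single confusion matrix.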