{"title":"企业信用评级预测与机器学习的比较研究","authors":"Seyyide Doğan, Yasin Büyükkör, Murat Atan","doi":"10.37190/ord220102","DOIUrl":null,"url":null,"abstract":"Credit scores are critical for financial sector investors and government officials, so it is important to develop reliable, transparent and appropriate tools for obtaining ratings. This study aims to predict company credit scores with machine learning and modern statistical methods, both in sectoral and aggregated data. Analyses are made on 1881 companies operating in three different sectors that applied for loans from Turkey’s largest public bank. The results of the experiment are compared in terms of classification accuracy, sensitivity, specificity, precision and Mathews correlation coefficient. When the credit ratings are estimated on a sectoral basis, it is observed that the classification rate considerably changes. Considering the analysis results, it is seen that logistic regression analysis, support vector machines, random forest and XGBoost have better performance than decision tree and k-nearest neighbour for all data sets.","PeriodicalId":43244,"journal":{"name":"Operations Research and Decisions","volume":"391 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A comparative study of corporate credit ratings prediction with machine learning\",\"authors\":\"Seyyide Doğan, Yasin Büyükkör, Murat Atan\",\"doi\":\"10.37190/ord220102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Credit scores are critical for financial sector investors and government officials, so it is important to develop reliable, transparent and appropriate tools for obtaining ratings. This study aims to predict company credit scores with machine learning and modern statistical methods, both in sectoral and aggregated data. Analyses are made on 1881 companies operating in three different sectors that applied for loans from Turkey’s largest public bank. The results of the experiment are compared in terms of classification accuracy, sensitivity, specificity, precision and Mathews correlation coefficient. When the credit ratings are estimated on a sectoral basis, it is observed that the classification rate considerably changes. Considering the analysis results, it is seen that logistic regression analysis, support vector machines, random forest and XGBoost have better performance than decision tree and k-nearest neighbour for all data sets.\",\"PeriodicalId\":43244,\"journal\":{\"name\":\"Operations Research and Decisions\",\"volume\":\"391 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Operations Research and Decisions\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37190/ord220102\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"OPERATIONS RESEARCH & MANAGEMENT SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Operations Research and Decisions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37190/ord220102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
A comparative study of corporate credit ratings prediction with machine learning
Credit scores are critical for financial sector investors and government officials, so it is important to develop reliable, transparent and appropriate tools for obtaining ratings. This study aims to predict company credit scores with machine learning and modern statistical methods, both in sectoral and aggregated data. Analyses are made on 1881 companies operating in three different sectors that applied for loans from Turkey’s largest public bank. The results of the experiment are compared in terms of classification accuracy, sensitivity, specificity, precision and Mathews correlation coefficient. When the credit ratings are estimated on a sectoral basis, it is observed that the classification rate considerably changes. Considering the analysis results, it is seen that logistic regression analysis, support vector machines, random forest and XGBoost have better performance than decision tree and k-nearest neighbour for all data sets.