{"title":"通过综合预处理和特征选择改进贷款违约预测模型","authors":"Ahmad Al-qerem, Ghazi Al-Naymat, M. Alhasan","doi":"10.1109/ACIT47987.2019.8991084","DOIUrl":null,"url":null,"abstract":"For financial institutions and the banking industry, it is very crucial to have predictive models for their financial activities, as they play a major role in risk management. Predicting loan default is one of the critical issues that they focus on, as huge revenue loss could be prevented by predicting customer’s ability to pay back on time. In this paper, different classification methods (Naïve Bayes, Decision Tree, and Random Forest) are being used for prediction, comprehensive different pre-processing techniques are being applied on the dataset, and three different feature extractions algorithms are used to enhance the accuracy and the performance. Results are compared using F1 accuracy measure, and an improvement of over 3% has been obtained.","PeriodicalId":314091,"journal":{"name":"2019 International Arab Conference on Information Technology (ACIT)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Loan Default Prediction Model Improvement through Comprehensive Preprocessing and Features Selection\",\"authors\":\"Ahmad Al-qerem, Ghazi Al-Naymat, M. Alhasan\",\"doi\":\"10.1109/ACIT47987.2019.8991084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For financial institutions and the banking industry, it is very crucial to have predictive models for their financial activities, as they play a major role in risk management. Predicting loan default is one of the critical issues that they focus on, as huge revenue loss could be prevented by predicting customer’s ability to pay back on time. In this paper, different classification methods (Naïve Bayes, Decision Tree, and Random Forest) are being used for prediction, comprehensive different pre-processing techniques are being applied on the dataset, and three different feature extractions algorithms are used to enhance the accuracy and the performance. Results are compared using F1 accuracy measure, and an improvement of over 3% has been obtained.\",\"PeriodicalId\":314091,\"journal\":{\"name\":\"2019 International Arab Conference on Information Technology (ACIT)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Arab Conference on Information Technology (ACIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACIT47987.2019.8991084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Arab Conference on Information Technology (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT47987.2019.8991084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Loan Default Prediction Model Improvement through Comprehensive Preprocessing and Features Selection
For financial institutions and the banking industry, it is very crucial to have predictive models for their financial activities, as they play a major role in risk management. Predicting loan default is one of the critical issues that they focus on, as huge revenue loss could be prevented by predicting customer’s ability to pay back on time. In this paper, different classification methods (Naïve Bayes, Decision Tree, and Random Forest) are being used for prediction, comprehensive different pre-processing techniques are being applied on the dataset, and three different feature extractions algorithms are used to enhance the accuracy and the performance. Results are compared using F1 accuracy measure, and an improvement of over 3% has been obtained.