{"title":"Forecasting loan, deferred rate and customer segmentation in banking industry: A computational intelligence approach","authors":"Mahtab Vasheghani, Ebrahim Nazari Farokhi, Behrooz Dolatshah","doi":"10.1016/j.array.2025.100460","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate loan default prediction and customer segmentation are critical challenges in the banking industry. This study proposes a novel hybrid model integrating Multi-Layer Perceptron (MLP) neural networks with Self-Adaptive Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) frameworks. GA handles feature selection, while PSO optimizes MLP hyperparameters (e.g., learning rate, neurons, activation functions). The model dynamically enhances classification accuracy and resilience, particularly for imbalanced datasets in loan default prediction. Using real-world data from Sina Bank, the system outperforms Logistic Regression, Decision Trees, and Random Forests. The GA-PSO optimization process, which integrates both PSO and GA to optimize the MLP model's parameters, plays a crucial role in enhancing the accuracy and scalability of the system. Specifically, the GA-PSO-MLP model achieves a 15 % higher classification accuracy than Logistic Regression, a 12 % improvement over Decision Trees, and an 8 % gain over Random Forests. Additionally, false positive rates are reduced by 20 %, and mean squared error (MSE) is lowered by 18 %. The F1-score of the proposed model is 92.3 %, compared to 79.8 % (Logistic Regression), 81.5 % (Decision Trees), and 85.2 % (Random Forests), further highlighting its advantage in handling imbalanced datasets. Extensive numerical validation and sensitivity analysis further highlight the model's effectiveness in delivering actionable insights that enhance customer management strategies and mitigate financial risks. This research makes a substantial contribution to the application of machine learning in banking, facilitating more accurate data-driven decision-making and more robust risk management practices.</div></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":"27 ","pages":"Article 100460"},"PeriodicalIF":4.5000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005625000876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate loan default prediction and customer segmentation are critical challenges in the banking industry. This study proposes a novel hybrid model integrating Multi-Layer Perceptron (MLP) neural networks with Self-Adaptive Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) frameworks. GA handles feature selection, while PSO optimizes MLP hyperparameters (e.g., learning rate, neurons, activation functions). The model dynamically enhances classification accuracy and resilience, particularly for imbalanced datasets in loan default prediction. Using real-world data from Sina Bank, the system outperforms Logistic Regression, Decision Trees, and Random Forests. The GA-PSO optimization process, which integrates both PSO and GA to optimize the MLP model's parameters, plays a crucial role in enhancing the accuracy and scalability of the system. Specifically, the GA-PSO-MLP model achieves a 15 % higher classification accuracy than Logistic Regression, a 12 % improvement over Decision Trees, and an 8 % gain over Random Forests. Additionally, false positive rates are reduced by 20 %, and mean squared error (MSE) is lowered by 18 %. The F1-score of the proposed model is 92.3 %, compared to 79.8 % (Logistic Regression), 81.5 % (Decision Trees), and 85.2 % (Random Forests), further highlighting its advantage in handling imbalanced datasets. Extensive numerical validation and sensitivity analysis further highlight the model's effectiveness in delivering actionable insights that enhance customer management strategies and mitigate financial risks. This research makes a substantial contribution to the application of machine learning in banking, facilitating more accurate data-driven decision-making and more robust risk management practices.