Anna Kusetogullari, H. Kusetogullari, Amir Yavariabdi, Martin Andersson, Johan Eklund
{"title":"Genetic Algorithm-based Variable Selection Approach for High-Growth Firm Prediction","authors":"Anna Kusetogullari, H. Kusetogullari, Amir Yavariabdi, Martin Andersson, Johan Eklund","doi":"10.1109/ICECCME55909.2022.9988729","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel method for high-growth firm prediction by minimizing a cost function using a Genetic Algorithm (GA). To achieve it, the GA is used to search to find a set of important variables which provide the best fit for machine learning models so that accurate predictions can be made for high-growth firm prediction. The GA is employed to optimize the mean square error (MSE) between the accurate results and the predicted results of the machine learning methods by evolving the initially generated binary solutions through iterations. The proposed method obtains the best fitting set of variables for the machine learning methods for high-growth firm prediction. Four different machine learning methods which are Support Vector Machines (SVM), Logistic Regression, Random Forest (RF) and K-Nearest Neighbor (K-NN) have been employed with the GA and experimental results show that using RF with the GA achieves the best accuracy results with 94.93%.","PeriodicalId":202568,"journal":{"name":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCME55909.2022.9988729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose a novel method for high-growth firm prediction by minimizing a cost function using a Genetic Algorithm (GA). To achieve it, the GA is used to search to find a set of important variables which provide the best fit for machine learning models so that accurate predictions can be made for high-growth firm prediction. The GA is employed to optimize the mean square error (MSE) between the accurate results and the predicted results of the machine learning methods by evolving the initially generated binary solutions through iterations. The proposed method obtains the best fitting set of variables for the machine learning methods for high-growth firm prediction. Four different machine learning methods which are Support Vector Machines (SVM), Logistic Regression, Random Forest (RF) and K-Nearest Neighbor (K-NN) have been employed with the GA and experimental results show that using RF with the GA achieves the best accuracy results with 94.93%.