{"title":"PREDICTING HIGH-GROWTH FIRMS IN KAZAKHSTAN WITH MACHINE LEARNING METHODS","authors":"Yelzhas Kadyr, A. Aituar, S. Kemelbayeva","doi":"10.52123/1994-2370-2022-668","DOIUrl":null,"url":null,"abstract":"In this paper, we study the effectiveness of popular machine learning methods for predicting high-growth firms in Kazakhstan and analyze this question with a set of 2012-2018 panel datasets. Moreover, we study the most important variables for the prediction of high-growth firms out of 50 variables included in the analysis. We develop a predictive design, where the past values are used to train classifiers that are applied in predicting future outcomes. Hereto, a test sample was used to evaluate the predictive performance of the classifiers. The results indicate that the best performing classifier increases the area under the curve equal to 0.8746. In terms of the variable importance, the firm’s past growth in size, past growth in employment, past growth in revenue, and second derivative of the growth of financial variables contributed the most to predicting high-growth firms.","PeriodicalId":101499,"journal":{"name":"PUBLIC ADMINISTRATION AND CIVIL SERVICE","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PUBLIC ADMINISTRATION AND CIVIL SERVICE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52123/1994-2370-2022-668","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we study the effectiveness of popular machine learning methods for predicting high-growth firms in Kazakhstan and analyze this question with a set of 2012-2018 panel datasets. Moreover, we study the most important variables for the prediction of high-growth firms out of 50 variables included in the analysis. We develop a predictive design, where the past values are used to train classifiers that are applied in predicting future outcomes. Hereto, a test sample was used to evaluate the predictive performance of the classifiers. The results indicate that the best performing classifier increases the area under the curve equal to 0.8746. In terms of the variable importance, the firm’s past growth in size, past growth in employment, past growth in revenue, and second derivative of the growth of financial variables contributed the most to predicting high-growth firms.