{"title":"用机器学习方法预测哈萨克斯坦的高增长公司","authors":"Yelzhas Kadyr, A. Aituar, S. Kemelbayeva","doi":"10.52123/1994-2370-2022-668","DOIUrl":null,"url":null,"abstract":"In this paper, we study the effectiveness of popular machine learning methods for predicting high-growth firms in Kazakhstan and analyze this question with a set of 2012-2018 panel datasets. Moreover, we study the most important variables for the prediction of high-growth firms out of 50 variables included in the analysis. We develop a predictive design, where the past values are used to train classifiers that are applied in predicting future outcomes. Hereto, a test sample was used to evaluate the predictive performance of the classifiers. The results indicate that the best performing classifier increases the area under the curve equal to 0.8746. In terms of the variable importance, the firm’s past growth in size, past growth in employment, past growth in revenue, and second derivative of the growth of financial variables contributed the most to predicting high-growth firms.","PeriodicalId":101499,"journal":{"name":"PUBLIC ADMINISTRATION AND CIVIL SERVICE","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PREDICTING HIGH-GROWTH FIRMS IN KAZAKHSTAN WITH MACHINE LEARNING METHODS\",\"authors\":\"Yelzhas Kadyr, A. Aituar, S. Kemelbayeva\",\"doi\":\"10.52123/1994-2370-2022-668\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we study the effectiveness of popular machine learning methods for predicting high-growth firms in Kazakhstan and analyze this question with a set of 2012-2018 panel datasets. Moreover, we study the most important variables for the prediction of high-growth firms out of 50 variables included in the analysis. We develop a predictive design, where the past values are used to train classifiers that are applied in predicting future outcomes. Hereto, a test sample was used to evaluate the predictive performance of the classifiers. The results indicate that the best performing classifier increases the area under the curve equal to 0.8746. In terms of the variable importance, the firm’s past growth in size, past growth in employment, past growth in revenue, and second derivative of the growth of financial variables contributed the most to predicting high-growth firms.\",\"PeriodicalId\":101499,\"journal\":{\"name\":\"PUBLIC ADMINISTRATION AND CIVIL SERVICE\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PUBLIC ADMINISTRATION AND CIVIL SERVICE\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52123/1994-2370-2022-668\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PUBLIC ADMINISTRATION AND CIVIL SERVICE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52123/1994-2370-2022-668","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PREDICTING HIGH-GROWTH FIRMS IN KAZAKHSTAN WITH MACHINE LEARNING METHODS
In this paper, we study the effectiveness of popular machine learning methods for predicting high-growth firms in Kazakhstan and analyze this question with a set of 2012-2018 panel datasets. Moreover, we study the most important variables for the prediction of high-growth firms out of 50 variables included in the analysis. We develop a predictive design, where the past values are used to train classifiers that are applied in predicting future outcomes. Hereto, a test sample was used to evaluate the predictive performance of the classifiers. The results indicate that the best performing classifier increases the area under the curve equal to 0.8746. In terms of the variable importance, the firm’s past growth in size, past growth in employment, past growth in revenue, and second derivative of the growth of financial variables contributed the most to predicting high-growth firms.