{"title":"缺失的价值对盈利预测重要吗?机器学习的视角。","authors":"Ajim Uddin, Xinyuan Tao, Chia-Ching Chou, Dantong Yu","doi":"10.1080/14697688.2021.1963825","DOIUrl":null,"url":null,"abstract":"<p><p>Analysts' forecast is one of the most common and important estimators for firms' future earnings. However, it is challenging to fully utilize because of the missing values. This study applies machine learning techniques to impute missing values in individual analysts' forecasts and subsequently to predict firms' future earnings based on both imputed and observed forecasts. After imputing missing values, the forecast error is reduced by 41% compared to the mean forecast, suggesting that missing values after imputation indeed useful for earnings forecast. We analyze multiple imputation methods and show that the out-performance of matrix factorization (MF) is consistent using different evaluation measures and across firms. Finally, we propose a stochastic gradient descent based coupled matrix factorization (CMF) to augment the imputation quality of missing values with multiple datasets. CMF further reduces the error of earnings forecast by 19% compared to the MF with a single dataset.</p>","PeriodicalId":20747,"journal":{"name":"Quantitative Finance","volume":"22 6","pages":"1113-1132"},"PeriodicalIF":1.5000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9246338/pdf/nihms-1739019.pdf","citationCount":"6","resultStr":"{\"title\":\"Are missing values important for earnings forecast? a machine learning perspective.\",\"authors\":\"Ajim Uddin, Xinyuan Tao, Chia-Ching Chou, Dantong Yu\",\"doi\":\"10.1080/14697688.2021.1963825\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Analysts' forecast is one of the most common and important estimators for firms' future earnings. However, it is challenging to fully utilize because of the missing values. This study applies machine learning techniques to impute missing values in individual analysts' forecasts and subsequently to predict firms' future earnings based on both imputed and observed forecasts. After imputing missing values, the forecast error is reduced by 41% compared to the mean forecast, suggesting that missing values after imputation indeed useful for earnings forecast. We analyze multiple imputation methods and show that the out-performance of matrix factorization (MF) is consistent using different evaluation measures and across firms. Finally, we propose a stochastic gradient descent based coupled matrix factorization (CMF) to augment the imputation quality of missing values with multiple datasets. CMF further reduces the error of earnings forecast by 19% compared to the MF with a single dataset.</p>\",\"PeriodicalId\":20747,\"journal\":{\"name\":\"Quantitative Finance\",\"volume\":\"22 6\",\"pages\":\"1113-1132\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9246338/pdf/nihms-1739019.pdf\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Finance\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1080/14697688.2021.1963825\",\"RegionNum\":4,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Finance","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1080/14697688.2021.1963825","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
Are missing values important for earnings forecast? a machine learning perspective.
Analysts' forecast is one of the most common and important estimators for firms' future earnings. However, it is challenging to fully utilize because of the missing values. This study applies machine learning techniques to impute missing values in individual analysts' forecasts and subsequently to predict firms' future earnings based on both imputed and observed forecasts. After imputing missing values, the forecast error is reduced by 41% compared to the mean forecast, suggesting that missing values after imputation indeed useful for earnings forecast. We analyze multiple imputation methods and show that the out-performance of matrix factorization (MF) is consistent using different evaluation measures and across firms. Finally, we propose a stochastic gradient descent based coupled matrix factorization (CMF) to augment the imputation quality of missing values with multiple datasets. CMF further reduces the error of earnings forecast by 19% compared to the MF with a single dataset.
期刊介绍:
The frontiers of finance are shifting rapidly, driven in part by the increasing use of quantitative methods in the field. Quantitative Finance welcomes original research articles that reflect the dynamism of this area. The journal provides an interdisciplinary forum for presenting both theoretical and empirical approaches and offers rapid publication of original new work with high standards of quality. The readership is broad, embracing researchers and practitioners across a range of specialisms and within a variety of organizations. All articles should aim to be of interest to this broad readership.