{"title":"基于领域知识的非线性张量补全:在分析师收益预测中的应用","authors":"Ajim Uddin, Xinyuan Tao, Chia-Ching Chou, Dantong Yu","doi":"10.1109/ICDMW51313.2020.00059","DOIUrl":null,"url":null,"abstract":"Financial analysts' earnings forecast is one of the most critical inputs for security valuation and investment decisions. However, it is challenging to utilize such information for two main reasons: missing values and heterogeneity among analysts. In this paper, we show that one recent breakthrough in nonlinear tensor completion algorithm, CoSTCo [1], overcomes the difficulty by imputing missing values and significantly improves the forecast accuracy in earnings. Compared with conventional imputation approaches, CoSTCo effectively captures latent information and reduces the tensor completion errors by 50%, even with 98% missing values. Furthermore, we show that using firm characteristics as auxiliary information we can improve firms' earnings prediction accuracy by 6%. Results are consistent using different performance metrics and across various industry sectors. Notably, the performance improvement is more salient for the sectors with high heterogeneity. Our findings imply the successful application of advanced ML techniques in a real financial problem.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"16 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Nonlinear Tensor Completion Using Domain Knowledge: An Application in Analysts' Earnings Forecast\",\"authors\":\"Ajim Uddin, Xinyuan Tao, Chia-Ching Chou, Dantong Yu\",\"doi\":\"10.1109/ICDMW51313.2020.00059\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Financial analysts' earnings forecast is one of the most critical inputs for security valuation and investment decisions. However, it is challenging to utilize such information for two main reasons: missing values and heterogeneity among analysts. In this paper, we show that one recent breakthrough in nonlinear tensor completion algorithm, CoSTCo [1], overcomes the difficulty by imputing missing values and significantly improves the forecast accuracy in earnings. Compared with conventional imputation approaches, CoSTCo effectively captures latent information and reduces the tensor completion errors by 50%, even with 98% missing values. Furthermore, we show that using firm characteristics as auxiliary information we can improve firms' earnings prediction accuracy by 6%. Results are consistent using different performance metrics and across various industry sectors. Notably, the performance improvement is more salient for the sectors with high heterogeneity. Our findings imply the successful application of advanced ML techniques in a real financial problem.\",\"PeriodicalId\":426846,\"journal\":{\"name\":\"2020 International Conference on Data Mining Workshops (ICDMW)\",\"volume\":\"16 9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Data Mining Workshops (ICDMW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDMW51313.2020.00059\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW51313.2020.00059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Nonlinear Tensor Completion Using Domain Knowledge: An Application in Analysts' Earnings Forecast
Financial analysts' earnings forecast is one of the most critical inputs for security valuation and investment decisions. However, it is challenging to utilize such information for two main reasons: missing values and heterogeneity among analysts. In this paper, we show that one recent breakthrough in nonlinear tensor completion algorithm, CoSTCo [1], overcomes the difficulty by imputing missing values and significantly improves the forecast accuracy in earnings. Compared with conventional imputation approaches, CoSTCo effectively captures latent information and reduces the tensor completion errors by 50%, even with 98% missing values. Furthermore, we show that using firm characteristics as auxiliary information we can improve firms' earnings prediction accuracy by 6%. Results are consistent using different performance metrics and across various industry sectors. Notably, the performance improvement is more salient for the sectors with high heterogeneity. Our findings imply the successful application of advanced ML techniques in a real financial problem.