{"title":"Fundamental Analysis of Detailed Financial Data: A Machine Learning Approach","authors":"Xi Chen, Yang Ha Cho, Y. Dou, B. Lev","doi":"10.2139/ssrn.3741015","DOIUrl":null,"url":null,"abstract":"We conduct a fundamental analysis of detailed financial information to predict earnings. Since 2012, all U.S. public companies must tag quantitative amounts in financial statements and footnotes of their 10-K reports using the eXtensible Business Reporting Language (XBRL). Leveraging machine learning methods, we combine the high-dimensional XBRL-tagged financial data into a summary measure for the direction of one-year-ahead earnings changes. The measure shows significant out-of-sample predictive power concerning the direction of earnings changes. Hedge portfolios are formed based on this measure during 2015-2018. The annual size-adjusted returns to the hedge portfolios range from 5.02 to 9.7 percent. Our measure and strategies outperform those of Ou and Penman (1989), who extract the summary measure from 65 accounting variables using logistic regressions. Additional analyses suggest that the outperformance stems from both nonlinear predictor interactions missed by regressions and the use of more detailed financial data.","PeriodicalId":134520,"journal":{"name":"ERPN: Finance (Innovation) (Sub-Topic)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERPN: Finance (Innovation) (Sub-Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3741015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
We conduct a fundamental analysis of detailed financial information to predict earnings. Since 2012, all U.S. public companies must tag quantitative amounts in financial statements and footnotes of their 10-K reports using the eXtensible Business Reporting Language (XBRL). Leveraging machine learning methods, we combine the high-dimensional XBRL-tagged financial data into a summary measure for the direction of one-year-ahead earnings changes. The measure shows significant out-of-sample predictive power concerning the direction of earnings changes. Hedge portfolios are formed based on this measure during 2015-2018. The annual size-adjusted returns to the hedge portfolios range from 5.02 to 9.7 percent. Our measure and strategies outperform those of Ou and Penman (1989), who extract the summary measure from 65 accounting variables using logistic regressions. Additional analyses suggest that the outperformance stems from both nonlinear predictor interactions missed by regressions and the use of more detailed financial data.