{"title":"预测大数据和学习时代的全球股票收益分布","authors":"Jozef Barunik, Martin Hronec, Ondrej Tobek","doi":"arxiv-2408.07497","DOIUrl":null,"url":null,"abstract":"This paper presents a method for accurately predicting the full distribution\nof stock returns, given a comprehensive set of 194 stock characteristics and\nmarket variables. Such distributions, learned from rich data using a machine\nlearning algorithm, are not constrained by restrictive model assumptions and\nallow the exploration of non-Gaussian, heavy-tailed data and their non-linear\ninteractions. The method uses a two-stage quantile neural network combined with\nspline interpolation. The results show that the proposed approach outperforms\nalternative models in terms of out-of-sample losses. Furthermore, we show that\nthe moments derived from such distributions can be useful as alternative\nempirical estimates in many cases, including mean estimation and forecasting.\nFinally, we examine the relationship between cross-sectional returns and\nseveral distributional characteristics. The results are robust to a wide range\nof US and international data.","PeriodicalId":501045,"journal":{"name":"arXiv - QuantFin - Portfolio Management","volume":"80 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the distributions of stock returns around the globe in the era of big data and learning\",\"authors\":\"Jozef Barunik, Martin Hronec, Ondrej Tobek\",\"doi\":\"arxiv-2408.07497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a method for accurately predicting the full distribution\\nof stock returns, given a comprehensive set of 194 stock characteristics and\\nmarket variables. Such distributions, learned from rich data using a machine\\nlearning algorithm, are not constrained by restrictive model assumptions and\\nallow the exploration of non-Gaussian, heavy-tailed data and their non-linear\\ninteractions. The method uses a two-stage quantile neural network combined with\\nspline interpolation. The results show that the proposed approach outperforms\\nalternative models in terms of out-of-sample losses. Furthermore, we show that\\nthe moments derived from such distributions can be useful as alternative\\nempirical estimates in many cases, including mean estimation and forecasting.\\nFinally, we examine the relationship between cross-sectional returns and\\nseveral distributional characteristics. The results are robust to a wide range\\nof US and international data.\",\"PeriodicalId\":501045,\"journal\":{\"name\":\"arXiv - QuantFin - Portfolio Management\",\"volume\":\"80 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuantFin - Portfolio Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.07497\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Portfolio Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.07497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting the distributions of stock returns around the globe in the era of big data and learning
This paper presents a method for accurately predicting the full distribution
of stock returns, given a comprehensive set of 194 stock characteristics and
market variables. Such distributions, learned from rich data using a machine
learning algorithm, are not constrained by restrictive model assumptions and
allow the exploration of non-Gaussian, heavy-tailed data and their non-linear
interactions. The method uses a two-stage quantile neural network combined with
spline interpolation. The results show that the proposed approach outperforms
alternative models in terms of out-of-sample losses. Furthermore, we show that
the moments derived from such distributions can be useful as alternative
empirical estimates in many cases, including mean estimation and forecasting.
Finally, we examine the relationship between cross-sectional returns and
several distributional characteristics. The results are robust to a wide range
of US and international data.