{"title":"将股票市场投资者建模为强化学习代理","authors":"A. Pastore, Umberto Esposito, E. Vasilaki","doi":"10.1109/EAIS.2015.7368789","DOIUrl":null,"url":null,"abstract":"Decision making in uncertain and risky environments is a prominent area of research. Standard economic theories fail to fully explain human behaviour, while a potentially promising alternative may lie in the direction of Reinforcement Learning (RL) theory. We analyse data for 46 players extracted from a financial market online game and test whether Reinforcement Learning (Q-Learning) could capture these players behaviour using a riskiness measure based on financial modeling. Moreover we test an earlier hypothesis that players are “naíve” (short-sighted). Our results indicate that Reinforcement Learning is a component of the decision-making process. We also find that there is a significant improvement of fitting for some of the players when using a full RL model against a reduced version (myopic), where only immediate reward is valued by the players, indicating that not all players are naíve.","PeriodicalId":325875,"journal":{"name":"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Modelling stock-market investors as Reinforcement Learning agents\",\"authors\":\"A. Pastore, Umberto Esposito, E. Vasilaki\",\"doi\":\"10.1109/EAIS.2015.7368789\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Decision making in uncertain and risky environments is a prominent area of research. Standard economic theories fail to fully explain human behaviour, while a potentially promising alternative may lie in the direction of Reinforcement Learning (RL) theory. We analyse data for 46 players extracted from a financial market online game and test whether Reinforcement Learning (Q-Learning) could capture these players behaviour using a riskiness measure based on financial modeling. Moreover we test an earlier hypothesis that players are “naíve” (short-sighted). Our results indicate that Reinforcement Learning is a component of the decision-making process. We also find that there is a significant improvement of fitting for some of the players when using a full RL model against a reduced version (myopic), where only immediate reward is valued by the players, indicating that not all players are naíve.\",\"PeriodicalId\":325875,\"journal\":{\"name\":\"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EAIS.2015.7368789\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EAIS.2015.7368789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Modelling stock-market investors as Reinforcement Learning agents
Decision making in uncertain and risky environments is a prominent area of research. Standard economic theories fail to fully explain human behaviour, while a potentially promising alternative may lie in the direction of Reinforcement Learning (RL) theory. We analyse data for 46 players extracted from a financial market online game and test whether Reinforcement Learning (Q-Learning) could capture these players behaviour using a riskiness measure based on financial modeling. Moreover we test an earlier hypothesis that players are “naíve” (short-sighted). Our results indicate that Reinforcement Learning is a component of the decision-making process. We also find that there is a significant improvement of fitting for some of the players when using a full RL model against a reduced version (myopic), where only immediate reward is valued by the players, indicating that not all players are naíve.