{"title":"交互式偏好分析:强化学习框架","authors":"","doi":"10.1016/j.ejor.2024.06.033","DOIUrl":null,"url":null,"abstract":"<div><p>Automated investment managers are increasingly popular in personal wealth management due to their cost effectiveness, objectivity, and accessibility. However, it still suffers from several dilemmas, e.g., cold start, over-specialization, and black boxes. To solve these issues, we develop an online reinforcement learning framework based on the multi-armed bandit algorithm to offer personalized investment advice. We provide a comprehensive theoretical procedure for developing this framework. This framework not only enables us to capture the evolving preferences of investors effectively but also has a strong explainability power to provide more implications regarding why one financial product is preferred. We further evaluate our basic model through a large-scale, real-world data set from a leading wealth management platform. The results show a stronger effectiveness of the proposed framework compared to other well-recognized benchmark models. Furthermore, we extend our basic model to address the potential agency problem between the robo-advisor and the investors. Another extension is also provided through an optimization scheme to account for the investors’ demands for diversification in multiple aspects.</p></div>","PeriodicalId":55161,"journal":{"name":"European Journal of Operational Research","volume":null,"pages":null},"PeriodicalIF":6.0000,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interactive preference analysis: A reinforcement learning framework\",\"authors\":\"\",\"doi\":\"10.1016/j.ejor.2024.06.033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Automated investment managers are increasingly popular in personal wealth management due to their cost effectiveness, objectivity, and accessibility. However, it still suffers from several dilemmas, e.g., cold start, over-specialization, and black boxes. To solve these issues, we develop an online reinforcement learning framework based on the multi-armed bandit algorithm to offer personalized investment advice. We provide a comprehensive theoretical procedure for developing this framework. This framework not only enables us to capture the evolving preferences of investors effectively but also has a strong explainability power to provide more implications regarding why one financial product is preferred. We further evaluate our basic model through a large-scale, real-world data set from a leading wealth management platform. The results show a stronger effectiveness of the proposed framework compared to other well-recognized benchmark models. Furthermore, we extend our basic model to address the potential agency problem between the robo-advisor and the investors. Another extension is also provided through an optimization scheme to account for the investors’ demands for diversification in multiple aspects.</p></div>\",\"PeriodicalId\":55161,\"journal\":{\"name\":\"European Journal of Operational Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2024-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Operational Research\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0377221724005125\",\"RegionNum\":2,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPERATIONS RESEARCH & MANAGEMENT SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Operational Research","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377221724005125","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
Interactive preference analysis: A reinforcement learning framework
Automated investment managers are increasingly popular in personal wealth management due to their cost effectiveness, objectivity, and accessibility. However, it still suffers from several dilemmas, e.g., cold start, over-specialization, and black boxes. To solve these issues, we develop an online reinforcement learning framework based on the multi-armed bandit algorithm to offer personalized investment advice. We provide a comprehensive theoretical procedure for developing this framework. This framework not only enables us to capture the evolving preferences of investors effectively but also has a strong explainability power to provide more implications regarding why one financial product is preferred. We further evaluate our basic model through a large-scale, real-world data set from a leading wealth management platform. The results show a stronger effectiveness of the proposed framework compared to other well-recognized benchmark models. Furthermore, we extend our basic model to address the potential agency problem between the robo-advisor and the investors. Another extension is also provided through an optimization scheme to account for the investors’ demands for diversification in multiple aspects.
期刊介绍:
The European Journal of Operational Research (EJOR) publishes high quality, original papers that contribute to the methodology of operational research (OR) and to the practice of decision making.