{"title":"强化学习的溢价控制","authors":"L. Palmborg, F. Lindskog","doi":"10.1017/asb.2023.13","DOIUrl":null,"url":null,"abstract":"Abstract We consider a premium control problem in discrete time, formulated in terms of a Markov decision process. In a simplified setting, the optimal premium rule can be derived with dynamic programming methods. However, these classical methods are not feasible in a more realistic setting due to the dimension of the state space and lack of explicit expressions for transition probabilities. We explore reinforcement learning techniques, using function approximation, to solve the premium control problem for realistic stochastic models. We illustrate the appropriateness of the approximate optimal premium rule compared with the true optimal premium rule in a simplified setting and further demonstrate that the approximate optimal premium rule outperforms benchmark rules in more realistic settings where classical approaches fail.","PeriodicalId":8617,"journal":{"name":"ASTIN Bulletin","volume":"83 1","pages":"233 - 257"},"PeriodicalIF":1.7000,"publicationDate":"2023-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Premium control with reinforcement learning\",\"authors\":\"L. Palmborg, F. Lindskog\",\"doi\":\"10.1017/asb.2023.13\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract We consider a premium control problem in discrete time, formulated in terms of a Markov decision process. In a simplified setting, the optimal premium rule can be derived with dynamic programming methods. However, these classical methods are not feasible in a more realistic setting due to the dimension of the state space and lack of explicit expressions for transition probabilities. We explore reinforcement learning techniques, using function approximation, to solve the premium control problem for realistic stochastic models. We illustrate the appropriateness of the approximate optimal premium rule compared with the true optimal premium rule in a simplified setting and further demonstrate that the approximate optimal premium rule outperforms benchmark rules in more realistic settings where classical approaches fail.\",\"PeriodicalId\":8617,\"journal\":{\"name\":\"ASTIN Bulletin\",\"volume\":\"83 1\",\"pages\":\"233 - 257\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ASTIN Bulletin\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1017/asb.2023.13\",\"RegionNum\":3,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ASTIN Bulletin","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1017/asb.2023.13","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
Abstract: We consider a premium control problem in discrete time, formulated in terms of a Markov decision process. In a simplified setting, the optimal premium rule can be derived with dynamic programming methods. However, these classical methods are not feasible in more realistic settings because of the dimension of the state space and the lack of explicit expressions for the transition probabilities. We explore reinforcement learning techniques with function approximation to solve the premium control problem for realistic stochastic models. We illustrate the appropriateness of the approximate optimal premium rule by comparing it with the true optimal premium rule in a simplified setting, and further demonstrate that the approximate optimal premium rule outperforms benchmark rules in more realistic settings where classical approaches fail.
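The abstract contrasts two computational approaches: dynamic programming when the model is small and its transition probabilities are known explicitly, and reinforcement learning with function approximation otherwise. The following is a minimal sketch, in Python, of the first approach: value iteration on a toy premium control MDP. Every modeling choice here (the surplus grid, the premium loadings, the claim distribution, the reward) is an illustrative assumption for exposition, not the model used in the paper.

```python
import numpy as np

n_states = 50              # discretized surplus levels 0..49 (assumed grid)
actions = [0.8, 1.0, 1.2]  # premium loadings to choose from (hypothetical)
gamma = 0.95               # discount factor
target = 25                # desired surplus level (hypothetical)

# Toy claim distribution: claim size in {0, ..., 9} with geometric weights.
claim_vals = np.arange(10)
claim_probs = 0.5 ** (claim_vals + 1.0)
claim_probs /= claim_probs.sum()

def next_states(s, a):
    """Next surplus = surplus + premium income - claim, clipped to the grid."""
    income = int(round(5 * a))  # premium income per period (toy scale)
    return np.clip(s + income - claim_vals, 0, n_states - 1)

def reward(s, a):
    """Penalize deviation from the target surplus and extreme premiums."""
    return -abs(s - target) - 2.0 * abs(a - 1.0)

# Value iteration: feasible here only because the transition
# probabilities (claim_probs) are known explicitly.
V = np.zeros(n_states)
for _ in range(1000):
    Q = np.array([[reward(s, a) + gamma * claim_probs @ V[next_states(s, a)]
                   for a in actions] for s in range(n_states)])
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-9:
        break
    V = V_new

premium_rule = [actions[i] for i in Q.argmax(axis=1)]  # premium per surplus level
print(premium_rule[:10])
```

When the state space is too large to enumerate, or the transition probabilities have no explicit expression, a premium rule can instead be learned from simulated transitions alone. Below is an equally hedged sketch of one standard technique in the family the abstract refers to, semi-gradient SARSA with linear function approximation (the paper's exact algorithm and features may differ). It reuses the toy model defined in the previous snippet.

```python
rng = np.random.default_rng(0)
alpha, eps = 0.05, 0.1  # step size and exploration rate (assumed values)

def features(s, i):
    """Toy linear basis: per-action (1, x, x^2) in the scaled surplus."""
    x = s / n_states
    phi = np.zeros(3 * len(actions))
    phi[3 * i: 3 * i + 3] = (1.0, x, x * x)
    return phi

w = np.zeros(3 * len(actions))  # weights of the linear approximation

def q(s, i):
    return features(s, i) @ w

def sample_step(s, a):
    """Simulate one transition; no transition probabilities needed."""
    claim = rng.choice(claim_vals, p=claim_probs)
    return int(np.clip(s + int(round(5 * a)) - claim, 0, n_states - 1))

def greedy(s):
    return int(np.argmax([q(s, j) for j in range(len(actions))]))

def eps_greedy(s):
    return int(rng.integers(len(actions))) if rng.random() < eps else greedy(s)

s, i = target, eps_greedy(target)
for _ in range(100_000):
    s2 = sample_step(s, actions[i])
    r = reward(s, actions[i])
    i2 = eps_greedy(s2)
    # Semi-gradient SARSA update toward the one-step bootstrapped target.
    td_error = r + gamma * q(s2, i2) - q(s, i)
    w += alpha * td_error * features(s, i)
    s, i = s2, i2

print([actions[greedy(s)] for s in range(0, n_states, 5)])
```

Note that the learner only ever queries the simulator `sample_step`, never the claim distribution itself, which is what makes this style of method feasible for the realistic stochastic models where classical dynamic programming fails.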
Journal introduction:
ASTIN Bulletin publishes papers that are relevant to any branch of actuarial science and insurance mathematics. Its papers are quantitative and scientific in nature, and draw on theory and methods developed in any branch of the mathematical sciences, including actuarial mathematics, statistics, probability, financial mathematics, and econometrics.