基于混合树的短期保险理赔方法研究

IF 0.7 3区 工程技术 Q4 ENGINEERING, INDUSTRIAL
Zhiyu Quan, Zhiguo Wang, Guojun Gan, Emiliano A. Valdez
{"title":"基于混合树的短期保险理赔方法研究","authors":"Zhiyu Quan, Zhiguo Wang, Guojun Gan, Emiliano A. Valdez","doi":"10.1017/S0269964823000074","DOIUrl":null,"url":null,"abstract":"Abstract Two-part framework and the Tweedie generalized linear model (GLM) have traditionally been used to model loss costs for short-term insurance contracts. For most portfolios of insurance claims, there is typically a large proportion of zero claims that leads to imbalances, resulting in lower prediction accuracy of these traditional approaches. In this article, we propose the use of tree-based methods with a hybrid structure that involves a two-step algorithm as an alternative approach. For example, the first step is the construction of a classification tree to build the probability model for claim frequency. The second step is the application of elastic net regression models at each terminal node from the classification tree to build the distribution models for claim severity. This hybrid structure captures the benefits of tuning hyperparameters at each step of the algorithm; this allows for improved prediction accuracy, and tuning can be performed to meet specific business objectives. An obvious major advantage of this hybrid structure is improved model interpretability. We examine and compare the predictive performance of this hybrid structure relative to the traditional Tweedie GLM using both simulated and real datasets. Our empirical results show that these hybrid tree-based methods produce more accurate and informative predictions.","PeriodicalId":54582,"journal":{"name":"Probability in the Engineering and Informational Sciences","volume":"24 1","pages":"597 - 620"},"PeriodicalIF":0.7000,"publicationDate":"2023-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On hybrid tree-based methods for short-term insurance claims\",\"authors\":\"Zhiyu Quan, Zhiguo Wang, Guojun Gan, Emiliano A. Valdez\",\"doi\":\"10.1017/S0269964823000074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Two-part framework and the Tweedie generalized linear model (GLM) have traditionally been used to model loss costs for short-term insurance contracts. For most portfolios of insurance claims, there is typically a large proportion of zero claims that leads to imbalances, resulting in lower prediction accuracy of these traditional approaches. In this article, we propose the use of tree-based methods with a hybrid structure that involves a two-step algorithm as an alternative approach. For example, the first step is the construction of a classification tree to build the probability model for claim frequency. The second step is the application of elastic net regression models at each terminal node from the classification tree to build the distribution models for claim severity. This hybrid structure captures the benefits of tuning hyperparameters at each step of the algorithm; this allows for improved prediction accuracy, and tuning can be performed to meet specific business objectives. An obvious major advantage of this hybrid structure is improved model interpretability. We examine and compare the predictive performance of this hybrid structure relative to the traditional Tweedie GLM using both simulated and real datasets. Our empirical results show that these hybrid tree-based methods produce more accurate and informative predictions.\",\"PeriodicalId\":54582,\"journal\":{\"name\":\"Probability in the Engineering and Informational Sciences\",\"volume\":\"24 1\",\"pages\":\"597 - 620\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Probability in the Engineering and Informational Sciences\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1017/S0269964823000074\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Probability in the Engineering and Informational Sciences","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1017/S0269964823000074","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 1

摘要

摘要对于短期保险合同的损失成本模型,传统上采用两部分框架和Tweedie广义线性模型(GLM)进行建模。对于大多数保险索赔组合,通常存在很大比例的零索赔,导致不平衡,导致这些传统方法的预测精度较低。在本文中,我们建议使用混合结构的基于树的方法,其中包括两步算法作为替代方法。例如,第一步是构建分类树来构建索赔频率的概率模型。第二步是在分类树的每个终端节点上应用弹性网络回归模型来构建索赔严重程度的分布模型。这种混合结构抓住了在算法的每一步调优超参数的好处;这允许改进预测准确性,并且可以执行调优以满足特定的业务目标。这种混合结构的一个明显的主要优点是提高了模型的可解释性。我们使用模拟和真实数据集检查并比较了这种混合结构相对于传统Tweedie GLM的预测性能。我们的实证结果表明,这些基于混合树的方法产生了更准确和信息丰富的预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On hybrid tree-based methods for short-term insurance claims
Abstract Two-part framework and the Tweedie generalized linear model (GLM) have traditionally been used to model loss costs for short-term insurance contracts. For most portfolios of insurance claims, there is typically a large proportion of zero claims that leads to imbalances, resulting in lower prediction accuracy of these traditional approaches. In this article, we propose the use of tree-based methods with a hybrid structure that involves a two-step algorithm as an alternative approach. For example, the first step is the construction of a classification tree to build the probability model for claim frequency. The second step is the application of elastic net regression models at each terminal node from the classification tree to build the distribution models for claim severity. This hybrid structure captures the benefits of tuning hyperparameters at each step of the algorithm; this allows for improved prediction accuracy, and tuning can be performed to meet specific business objectives. An obvious major advantage of this hybrid structure is improved model interpretability. We examine and compare the predictive performance of this hybrid structure relative to the traditional Tweedie GLM using both simulated and real datasets. Our empirical results show that these hybrid tree-based methods produce more accurate and informative predictions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.20
自引率
18.20%
发文量
45
审稿时长
>12 weeks
期刊介绍: The primary focus of the journal is on stochastic modelling in the physical and engineering sciences, with particular emphasis on queueing theory, reliability theory, inventory theory, simulation, mathematical finance and probabilistic networks and graphs. Papers on analytic properties and related disciplines are also considered, as well as more general papers on applied and computational probability, if appropriate. Readers include academics working in statistics, operations research, computer science, engineering, management science and physical sciences as well as industrial practitioners engaged in telecommunications, computer science, financial engineering, operations research and management science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信