Non-Differentiable Loss Function Optimization and Interaction Effect Discovery in Insurance Pricing Using the Genetic Algorithm

IF 2 Q2 BUSINESS, FINANCE
Risks Pub Date : 2024-05-14 DOI:10.3390/risks12050079
Robin Van Oirbeek, Félix Vandervorst, Thomas Bury, Gireg Willame, Christopher Grumiau, Tim Verdonck
{"title":"Non-Differentiable Loss Function Optimization and Interaction Effect Discovery in Insurance Pricing Using the Genetic Algorithm","authors":"Robin Van Oirbeek, Félix Vandervorst, Thomas Bury, Gireg Willame, Christopher Grumiau, Tim Verdonck","doi":"10.3390/risks12050079","DOIUrl":null,"url":null,"abstract":"Insurance pricing is the process of determining the premiums that policyholders pay in exchange for insurance coverage. In order to estimate premiums, actuaries use statistical based methods, assessing various factors such as the probability of certain events occurring (like accidents or damages), where the Generalized Linear Models (GLMs) are the industry standard method. Traditional GLM approaches face limitations due to non-differentiable loss functions and expansive variable spaces, including both main and interaction terms. In this study, we address the challenge of selecting relevant variables for GLMs used in non-life insurance pricing both for frequency or severity analyses, amidst an increasing volume of data and variables. We propose a novel application of the Genetic Algorithm (GA) to efficiently identify pertinent main and interaction effects in GLMs, even in scenarios with a high variable count and diverse loss functions. Our approach uniquely aligns GLM predictions with those of black box machine learning models, enhancing their interpretability and reliability. Using a publicly available non-life motor data set, we demonstrate the GA’s effectiveness by comparing its selected GLM with a Gradient Boosted Machine (GBM) model. The results show a strong consistency between the main and interaction terms identified by GA for the GLM and those revealed in the GBM analysis, highlighting the potential of our method to refine and improve pricing models in the insurance sector.","PeriodicalId":21282,"journal":{"name":"Risks","volume":"52 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Risks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/risks12050079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0

Abstract

Insurance pricing is the process of determining the premiums that policyholders pay in exchange for insurance coverage. In order to estimate premiums, actuaries use statistical based methods, assessing various factors such as the probability of certain events occurring (like accidents or damages), where the Generalized Linear Models (GLMs) are the industry standard method. Traditional GLM approaches face limitations due to non-differentiable loss functions and expansive variable spaces, including both main and interaction terms. In this study, we address the challenge of selecting relevant variables for GLMs used in non-life insurance pricing both for frequency or severity analyses, amidst an increasing volume of data and variables. We propose a novel application of the Genetic Algorithm (GA) to efficiently identify pertinent main and interaction effects in GLMs, even in scenarios with a high variable count and diverse loss functions. Our approach uniquely aligns GLM predictions with those of black box machine learning models, enhancing their interpretability and reliability. Using a publicly available non-life motor data set, we demonstrate the GA’s effectiveness by comparing its selected GLM with a Gradient Boosted Machine (GBM) model. The results show a strong consistency between the main and interaction terms identified by GA for the GLM and those revealed in the GBM analysis, highlighting the potential of our method to refine and improve pricing models in the insurance sector.
利用遗传算法优化保险定价中的无差异损失函数并发现交互效应
保险定价是确定投保人为换取保险而支付的保费的过程。为了估算保费,精算师使用基于统计的方法,评估各种因素,如某些事件(如事故或损害)发生的概率,其中广义线性模型(GLM)是行业标准方法。传统的 GLM 方法由于损失函数的不可分性和变量空间的广阔性(包括主项和交互项)而受到限制。在本研究中,我们要解决的难题是,在数据量和变量不断增加的情况下,如何为用于频率或严重性分析的非寿险定价 GLMs 选择相关变量。我们提出了一种新颖的遗传算法(GA)应用,即使在变量数量较多、损失函数多样的情况下,也能有效识别 GLM 中的相关主效应和交互效应。我们的方法独特地使 GLM 预测与黑盒机器学习模型的预测相一致,从而提高了预测的可解释性和可靠性。我们使用公开的非生命电机数据集,通过比较所选的 GLM 与梯度提升机(GBM)模型,证明了 GA 的有效性。结果表明,GA 为 GLM 确定的主项和交互项与 GBM 分析中揭示的主项和交互项之间具有很强的一致性,这凸显了我们的方法在完善和改进保险行业定价模型方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Risks
Risks Economics, Econometrics and Finance-Economics, Econometrics and Finance (miscellaneous)
CiteScore
3.80
自引率
22.70%
发文量
205
审稿时长
11 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信