{"title":"Insurance loss modeling with gradient tree-boosted mixture models","authors":"Yanxi Hou , Jiahong Li , Guangyuan Gao","doi":"10.1016/j.insmatheco.2024.12.007","DOIUrl":null,"url":null,"abstract":"<div><div>In actuarial practice, finite mixture model is one widely applied statistical method to model the insurance loss. Although the Expectation-Maximization (EM) algorithm usually plays an essential tool for the parameter estimation of mixture models, it suffers from other issues which cause unstable predictions. For example, feature engineering and variable selection are two crucial modeling issues that are challenging for mixture models as they involve several component models. Avoiding overfitting is another technical concern of the modeling method for the prediction of future losses. To address those issues, we propose an Expectation-Boosting (EB) algorithm, which implements the gradient boosting decision trees to adaptively increase the likelihood in the second step. Our proposed EB algorithm can estimate both the mixing probabilities and the component parameters non-parametrically and overfitting-sensitively, and further perform automated feature engineering, model fitting, and variable selection simultaneously, which fully explores the predictive power of feature space. Moreover, the proposed algorithm can be combined with parallel computation methods to improve computation efficiency. Finally, we conduct two simulation studies to show the good performance of the proposed algorithm and an empirical analysis of the claim amounts for illustration.</div></div>","PeriodicalId":54974,"journal":{"name":"Insurance Mathematics & Economics","volume":"121 ","pages":"Pages 45-62"},"PeriodicalIF":1.9000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insurance Mathematics & Economics","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016766872400132X","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
In actuarial practice, finite mixture model is one widely applied statistical method to model the insurance loss. Although the Expectation-Maximization (EM) algorithm usually plays an essential tool for the parameter estimation of mixture models, it suffers from other issues which cause unstable predictions. For example, feature engineering and variable selection are two crucial modeling issues that are challenging for mixture models as they involve several component models. Avoiding overfitting is another technical concern of the modeling method for the prediction of future losses. To address those issues, we propose an Expectation-Boosting (EB) algorithm, which implements the gradient boosting decision trees to adaptively increase the likelihood in the second step. Our proposed EB algorithm can estimate both the mixing probabilities and the component parameters non-parametrically and overfitting-sensitively, and further perform automated feature engineering, model fitting, and variable selection simultaneously, which fully explores the predictive power of feature space. Moreover, the proposed algorithm can be combined with parallel computation methods to improve computation efficiency. Finally, we conduct two simulation studies to show the good performance of the proposed algorithm and an empirical analysis of the claim amounts for illustration.
期刊介绍:
Insurance: Mathematics and Economics publishes leading research spanning all fields of actuarial science research. It appears six times per year and is the largest journal in actuarial science research around the world.
Insurance: Mathematics and Economics is an international academic journal that aims to strengthen the communication between individuals and groups who develop and apply research results in actuarial science. The journal feels a particular obligation to facilitate closer cooperation between those who conduct research in insurance mathematics and quantitative insurance economics, and practicing actuaries who are interested in the implementation of the results. To this purpose, Insurance: Mathematics and Economics publishes high-quality articles of broad international interest, concerned with either the theory of insurance mathematics and quantitative insurance economics or the inventive application of it, including empirical or experimental results. Articles that combine several of these aspects are particularly considered.