Automobile Insurance Fraud Detection Based on PSO-XGBoost Model and Interpretable Machine Learning Method

IF 1.9 2区 经济学 Q2 ECONOMICS
Ning Ding , Xiao Ruan , Hao Wang , Yuan Liu
{"title":"Automobile Insurance Fraud Detection Based on PSO-XGBoost Model and Interpretable Machine Learning Method","authors":"Ning Ding ,&nbsp;Xiao Ruan ,&nbsp;Hao Wang ,&nbsp;Yuan Liu","doi":"10.1016/j.insmatheco.2024.11.006","DOIUrl":null,"url":null,"abstract":"<div><div>Automobile insurance fraud has become a critical concern for the insurance industry, posing significant threats to socio-economic stability and commercial interests. To tackle these challenges, this paper proposes a PSO-XGBoost fraud detection framework and uses explainable artificial intelligence to interpret the predictions. The framework combines an XGBoost classifier with the particle swarm optimization algorithm and is validated through a comparative evaluation against other models. Traditional methods, including SVM, Naive Bayes, Logistic Regression, and BP Neural Network, demonstrate moderate accuracy, ranging from 54.1% to 68.6%, while more advanced models like Random Forest reach up to 78.4%. Compared to the standard XGBoost, the PSO-optimized model achieves 3% superior accuracy, achieving an impressive 95% success rate. Moreover, SHAP is used to extract and visually depict the contribution of each feature to the model's predictions. It turns out that the policyholder's claim amount is the most significant factor in detecting automobile insurance fraud, with other factors such as vehicle type, responsible party, and the insurer's age also considerably influencing the prediction performance. This paper therefore proves that combining the PSO-XGBoost model with SHAP approach can substantially improve the early warning and prevention of automobile insurance fraud.</div></div>","PeriodicalId":54974,"journal":{"name":"Insurance Mathematics & Economics","volume":"120 ","pages":"Pages 51-60"},"PeriodicalIF":1.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insurance Mathematics & Economics","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167668724001112","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

Automobile insurance fraud has become a critical concern for the insurance industry, posing significant threats to socio-economic stability and commercial interests. To tackle these challenges, this paper proposes a PSO-XGBoost fraud detection framework and uses explainable artificial intelligence to interpret the predictions. The framework combines an XGBoost classifier with the particle swarm optimization algorithm and is validated through a comparative evaluation against other models. Traditional methods, including SVM, Naive Bayes, Logistic Regression, and BP Neural Network, demonstrate moderate accuracy, ranging from 54.1% to 68.6%, while more advanced models like Random Forest reach up to 78.4%. Compared to the standard XGBoost, the PSO-optimized model achieves 3% superior accuracy, achieving an impressive 95% success rate. Moreover, SHAP is used to extract and visually depict the contribution of each feature to the model's predictions. It turns out that the policyholder's claim amount is the most significant factor in detecting automobile insurance fraud, with other factors such as vehicle type, responsible party, and the insurer's age also considerably influencing the prediction performance. This paper therefore proves that combining the PSO-XGBoost model with SHAP approach can substantially improve the early warning and prevention of automobile insurance fraud.
基于 PSO-XGBoost 模型和可解释机器学习方法的汽车保险欺诈检测
汽车保险欺诈已成为保险业关注的一个重要问题,对社会经济稳定和商业利益构成重大威胁。为应对这些挑战,本文提出了一个 PSO-XGBoost 欺诈检测框架,并使用可解释人工智能来解释预测结果。该框架将 XGBoost 分类器与粒子群优化算法相结合,并通过与其他模型的对比评估进行了验证。包括 SVM、Naive Bayes、Logistic Regression 和 BP 神经网络在内的传统方法显示出中等准确率,从 54.1% 到 68.6% 不等,而随机森林等更先进的模型则高达 78.4%。与标准的 XGBoost 模型相比,PSO 优化模型的准确率高出 3%,成功率高达 95%。此外,SHAP 还用于提取和直观描述每个特征对模型预测的贡献。结果表明,投保人的理赔金额是检测汽车保险欺诈的最重要因素,其他因素如车辆类型、责任方和保险人的年龄也对预测性能有很大影响。因此,本文证明了将 PSO-XGBoost 模型与 SHAP 方法相结合可以大大提高汽车保险欺诈的预警和预防能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Insurance Mathematics & Economics
Insurance Mathematics & Economics 管理科学-数学跨学科应用
CiteScore
3.40
自引率
15.80%
发文量
90
审稿时长
17.3 weeks
期刊介绍: Insurance: Mathematics and Economics publishes leading research spanning all fields of actuarial science research. It appears six times per year and is the largest journal in actuarial science research around the world. Insurance: Mathematics and Economics is an international academic journal that aims to strengthen the communication between individuals and groups who develop and apply research results in actuarial science. The journal feels a particular obligation to facilitate closer cooperation between those who conduct research in insurance mathematics and quantitative insurance economics, and practicing actuaries who are interested in the implementation of the results. To this purpose, Insurance: Mathematics and Economics publishes high-quality articles of broad international interest, concerned with either the theory of insurance mathematics and quantitative insurance economics or the inventive application of it, including empirical or experimental results. Articles that combine several of these aspects are particularly considered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信