Research and application of random forest model in mining automobile insurance fraud

Yaqi Li, Chun Yan, W. Liu, Maozhen Li
Published in: 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)
Publication date: 2016-08-01
DOI: 10.1109/FSKD.2016.7603443 (https://doi.org/10.1109/FSKD.2016.7603443)
Citations: 20

Abstract

Automobile insurance fraud is spreading worldwide, and detecting it is attracting growing public attention. Because real automobile insurance claims data are class-imbalanced and large in volume, this study uses real data from an automobile insurance company to build a random forest fraud mining model grounded in the theory of automobile insurance fraud mining. The data were preprocessed to screen the indicators, and the importance of each input variable to the output variable was analyzed. The model's error was also analyzed, and the method was verified empirically. The empirical results show that, compared with the traditional model, the random forest fraud mining model is suitable for large and imbalanced data sets. It is better suited to classifying and predicting automobile insurance claims data and to mining fraud rules, and it achieves better accuracy and robustness.
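
The abstract names three ingredients: a random forest classifier, an importance score for each input variable, and an error analysis on imbalanced data. The sketch below illustrates those ideas in dependency-free Python. It is not the authors' implementation. The data are synthetic, the three features are hypothetical (column 0 carries the signal, columns 1 and 2 are noise), and the balanced bootstrap is only one assumed way of handling class imbalance, since the abstract does not say how the paper handles it. Error is estimated out-of-bag (OOB), and importance is measured as the drop in accuracy after shuffling one column.

```python
import random
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values()) if n else 0.0


def build_tree(X, y, rng, depth=0, max_depth=3, mtry=2):
    """Grow one tree; every node searches only a random subset of mtry features."""
    majority = Counter(y).most_common(1)[0][0]
    if depth == max_depth or len(set(y)) == 1:
        return majority  # leaf: predicted class
    best, best_score = None, gini(y)
    for f in rng.sample(range(len(X[0])), mtry):
        for t in sorted({row[f] for row in X}):
            left = [i for i, row in enumerate(X) if row[f] <= t]
            right = [i for i, row in enumerate(X) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right])) / len(y)
            if score < best_score:
                best, best_score = (f, t, left, right), score
    if best is None:
        return majority  # no split reduces impurity
    f, t, left, right = best

    def grow(idx):
        return build_tree([X[i] for i in idx], [y[i] for i in idx],
                          rng, depth + 1, max_depth, mtry)

    return (f, t, grow(left), grow(right))


def predict_tree(node, row):
    """Walk from the root to a leaf and return the leaf's class."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node


def fit_forest(X, y, n_trees=25, seed=0):
    """Assumed imbalance handling: each bag draws equally many rows per class."""
    rng = random.Random(seed)
    by_class = {c: [i for i, label in enumerate(y) if label == c] for c in set(y)}
    m = min(len(ids) for ids in by_class.values())
    forest = []
    for _ in range(n_trees):
        bag = [rng.choice(ids) for ids in by_class.values() for _ in range(m)]
        tree = build_tree([X[i] for i in bag], [y[i] for i in bag], rng)
        forest.append((tree, set(bag)))  # keep the bag to compute OOB error
    return forest


def predict(forest, row):
    """Majority vote over all trees."""
    return Counter(predict_tree(t, row) for t, _ in forest).most_common(1)[0][0]


def oob_error(forest, X, y):
    """Error rate where each row is voted on only by trees that never saw it."""
    wrong = counted = 0
    for i, row in enumerate(X):
        votes = Counter(predict_tree(t, row) for t, bag in forest if i not in bag)
        if votes:
            counted += 1
            wrong += votes.most_common(1)[0][0] != y[i]
    return wrong / counted


def permutation_importance(forest, X, y, seed=0):
    """Accuracy drop on the training rows after shuffling one column at a time."""
    rng = random.Random(seed)

    def accuracy(data):
        return sum(predict(forest, r) == t for r, t in zip(data, y)) / len(y)

    base, scores = accuracy(X), []
    for f in range(len(X[0])):
        col = [row[f] for row in X]
        rng.shuffle(col)
        shuffled = [row[:f] + [v] + row[f + 1:] for row, v in zip(X, col)]
        scores.append(base - accuracy(shuffled))
    return scores


# Synthetic, hypothetical data: 10% "fraud" (label 1), 90% legitimate (label 0).
data_rng = random.Random(42)
y = [1 if i % 10 == 0 else 0 for i in range(300)]
X = [[data_rng.randint(7, 9) if label else data_rng.randint(0, 6),
      data_rng.randint(0, 9), data_rng.randint(0, 9)] for label in y]

forest = fit_forest(X, y)
err = oob_error(forest, X, y)
importances = permutation_importance(forest, X, y)
print("OOB error:", err)
print("Permutation importance per feature:", importances)
```

On real claims data, a maintained library implementation would normally replace this hand-rolled forest, and importance would be measured on held-out or OOB rows rather than on the training rows as done here for brevity.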