Detecting Fraudulent Insurance Claims Using Random Forests and Synthetic Minority Oversampling Technique

2019 4th International Conference on Information Systems and Computer Networks (ISCON) Pub Date : 2019-11-01 DOI:10.1109/ISCON47742.2019.9036162

Sonakshi Harjai, S. Khatri, Gurinder Singh

{"title":"Detecting Fraudulent Insurance Claims Using Random Forests and Synthetic Minority Oversampling Technique","authors":"Sonakshi Harjai, S. Khatri, Gurinder Singh","doi":"10.1109/ISCON47742.2019.9036162","DOIUrl":null,"url":null,"abstract":"There has been a significant amount of growth in the number of fraudulent activities by the policy-holders over the last couple of years. Deliberately deceiving the insurance providers by omitting facts and hiding details while claiming for insurance has led to significant loss of money and customer value. To keeps these risks under control; a proper framework is required for judiciously monitoring insurance fraud. In this paper, we demonstrate a novel approach for building a machine- learning based auto-insurance fraud detector which will predict fraudulent insurance claims from the dataset of over 15,420 car-claim records. The proposed model is built using synthetic minority oversampling technique (SMOTE) which removes the class imbalance-ness of the dataset. We use random forests classification method to classify the claim records. The data used in our experiment is taken from a publically available auto insurance datasets. The outcomes of our approach were compared with other existing models based on various performance metrics.","PeriodicalId":124412,"journal":{"name":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON47742.2019.9036162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

There has been a significant amount of growth in the number of fraudulent activities by the policy-holders over the last couple of years. Deliberately deceiving the insurance providers by omitting facts and hiding details while claiming for insurance has led to significant loss of money and customer value. To keeps these risks under control; a proper framework is required for judiciously monitoring insurance fraud. In this paper, we demonstrate a novel approach for building a machine- learning based auto-insurance fraud detector which will predict fraudulent insurance claims from the dataset of over 15,420 car-claim records. The proposed model is built using synthetic minority oversampling technique (SMOTE) which removes the class imbalance-ness of the dataset. We use random forests classification method to classify the claim records. The data used in our experiment is taken from a publically available auto insurance datasets. The outcomes of our approach were compared with other existing models based on various performance metrics.

查看原文本刊更多论文

利用随机森林和合成少数过采样技术检测欺诈性保险索赔

在过去几年中，保单持有人的欺诈活动数量有了显著增长。在索赔过程中故意隐瞒事实和细节，欺骗保险公司，造成了重大的金钱损失和客户价值损失。控制这些风险;要审慎地监督保险欺诈，需要一个适当的框架。在本文中，我们展示了一种构建基于机器学习的汽车保险欺诈检测器的新方法，该检测器将从超过15,420个汽车索赔记录的数据集中预测欺诈性保险索赔。该模型采用综合少数派过采样技术(SMOTE)建立，消除了数据集的类不平衡。采用随机森林分类法对理赔记录进行分类。我们实验中使用的数据来自一个公开的汽车保险数据集。基于各种性能指标，将我们方法的结果与其他现有模型进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 4th International Conference on Information Systems and Computer Networks (ISCON)

自引率

0.00%

发文量