{"title":"Detecting Fraudulent Insurance Claims Using Random Forests and Synthetic Minority Oversampling Technique","authors":"Sonakshi Harjai, S. Khatri, Gurinder Singh","doi":"10.1109/ISCON47742.2019.9036162","DOIUrl":null,"url":null,"abstract":"There has been a significant amount of growth in the number of fraudulent activities by the policy-holders over the last couple of years. Deliberately deceiving the insurance providers by omitting facts and hiding details while claiming for insurance has led to significant loss of money and customer value. To keeps these risks under control; a proper framework is required for judiciously monitoring insurance fraud. In this paper, we demonstrate a novel approach for building a machine- learning based auto-insurance fraud detector which will predict fraudulent insurance claims from the dataset of over 15,420 car-claim records. The proposed model is built using synthetic minority oversampling technique (SMOTE) which removes the class imbalance-ness of the dataset. We use random forests classification method to classify the claim records. The data used in our experiment is taken from a publically available auto insurance datasets. The outcomes of our approach were compared with other existing models based on various performance metrics.","PeriodicalId":124412,"journal":{"name":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON47742.2019.9036162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
There has been a significant amount of growth in the number of fraudulent activities by the policy-holders over the last couple of years. Deliberately deceiving the insurance providers by omitting facts and hiding details while claiming for insurance has led to significant loss of money and customer value. To keeps these risks under control; a proper framework is required for judiciously monitoring insurance fraud. In this paper, we demonstrate a novel approach for building a machine- learning based auto-insurance fraud detector which will predict fraudulent insurance claims from the dataset of over 15,420 car-claim records. The proposed model is built using synthetic minority oversampling technique (SMOTE) which removes the class imbalance-ness of the dataset. We use random forests classification method to classify the claim records. The data used in our experiment is taken from a publically available auto insurance datasets. The outcomes of our approach were compared with other existing models based on various performance metrics.