{"title":"Auto insurance fraud detection: Leveraging cost sensitive and insensitive algorithms for comprehensive analysis","authors":"Meryem Yankol Schalck","doi":"10.1016/j.insmatheco.2025.02.001","DOIUrl":null,"url":null,"abstract":"<div><div>As technology and the economy continue to grow, fraud has a significant negative impact on business and society, and insurance fraud remains an important issue, posing challenges in both detection and prevention. This article provides a direct cost-sensitive learning approaches on enhancing traditional motor insurance fraud detection by leveraging real-world data sets. In this approach, the results are obtained by using the information available at the opening of the claim, FNOL. The data set (FNOL) contains numerical, categorical, and textual variables. The results show that machine learning techniques perform better statistically and can also be more effective than standard approaches in reducing fraud-related costs. Extreme Gradient Boosting (XGB) outperforms both cost-sensitive and cost-insensitive approaches based on performance measures. Our study indicates that a cost-sensitive strategy delivers greater financial benefits than a cost-insensitive approach.</div></div>","PeriodicalId":54974,"journal":{"name":"Insurance Mathematics & Economics","volume":"122 ","pages":"Pages 44-60"},"PeriodicalIF":1.9000,"publicationDate":"2025-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insurance Mathematics & Economics","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167668725000216","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
As technology and the economy continue to grow, fraud has a significant negative impact on business and society, and insurance fraud remains an important issue, posing challenges in both detection and prevention. This article provides a direct cost-sensitive learning approaches on enhancing traditional motor insurance fraud detection by leveraging real-world data sets. In this approach, the results are obtained by using the information available at the opening of the claim, FNOL. The data set (FNOL) contains numerical, categorical, and textual variables. The results show that machine learning techniques perform better statistically and can also be more effective than standard approaches in reducing fraud-related costs. Extreme Gradient Boosting (XGB) outperforms both cost-sensitive and cost-insensitive approaches based on performance measures. Our study indicates that a cost-sensitive strategy delivers greater financial benefits than a cost-insensitive approach.
期刊介绍:
Insurance: Mathematics and Economics publishes leading research spanning all fields of actuarial science research. It appears six times per year and is the largest journal in actuarial science research around the world.
Insurance: Mathematics and Economics is an international academic journal that aims to strengthen the communication between individuals and groups who develop and apply research results in actuarial science. The journal feels a particular obligation to facilitate closer cooperation between those who conduct research in insurance mathematics and quantitative insurance economics, and practicing actuaries who are interested in the implementation of the results. To this purpose, Insurance: Mathematics and Economics publishes high-quality articles of broad international interest, concerned with either the theory of insurance mathematics and quantitative insurance economics or the inventive application of it, including empirical or experimental results. Articles that combine several of these aspects are particularly considered.