{"title":"CatBoost for Fraud Detection in Financial Transactions","authors":"Yeming Chen, Xinyuan Han","doi":"10.1109/ICCECE51280.2021.9342475","DOIUrl":null,"url":null,"abstract":"Financial fraud is an ever growing menace with severe consequences in the financial industry. Machine learning plays an active role in the fraud detection in financial transactions. However, fraud detection is still a challenging problem due to two major reasons. First, either fraudulent or non-fraudulent behaviors change fast and constantly. Secondly, currently online transactions happen so fast, which require detection algorithms to be efficient and accurate. This paper introduces a machine learning method based on CatBoost for fraud detection. To improve detection accuracy, we apply feature engineering to generate highly important features and feed them into CatBoost for classification. Another key contribution of our work is using memory compression to speed up detection. The performance of our method is evaluated on a publicly IEEE-CIS Fraud dataset, provided by Kaggle competition platform. The experimental results demonstrate that our model based on CatBoost has obtained optimal accuracy of 0.983.","PeriodicalId":229425,"journal":{"name":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51280.2021.9342475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Financial fraud is an ever growing menace with severe consequences in the financial industry. Machine learning plays an active role in the fraud detection in financial transactions. However, fraud detection is still a challenging problem due to two major reasons. First, either fraudulent or non-fraudulent behaviors change fast and constantly. Secondly, currently online transactions happen so fast, which require detection algorithms to be efficient and accurate. This paper introduces a machine learning method based on CatBoost for fraud detection. To improve detection accuracy, we apply feature engineering to generate highly important features and feed them into CatBoost for classification. Another key contribution of our work is using memory compression to speed up detection. The performance of our method is evaluated on a publicly IEEE-CIS Fraud dataset, provided by Kaggle competition platform. The experimental results demonstrate that our model based on CatBoost has obtained optimal accuracy of 0.983.