基于行为测量距离的有监督和无监督学习欺诈和洗钱检测

2020 IEEE 18th International Conference on Industrial Informatics (INDIN) Pub Date : 2020-07-20 DOI:10.1109/INDIN45582.2020.9442099

Yimin Yang, Min Wu

{"title":"基于行为测量距离的有监督和无监督学习欺诈和洗钱检测","authors":"Yimin Yang, Min Wu","doi":"10.1109/INDIN45582.2020.9442099","DOIUrl":null,"url":null,"abstract":"Money laundering is the process of making large amounts of fund obtained from criminal activities appear to originate from a legitimate source. Fraud occurs when a person or business intentionally deceives another with promises of services or financial benefits that do not exist or were misrepresented. Fraud and Money laundering detections require to analyze abnormal behavioral patterns. To develop a detection model, we present a machine learning-based model which incorporates risk scoring and statistical clustering approaches. Given a customer represented by its values in a set of attributes, we define its Customer Behavior Score based on its percentile rank in each attribute, which measures the behavior of the customer against the median or “normal” customers in the group. The Customer Behavior Score induces a distance, called Behavior Measuring Distance, between any two customers. The k-medoids clustering technique based on the Behavior Measuring Distance is then applied iteratively to classify customers. The key features of the model are that the abnormality of customers' behaviors are measured based on their percentile ranks in their respective classes and that such measurement is dynamically updated based on the reclassification after each iteration during the training. Finally, the model is tested using the country risk data collected from public and internal sources, and the model outcomes are compared against a benchmark model. The experimental results show convergence and effectiveness of the model.","PeriodicalId":185948,"journal":{"name":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Supervised and Unsupervised Learning for Fraud and Money Laundering Detection using Behavior Measuring Distance\",\"authors\":\"Yimin Yang, Min Wu\",\"doi\":\"10.1109/INDIN45582.2020.9442099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Money laundering is the process of making large amounts of fund obtained from criminal activities appear to originate from a legitimate source. Fraud occurs when a person or business intentionally deceives another with promises of services or financial benefits that do not exist or were misrepresented. Fraud and Money laundering detections require to analyze abnormal behavioral patterns. To develop a detection model, we present a machine learning-based model which incorporates risk scoring and statistical clustering approaches. Given a customer represented by its values in a set of attributes, we define its Customer Behavior Score based on its percentile rank in each attribute, which measures the behavior of the customer against the median or “normal” customers in the group. The Customer Behavior Score induces a distance, called Behavior Measuring Distance, between any two customers. The k-medoids clustering technique based on the Behavior Measuring Distance is then applied iteratively to classify customers. The key features of the model are that the abnormality of customers' behaviors are measured based on their percentile ranks in their respective classes and that such measurement is dynamically updated based on the reclassification after each iteration during the training. Finally, the model is tested using the country risk data collected from public and internal sources, and the model outcomes are compared against a benchmark model. The experimental results show convergence and effectiveness of the model.\",\"PeriodicalId\":185948,\"journal\":{\"name\":\"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN45582.2020.9442099\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN45582.2020.9442099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

洗钱是使从犯罪活动中获得的大量资金看起来来自合法来源的过程。当一个人或企业故意用不存在或虚假陈述的服务或经济利益承诺欺骗他人时，就会发生欺诈。欺诈和洗钱的侦查需要分析异常的行为模式。为了开发检测模型，我们提出了一个基于机器学习的模型，该模型结合了风险评分和统计聚类方法。给定由一组属性中的值表示的客户，我们根据其在每个属性中的百分位数排名定义其客户行为得分，该分数衡量客户与组中位数或“正常”客户的行为。顾客行为得分在任意两个顾客之间产生一个距离，称为行为测量距离。然后应用基于行为测量距离的k-介质聚类技术对客户进行迭代分类。该模型的关键特征是基于客户在各自类别中的百分位排名来衡量客户行为的异常程度，并且在训练过程中每次迭代后都会根据重新分类动态更新该测量值。最后，使用从公共和内部来源收集的国家风险数据对模型进行测试，并将模型结果与基准模型进行比较。实验结果表明了该模型的收敛性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Supervised and Unsupervised Learning for Fraud and Money Laundering Detection using Behavior Measuring Distance

Money laundering is the process of making large amounts of fund obtained from criminal activities appear to originate from a legitimate source. Fraud occurs when a person or business intentionally deceives another with promises of services or financial benefits that do not exist or were misrepresented. Fraud and Money laundering detections require to analyze abnormal behavioral patterns. To develop a detection model, we present a machine learning-based model which incorporates risk scoring and statistical clustering approaches. Given a customer represented by its values in a set of attributes, we define its Customer Behavior Score based on its percentile rank in each attribute, which measures the behavior of the customer against the median or “normal” customers in the group. The Customer Behavior Score induces a distance, called Behavior Measuring Distance, between any two customers. The k-medoids clustering technique based on the Behavior Measuring Distance is then applied iteratively to classify customers. The key features of the model are that the abnormality of customers' behaviors are measured based on their percentile ranks in their respective classes and that such measurement is dynamically updated based on the reclassification after each iteration during the training. Finally, the model is tested using the country risk data collected from public and internal sources, and the model outcomes are compared against a benchmark model. The experimental results show convergence and effectiveness of the model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 18th International Conference on Industrial Informatics (INDIN)

自引率

0.00%

发文量