基于联邦学习的鲁棒协同欺诈交易检测

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) Pub Date : 2021-12-01 DOI:10.1109/ICMLA52953.2021.00064

Delton Myalil, M. Rajan, Manoj M. Apte, S. Lodha

{"title":"基于联邦学习的鲁棒协同欺诈交易检测","authors":"Delton Myalil, M. Rajan, Manoj M. Apte, S. Lodha","doi":"10.1109/ICMLA52953.2021.00064","DOIUrl":null,"url":null,"abstract":"Fraudulent transaction detection is a difficult problem for an individual bank, since the number of fraudulent transactions within a single bank’s records is significantly less compared to the day-to-day regular transactions it processes. Hence, due to this extreme data imbalance, training a classifier is difficult. Also, the model will not be able to learn from different types of fraudulent transactions, which a single bank’s database lacks. Collaboration between banks is the only way to achieve a generalized model, but banks will not share their data with each other due to competition and regulatory restrictions. Federated Learning can be leveraged here to solve this problem. However, in a cross-silo setting like this, the data held by different banks will be different in terms of distribution and hence follows a non-IID scenario across the participants’ datasets. Moreover, we are considering that a minority of the banks could be malicious and will try to disrupt this federated learning process. Hence the problem is to perform federated learning in a non-IID setting with active adversaries involved, which is a new research area under fraud detection. We perform non-IID partitioning of the transaction dataset to simulate 10 banks or silos. Then, for benchmark, we perform federated averaging with a subset of the banks set as malicious. Furthermore, we propose a novel algorithm - Epsilon Cluster Selection, a filter-based aggregation technique to recognize and prevent malicious nodes from contributing to the global model being trained. We apply this algorithm to the same setting with malicious banks and compare the results.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"49 1","pages":"373-378"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Robust Collaborative Fraudulent Transaction Detection using Federated Learning\",\"authors\":\"Delton Myalil, M. Rajan, Manoj M. Apte, S. Lodha\",\"doi\":\"10.1109/ICMLA52953.2021.00064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fraudulent transaction detection is a difficult problem for an individual bank, since the number of fraudulent transactions within a single bank’s records is significantly less compared to the day-to-day regular transactions it processes. Hence, due to this extreme data imbalance, training a classifier is difficult. Also, the model will not be able to learn from different types of fraudulent transactions, which a single bank’s database lacks. Collaboration between banks is the only way to achieve a generalized model, but banks will not share their data with each other due to competition and regulatory restrictions. Federated Learning can be leveraged here to solve this problem. However, in a cross-silo setting like this, the data held by different banks will be different in terms of distribution and hence follows a non-IID scenario across the participants’ datasets. Moreover, we are considering that a minority of the banks could be malicious and will try to disrupt this federated learning process. Hence the problem is to perform federated learning in a non-IID setting with active adversaries involved, which is a new research area under fraud detection. We perform non-IID partitioning of the transaction dataset to simulate 10 banks or silos. Then, for benchmark, we perform federated averaging with a subset of the banks set as malicious. Furthermore, we propose a novel algorithm - Epsilon Cluster Selection, a filter-based aggregation technique to recognize and prevent malicious nodes from contributing to the global model being trained. We apply this algorithm to the same setting with malicious banks and compare the results.\",\"PeriodicalId\":6750,\"journal\":{\"name\":\"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"volume\":\"49 1\",\"pages\":\"373-378\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA52953.2021.00064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA52953.2021.00064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

欺诈性交易检测对于单个银行来说是一个难题，因为单个银行记录中的欺诈性交易数量与其处理的日常常规交易相比要少得多。因此，由于这种极端的数据不平衡，训练分类器是困难的。此外，该模型将无法从不同类型的欺诈交易中学习，这是单个银行数据库所缺乏的。银行之间的合作是实现通用模型的唯一途径，但由于竞争和监管限制，银行不会相互共享数据。这里可以利用联邦学习来解决这个问题。然而，在这样的跨筒仓设置中，不同银行持有的数据在分布方面会有所不同，因此在参与者的数据集中遵循非iid场景。此外，我们正在考虑少数银行可能是恶意的，并将试图破坏这种联合学习过程。因此，如何在主动攻击者参与的非iid环境下进行联邦学习是欺诈检测领域的一个新的研究方向。我们对事务数据集执行非iid分区，以模拟10家银行或竖井。然后，对于基准测试，我们将银行的一个子集设置为恶意，执行联邦平均。此外，我们提出了一种新的算法- Epsilon聚类选择，这是一种基于过滤器的聚合技术，用于识别和防止恶意节点对正在训练的全局模型做出贡献。我们将该算法应用于与恶意银行相同的设置并比较结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Robust Collaborative Fraudulent Transaction Detection using Federated Learning

Fraudulent transaction detection is a difficult problem for an individual bank, since the number of fraudulent transactions within a single bank’s records is significantly less compared to the day-to-day regular transactions it processes. Hence, due to this extreme data imbalance, training a classifier is difficult. Also, the model will not be able to learn from different types of fraudulent transactions, which a single bank’s database lacks. Collaboration between banks is the only way to achieve a generalized model, but banks will not share their data with each other due to competition and regulatory restrictions. Federated Learning can be leveraged here to solve this problem. However, in a cross-silo setting like this, the data held by different banks will be different in terms of distribution and hence follows a non-IID scenario across the participants’ datasets. Moreover, we are considering that a minority of the banks could be malicious and will try to disrupt this federated learning process. Hence the problem is to perform federated learning in a non-IID setting with active adversaries involved, which is a new research area under fraud detection. We perform non-IID partitioning of the transaction dataset to simulate 10 banks or silos. Then, for benchmark, we perform federated averaging with a subset of the banks set as malicious. Furthermore, we propose a novel algorithm - Epsilon Cluster Selection, a filter-based aggregation technique to recognize and prevent malicious nodes from contributing to the global model being trained. We apply this algorithm to the same setting with malicious banks and compare the results.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

自引率

0.00%

发文量