高频市场中的深度无监督异常检测

Q1 Mathematics

Journal of Finance and Data Science Pub Date : 2024-04-19 DOI:10.1016/j.jfds.2024.100129

Cédric Poutré , Didier Chételat , Manuel Morales

{"title":"高频市场中的深度无监督异常检测","authors":"Cédric Poutré , Didier Chételat , Manuel Morales","doi":"10.1016/j.jfds.2024.100129","DOIUrl":null,"url":null,"abstract":"<div><p>Inspired by recent advances in the deep learning literature, this article introduces a novel hybrid anomaly detection framework specifically designed for limit order book (LOB) data. A modified Transformer autoencoder architecture is proposed to learn rich temporal LOB subsequence representations, which eases the separability of normal and fraudulent time series. A dissimilarity function is then learned in the representation space to characterize normal LOB behavior, enabling the detection of any anomalous subsequences out-of-sample. We also develop a complete trade-based manipulation simulation methodology able to generate a variety of scenarios derived from actual trade–based fraud cases. The complete framework is tested on LOB data of five NASDAQ stocks in which we randomly insert synthetic quote stuffing, layering, and pump-and-dump manipulations. We show that the proposed asset-independent approach achieves new state-of-the-art fraud detection performance, without requiring any prior knowledge of manipulation patterns.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"10 ","pages":"Article 100129"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S240591882400014X/pdfft?md5=438a8d321f83d33f31a8f9a74e366e8e&pid=1-s2.0-S240591882400014X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Deep unsupervised anomaly detection in high-frequency markets\",\"authors\":\"Cédric Poutré , Didier Chételat , Manuel Morales\",\"doi\":\"10.1016/j.jfds.2024.100129\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Inspired by recent advances in the deep learning literature, this article introduces a novel hybrid anomaly detection framework specifically designed for limit order book (LOB) data. A modified Transformer autoencoder architecture is proposed to learn rich temporal LOB subsequence representations, which eases the separability of normal and fraudulent time series. A dissimilarity function is then learned in the representation space to characterize normal LOB behavior, enabling the detection of any anomalous subsequences out-of-sample. We also develop a complete trade-based manipulation simulation methodology able to generate a variety of scenarios derived from actual trade–based fraud cases. The complete framework is tested on LOB data of five NASDAQ stocks in which we randomly insert synthetic quote stuffing, layering, and pump-and-dump manipulations. We show that the proposed asset-independent approach achieves new state-of-the-art fraud detection performance, without requiring any prior knowledge of manipulation patterns.</p></div>\",\"PeriodicalId\":36340,\"journal\":{\"name\":\"Journal of Finance and Data Science\",\"volume\":\"10 \",\"pages\":\"Article 100129\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S240591882400014X/pdfft?md5=438a8d321f83d33f31a8f9a74e366e8e&pid=1-s2.0-S240591882400014X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Finance and Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S240591882400014X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Finance and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S240591882400014X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

摘要

受深度学习文献最新进展的启发，本文介绍了一种专为限价订单簿（LOB）数据设计的新型混合异常检测框架。本文提出了一种改进的 Transformer 自动编码器架构，用于学习丰富的时间 LOB 子序列表示，从而简化了正常时间序列和欺诈性时间序列的可分离性。然后，在表征空间中学习异质性函数，以描述正常的 LOB 行为，从而能够在样本外检测到任何异常子序列。我们还开发了一套完整的基于贸易的操纵模拟方法，能够生成源自实际贸易欺诈案例的各种情景。我们在五只纳斯达克股票的 LOB 数据上测试了完整的框架，并在其中随机插入了合成报价填充、分层和抽水操纵。我们的测试结果表明，所提出的独立于资产的方法无需事先了解操纵模式，就能实现最先进的欺诈检测性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep unsupervised anomaly detection in high-frequency markets

Inspired by recent advances in the deep learning literature, this article introduces a novel hybrid anomaly detection framework specifically designed for limit order book (LOB) data. A modified Transformer autoencoder architecture is proposed to learn rich temporal LOB subsequence representations, which eases the separability of normal and fraudulent time series. A dissimilarity function is then learned in the representation space to characterize normal LOB behavior, enabling the detection of any anomalous subsequences out-of-sample. We also develop a complete trade-based manipulation simulation methodology able to generate a variety of scenarios derived from actual trade–based fraud cases. The complete framework is tested on LOB data of five NASDAQ stocks in which we randomly insert synthetic quote stuffing, layering, and pump-and-dump manipulations. We show that the proposed asset-independent approach achieves new state-of-the-art fraud detection performance, without requiring any prior knowledge of manipulation patterns.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Finance and Data Science Mathematics-Statistics and Probability

CiteScore

3.90

自引率

0.00%

发文量

审稿时长

30 days