在分布式系统中实现高效的多粒度异常检测

IF 2.3 Q2 COMPUTER SCIENCE, THEORY & METHODS
Array Pub Date : 2023-11-30 DOI:10.1016/j.array.2023.100330
Chao Tu , Ming Chen , Liwen Zhang , Long Zhao , Di Wu , Ziyang Yue
{"title":"在分布式系统中实现高效的多粒度异常检测","authors":"Chao Tu ,&nbsp;Ming Chen ,&nbsp;Liwen Zhang ,&nbsp;Long Zhao ,&nbsp;Di Wu ,&nbsp;Ziyang Yue","doi":"10.1016/j.array.2023.100330","DOIUrl":null,"url":null,"abstract":"<div><p>Distributed systems often consist of a large number of computing and data nodes, which makes it both significant and challenging to detect anomalies efficiently and accurately in distributed systems. Generally, we not only need to determine whether an anomaly has occurred at a certain time (the time level anomaly), but also need to detect whether anomalies occur in a node (the node level anomaly) and which key performance indicators (KPIs) are anomalies (the KPI level anomaly), that is, to perform multi-granular anomaly detection in distributed systems. However, most existing algorithms only focus on the time level anomalies in centralized systems. For distributed systems, a simple way is to train a model for each node and then detect anomalies independently. An obvious disadvantage is that the cost of model inferring is unacceptable in practice. Therefore, we propose a <strong>M</strong>ulti-<strong>G</strong>ranular <strong>A</strong>nomaly <strong>D</strong>etection (MGAD) framework that utilizes a tree structure to perform anomaly detection hierarchically from the node level to time and KPI levels, which greatly reduces the cost of model inferring. Specifically, at the time level, we propose a novel model named <strong>M</strong>asked <strong>S</strong>liding <strong>S</strong>patial-<strong>T</strong>emporal <strong>A</strong>dversarial <strong>N</strong>etwork (MS2TAN) that considers spatial and temporal dependencies simultaneously. Extensive experiments with real-world data offer insights into the performance of the proposals, showing that MGAD is at least 5<span><math><mo>×</mo></math></span> faster for inferring when compared with the baselines.</p></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590005623000553/pdfft?md5=a8b79cf32296c7cea873bc6dab0e3b2b&pid=1-s2.0-S2590005623000553-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Towards efficient multi-granular anomaly detection in distributed systems\",\"authors\":\"Chao Tu ,&nbsp;Ming Chen ,&nbsp;Liwen Zhang ,&nbsp;Long Zhao ,&nbsp;Di Wu ,&nbsp;Ziyang Yue\",\"doi\":\"10.1016/j.array.2023.100330\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Distributed systems often consist of a large number of computing and data nodes, which makes it both significant and challenging to detect anomalies efficiently and accurately in distributed systems. Generally, we not only need to determine whether an anomaly has occurred at a certain time (the time level anomaly), but also need to detect whether anomalies occur in a node (the node level anomaly) and which key performance indicators (KPIs) are anomalies (the KPI level anomaly), that is, to perform multi-granular anomaly detection in distributed systems. However, most existing algorithms only focus on the time level anomalies in centralized systems. For distributed systems, a simple way is to train a model for each node and then detect anomalies independently. An obvious disadvantage is that the cost of model inferring is unacceptable in practice. Therefore, we propose a <strong>M</strong>ulti-<strong>G</strong>ranular <strong>A</strong>nomaly <strong>D</strong>etection (MGAD) framework that utilizes a tree structure to perform anomaly detection hierarchically from the node level to time and KPI levels, which greatly reduces the cost of model inferring. Specifically, at the time level, we propose a novel model named <strong>M</strong>asked <strong>S</strong>liding <strong>S</strong>patial-<strong>T</strong>emporal <strong>A</strong>dversarial <strong>N</strong>etwork (MS2TAN) that considers spatial and temporal dependencies simultaneously. Extensive experiments with real-world data offer insights into the performance of the proposals, showing that MGAD is at least 5<span><math><mo>×</mo></math></span> faster for inferring when compared with the baselines.</p></div>\",\"PeriodicalId\":8417,\"journal\":{\"name\":\"Array\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2023-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2590005623000553/pdfft?md5=a8b79cf32296c7cea873bc6dab0e3b2b&pid=1-s2.0-S2590005623000553-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Array\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2590005623000553\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005623000553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

摘要

分布式系统通常由大量计算和数据节点组成,因此在分布式系统中高效、准确地检测异常既重要又具有挑战性。一般来说,我们不仅需要确定某个时间是否发生了异常(时间级异常),还需要检测某个节点是否发生了异常(节点级异常)以及哪些关键性能指标(KPI)是异常的(KPI 级异常),即在分布式系统中进行多粒度异常检测。然而,现有算法大多只关注集中式系统中的时间级异常。对于分布式系统,一种简单的方法是为每个节点训练一个模型,然后独立检测异常。一个明显的缺点是,模型推断的成本在实践中是不可接受的。因此,我们提出了一个多粒度异常检测(MGAD)框架,利用树形结构从节点级到时间级和关键绩效指标级分层执行异常检测,从而大大降低了模型推断的成本。具体来说,在时间层面,我们提出了一种名为 "屏蔽滑动时空对抗网络"(MS2TAN)的新型模型,该模型同时考虑了空间和时间依赖性。利用真实世界数据进行的大量实验深入分析了这些建议的性能,结果表明,与基线相比,MGAD 的推断速度至少快 5 倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards efficient multi-granular anomaly detection in distributed systems

Distributed systems often consist of a large number of computing and data nodes, which makes it both significant and challenging to detect anomalies efficiently and accurately in distributed systems. Generally, we not only need to determine whether an anomaly has occurred at a certain time (the time level anomaly), but also need to detect whether anomalies occur in a node (the node level anomaly) and which key performance indicators (KPIs) are anomalies (the KPI level anomaly), that is, to perform multi-granular anomaly detection in distributed systems. However, most existing algorithms only focus on the time level anomalies in centralized systems. For distributed systems, a simple way is to train a model for each node and then detect anomalies independently. An obvious disadvantage is that the cost of model inferring is unacceptable in practice. Therefore, we propose a Multi-Granular Anomaly Detection (MGAD) framework that utilizes a tree structure to perform anomaly detection hierarchically from the node level to time and KPI levels, which greatly reduces the cost of model inferring. Specifically, at the time level, we propose a novel model named Masked Sliding Spatial-Temporal Adversarial Network (MS2TAN) that considers spatial and temporal dependencies simultaneously. Extensive experiments with real-world data offer insights into the performance of the proposals, showing that MGAD is at least 5× faster for inferring when compared with the baselines.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Array
Array Computer Science-General Computer Science
CiteScore
4.40
自引率
0.00%
发文量
93
审稿时长
45 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信