DBCatcher: A Cloud Database Online Anomaly Detection System based on Indicator Correlation

Guangyu Zhang, Chun-hua Li, Ke Zhou, Li Liu, Ce Zhang, Wancheng Chen, Haotian Fang, Bin Cheng, Jie Yang, Jiashu Xing
{"title":"DBCatcher: A Cloud Database Online Anomaly Detection System based on Indicator Correlation","authors":"Guangyu Zhang, Chun-hua Li, Ke Zhou, Li Liu, Ce Zhang, Wancheng Chen, Haotian Fang, Bin Cheng, Jie Yang, Jiashu Xing","doi":"10.1109/ICDE55515.2023.00091","DOIUrl":null,"url":null,"abstract":"Anomaly detection system plays an important role in maintaining the stability of cloud database. Existing studies mainly focus on significant deviations in multivariate time series, such as a combination of CPU utilization, transactions per second, etc, to detect abnormal issues. Due to the complexity of cloud database structure and functions, these approaches are difficult to achieve a balance among detection performance, detection efficiency and workload adaptability. In this paper, we propose DBCatcher, a cloud database online anomaly detection system based on indicator correlation. Through extensive analysis of real-world cloud database time series, we find the correlations among trends in the same key performance indicators across databases within the same unit, which inspires us to explore a time series correlation measurement method that can efficiently detect abnormal issues. Meanwhile, we design a flexible time window observation mechanism and an adaptive threshold learning policy to minimize misjudgment caused by key performance indicator fluctuations, greatly enhancing the detection performance and workload adaptability. We conduct extensive experiments under real-world and synthetic workloads. Experimental results show that DBCatcher significantly improves the detection performance and detection efficiency compared to existing methods.","PeriodicalId":434744,"journal":{"name":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE55515.2023.00091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Anomaly detection system plays an important role in maintaining the stability of cloud database. Existing studies mainly focus on significant deviations in multivariate time series, such as a combination of CPU utilization, transactions per second, etc, to detect abnormal issues. Due to the complexity of cloud database structure and functions, these approaches are difficult to achieve a balance among detection performance, detection efficiency and workload adaptability. In this paper, we propose DBCatcher, a cloud database online anomaly detection system based on indicator correlation. Through extensive analysis of real-world cloud database time series, we find the correlations among trends in the same key performance indicators across databases within the same unit, which inspires us to explore a time series correlation measurement method that can efficiently detect abnormal issues. Meanwhile, we design a flexible time window observation mechanism and an adaptive threshold learning policy to minimize misjudgment caused by key performance indicator fluctuations, greatly enhancing the detection performance and workload adaptability. We conduct extensive experiments under real-world and synthetic workloads. Experimental results show that DBCatcher significantly improves the detection performance and detection efficiency compared to existing methods.
DBCatcher:基于指标相关性的云数据库在线异常检测系统
异常检测系统对维护云数据库的稳定性起着重要的作用。现有的研究主要集中在多变量时间序列中的显著偏差,如CPU利用率、每秒事务数等的组合,以检测异常问题。由于云数据库结构和功能的复杂性,这些方法很难在检测性能、检测效率和工作负载适应性之间取得平衡。本文提出了一种基于指标相关性的云数据库在线异常检测系统DBCatcher。通过对现实世界云数据库时间序列的广泛分析,我们发现了同一单位内不同数据库中相同关键绩效指标的趋势之间的相关性,这启发了我们探索一种能够有效检测异常问题的时间序列相关性度量方法。同时,我们设计了灵活的时间窗观察机制和自适应阈值学习策略,最大限度地减少了关键性能指标波动带来的误判,大大提高了检测性能和工作负载适应性。我们在真实世界和合成工作负载下进行了广泛的实验。实验结果表明,与现有方法相比,DBCatcher显著提高了检测性能和检测效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信