{"title":"HUMAN - Hierarchical Clustering for Unsupervised Anomaly Detection & Interpretation","authors":"Pavol Mulinka, P. Casas, K. Fukuda, L. Kencl","doi":"10.1109/NoF50125.2020.9249194","DOIUrl":null,"url":null,"abstract":"The automatic detection and interpretation of network traffic anomalies through machine learning is a well-known problem, for which no general solution is available. Both supervised and unsupervised (i.e., anomaly detection) approaches require prior knowledge on the monitoring data, either in terms of normal operation profiles or on the specific anomalies to detect. As a consequence, both approaches have clear limitations when it comes to detecting, and in particular interpreting, previously unseen events. We present HUMAN, a general hierarchical-clustering-based approach for unsupervised network traffic analysis, which can both detect and interpret anomalous behaviors in a completely black-box manner, without relying on ground-truth on the system under analysis. HUMAN can detect and interpret complex patterns in the analyzed data, using a structural approach which exploits hierarchical cluster relationships and correlation among features. We describe the building blocks of HUMAN and explain its functioning in detail, using as case study the detection and interpretation of performance issues in major cloud platforms, through the unsupervised analysis of distributed active cloud latency measurements. The HUMAN approach can be applied to the unsupervised analysis of any kind of nested or hierarchically structured multi-dimensional data, showing the potential of hierarchical clustering for general unsupervised data analytics.","PeriodicalId":405626,"journal":{"name":"2020 11th International Conference on Network of the Future (NoF)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th International Conference on Network of the Future (NoF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NoF50125.2020.9249194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The automatic detection and interpretation of network traffic anomalies through machine learning is a well-known problem, for which no general solution is available. Both supervised and unsupervised (i.e., anomaly detection) approaches require prior knowledge on the monitoring data, either in terms of normal operation profiles or on the specific anomalies to detect. As a consequence, both approaches have clear limitations when it comes to detecting, and in particular interpreting, previously unseen events. We present HUMAN, a general hierarchical-clustering-based approach for unsupervised network traffic analysis, which can both detect and interpret anomalous behaviors in a completely black-box manner, without relying on ground-truth on the system under analysis. HUMAN can detect and interpret complex patterns in the analyzed data, using a structural approach which exploits hierarchical cluster relationships and correlation among features. We describe the building blocks of HUMAN and explain its functioning in detail, using as case study the detection and interpretation of performance issues in major cloud platforms, through the unsupervised analysis of distributed active cloud latency measurements. The HUMAN approach can be applied to the unsupervised analysis of any kind of nested or hierarchically structured multi-dimensional data, showing the potential of hierarchical clustering for general unsupervised data analytics.