{"title":"基于偏相关的多层系统有效度量选择与异常检测","authors":"O. J. A. Pinno, S. Correa, A. Santos, K. Cardoso","doi":"10.1109/SBRC.2015.39","DOIUrl":null,"url":null,"abstract":"Large-scale data centers allow organizations to gain access to computer resources without incurring high costs in purchasing and maintaining IT infrastructure. In these environments, due to the large number of hardware and software involved, anomaly detection is difficult but essential for service provisioning. Computer systems hosted in data centers usually involve multiple layers and provide a large set of metrics for tracking their operation. The analysis of all available metrics generates drawbacks associated with communication, storage and processing. A more efficient way to support anomaly detection and minimize the cost of monitoring is to use stable statistical correlations among metrics that reflect the system state. We present the strategies PCTN, MST-PCTN and PCTN-MST, based on partial correlation, for selecting metrics to support anomaly detection in multi-tier systems. We evaluate the proposed strategies using an e-commerce, Web transaction benchmark. Results show that the PCTN-MST strategy allowed the construction of a monitoring network with 8% less metrics than that obtained with MST and achieved a fault coverage up to 10% larger.","PeriodicalId":307266,"journal":{"name":"2015 XXXIII Brazilian Symposium on Computer Networks and Distributed Systems","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using Partial Correlation for Effective Metric Selection and Anomaly Detection in Multi-tier System\",\"authors\":\"O. J. A. Pinno, S. Correa, A. Santos, K. Cardoso\",\"doi\":\"10.1109/SBRC.2015.39\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large-scale data centers allow organizations to gain access to computer resources without incurring high costs in purchasing and maintaining IT infrastructure. In these environments, due to the large number of hardware and software involved, anomaly detection is difficult but essential for service provisioning. Computer systems hosted in data centers usually involve multiple layers and provide a large set of metrics for tracking their operation. The analysis of all available metrics generates drawbacks associated with communication, storage and processing. A more efficient way to support anomaly detection and minimize the cost of monitoring is to use stable statistical correlations among metrics that reflect the system state. We present the strategies PCTN, MST-PCTN and PCTN-MST, based on partial correlation, for selecting metrics to support anomaly detection in multi-tier systems. We evaluate the proposed strategies using an e-commerce, Web transaction benchmark. Results show that the PCTN-MST strategy allowed the construction of a monitoring network with 8% less metrics than that obtained with MST and achieved a fault coverage up to 10% larger.\",\"PeriodicalId\":307266,\"journal\":{\"name\":\"2015 XXXIII Brazilian Symposium on Computer Networks and Distributed Systems\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 XXXIII Brazilian Symposium on Computer Networks and Distributed Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SBRC.2015.39\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 XXXIII Brazilian Symposium on Computer Networks and Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBRC.2015.39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Partial Correlation for Effective Metric Selection and Anomaly Detection in Multi-tier System
Large-scale data centers allow organizations to gain access to computer resources without incurring high costs in purchasing and maintaining IT infrastructure. In these environments, due to the large number of hardware and software involved, anomaly detection is difficult but essential for service provisioning. Computer systems hosted in data centers usually involve multiple layers and provide a large set of metrics for tracking their operation. The analysis of all available metrics generates drawbacks associated with communication, storage and processing. A more efficient way to support anomaly detection and minimize the cost of monitoring is to use stable statistical correlations among metrics that reflect the system state. We present the strategies PCTN, MST-PCTN and PCTN-MST, based on partial correlation, for selecting metrics to support anomaly detection in multi-tier systems. We evaluate the proposed strategies using an e-commerce, Web transaction benchmark. Results show that the PCTN-MST strategy allowed the construction of a monitoring network with 8% less metrics than that obtained with MST and achieved a fault coverage up to 10% larger.