Machine Learning for Achieving Self-* Properties and Seamless Execution of Applications in the Cloud

2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA) Pub Date : 2015-06-11 DOI:10.1109/NCCA.2015.18

P. D. Sanzo, Alessandro Pellegrini, D. Avresky

{"title":"Machine Learning for Achieving Self-* Properties and Seamless Execution of Applications in the Cloud","authors":"P. D. Sanzo, Alessandro Pellegrini, D. Avresky","doi":"10.1109/NCCA.2015.18","DOIUrl":null,"url":null,"abstract":"Software anomalies are recognized as a major problem affecting the performance and availability of many computer systems. Accumulation of anomalies of different nature, such as memory leaks and unterminated threads, may lead the system to both fail or work with suboptimal performance levels. This problem particularly affects web servers, where hosted applications are typically intended to continuously run, thus incrementing the probability, therefore the associated effects, of accumulation of anomalies. Given the unpredictability of occurrence of anomalies, continuous system monitoring would be required to detect possible system failures and/or excessive performance degradation in order to timely start some recovering procedure. In this paper, we present a Machine Learning-based framework for proactive management of client-server applications in the cloud. Through optimized Machine Learning models and continually measuring system features, the framework predicts the remaining time to the occurrence of some unexpected event (system failure, service level agreement violation, etc.) of a virtual machine hosting a server instance of the application. The framework is able to manage virtual machines in the presence of different types anomalies and with different anomaly occurrence patterns. We show the effectiveness of the proposed solution by presenting results of a set of experiments we carried out in the context of a real world-inspired scenario.","PeriodicalId":309782,"journal":{"name":"2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCCA.2015.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

Software anomalies are recognized as a major problem affecting the performance and availability of many computer systems. Accumulation of anomalies of different nature, such as memory leaks and unterminated threads, may lead the system to both fail or work with suboptimal performance levels. This problem particularly affects web servers, where hosted applications are typically intended to continuously run, thus incrementing the probability, therefore the associated effects, of accumulation of anomalies. Given the unpredictability of occurrence of anomalies, continuous system monitoring would be required to detect possible system failures and/or excessive performance degradation in order to timely start some recovering procedure. In this paper, we present a Machine Learning-based framework for proactive management of client-server applications in the cloud. Through optimized Machine Learning models and continually measuring system features, the framework predicts the remaining time to the occurrence of some unexpected event (system failure, service level agreement violation, etc.) of a virtual machine hosting a server instance of the application. The framework is able to manage virtual machines in the presence of different types anomalies and with different anomaly occurrence patterns. We show the effectiveness of the proposed solution by presenting results of a set of experiments we carried out in the context of a real world-inspired scenario.

查看原文本刊更多论文

机器学习实现自我属性和无缝执行的应用程序在云中

软件异常被认为是影响许多计算机系统性能和可用性的主要问题。不同性质的异常(如内存泄漏和未终止的线程)的积累可能导致系统失败或以次优性能水平工作。这个问题特别影响web服务器，其中托管的应用程序通常打算持续运行，从而增加了异常积累的概率，从而增加了相关的影响。考虑到异常发生的不可预测性，需要持续的系统监控来检测可能的系统故障和/或过度的性能下降，以便及时启动一些恢复过程。在本文中，我们提出了一个基于机器学习的框架，用于主动管理云中的客户端-服务器应用程序。通过优化的机器学习模型和持续测量系统特征，该框架预测了托管应用程序服务器实例的虚拟机发生一些意外事件(系统故障、服务水平协议违反等)的剩余时间。该框架能够管理存在不同类型异常和不同异常发生模式的虚拟机。我们通过展示我们在真实世界场景中进行的一组实验的结果来展示所提出解决方案的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA)

自引率

0.00%

发文量