Xiaoyin Wang, Dong-Jun Lan, G. Wang, Xing Fang, Meng Ye, Ying Chen, Qingbo Wang
{"title":"Appliance-Based Autonomic Provisioning Framework for Virtualized Outsourcing Data Center","authors":"Xiaoyin Wang, Dong-Jun Lan, G. Wang, Xing Fang, Meng Ye, Ying Chen, Qingbo Wang","doi":"10.1109/ICAC.2007.6","DOIUrl":"https://doi.org/10.1109/ICAC.2007.6","url":null,"abstract":"As outsourcing data centers emerge to host applications or services from many different organizations and companies, it is critical for data center owners to isolate different applications while dynamically and optimally allocate resources among them. To address this problem, we propose a virtual-appliance-based autonomic resource provisioning framework for large virtualized data centers. Firstly, we present the architecture of the data center with enriched autonomic features. Secondly, we define a non-linear constrained optimization model for dynamic resource provisioning and present its novel analytic solution. Key factors including virtualization overhead and reconfiguration delay are incorporated into the model. Experimental results based on a prototype system demonstrate that system-level performance has been greatly improved by taking advantage of fine-grained server consolidation. Experiments with the impact of switching delay also show the efficiency of the framework through significantly reducing provisioning time.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122897510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Autonomous Return on Investment Analysis of Additional Processing Resources","authors":"Jonathan Wildstrom, P. Stone, E. Witchel","doi":"10.1504/IJAC.2010.033010","DOIUrl":"https://doi.org/10.1504/IJAC.2010.033010","url":null,"abstract":"This paper consider the situation where compute time can be purchased for the database machine of a simple online bookstore, which we model using the standardized TPC-W benchmark. A service level agreement (SLA) defines the value of the system, using throughput, response time, and expected response time as metrics. The autonomous agent must weigh the potential gain (or loss) in value defined by the SLA against the cost of purchasing (or relinquishing) compute time. This paper implement such an autonomous agent on a simulated partitionable system and show that it is possible to outperform many static choices of compute power, over a range of test workloads and resource costs. The agent uses only raw, low-level system statistics, without the need for custom instrumentation of the middleware or operating system.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130431633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluation of Optimization Methods for Network Bottleneck Diagnosis","authors":"A. Beygelzimer, J. Kephart, I. Rish","doi":"10.1109/ICAC.2007.15","DOIUrl":"https://doi.org/10.1109/ICAC.2007.15","url":null,"abstract":"We consider the problem of localizing network performance bottlenecks and evaluate how various optimization techniques developed for reconstructing link delays perform on this decision problem. We provide some practical suggestions on which approach to use in different scenarios.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134313323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time-Sharing Parallel Applications with Performance Isolation and Control","authors":"Bin Lin, Ananth I. Sundararaj, P. Dinda","doi":"10.1109/ICAC.2007.39","DOIUrl":"https://doi.org/10.1109/ICAC.2007.39","url":null,"abstract":"Most parallel machines, such as clusters, are space-shared in order to isolate batch parallel applications from each other and optimize their performance. However, this leads to low utilization or potentially long waiting times. We propose a self-adaptive approach to time-sharing such machines that provides isolation and allows the execution rate of an application to be tightly controlled by the administrator. Our approach combines a periodic real-time scheduler on each node with a global feedback-based control system that governs the local schedulers. We have developed an online system that implements our approach. The system takes as input a target execution rate for each application, and automatically and continuously adjusts the applications' realtime schedules to achieve those rates with proportional CPU utilization. Target rates can be dynamically adjusted. Applications are performance-isolated from each other and from other work that is not using our system. We present an extensive evaluation that shows that the system remains stable with low response times, and that our focus on CPU isolation and control does not come at the significant expense of network I/O, disk I/O, or memory isolation.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129055661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Online Monitoring of Database Structural Deterioration","authors":"T. Hoshino, K. Goda, M. Kitsuregawa","doi":"10.1109/ICAC.2007.29","DOIUrl":"https://doi.org/10.1109/ICAC.2007.29","url":null,"abstract":"The paper proposes a structural deterioration monitor of database, which is an essential building block to realize an autonomic database reorganizer. Experimental results with our prototype show that the monitor can keep track of structural deterioration with high resolution, high accuracy, and low overhead.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122560817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Multi-levels Dictionaries and Singular Value Decomposition Techniques for Autonomic Problem Determination","authors":"H. Chan, T. Kwok","doi":"10.1109/ICAC.2007.4","DOIUrl":"https://doi.org/10.1109/ICAC.2007.4","url":null,"abstract":"An autonomic problem determination system can adapt to changing environments, react to existing or new error condition and predict possible problems. In this report, we propose such a system using dynamic and adaptive multi-levels dictionaries and \"singular value decomposition techniques\" (SVD). Compared to standard SVD, our system uses an iterative method that enables dynamic interaction between events and the current dictionaries with its entries being updated continuously to reflect relative importance of each event, thereby accelerating its convergence. The system captures knowledge in a hierarchical form for complex knowledge representation. It does not require a formal knowledge model or intensive training by examples. It is efficient with sufficient accuracy for autonomic problem determination.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130708211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Saeed Ghanbari, G. Soundararajan, Jin Chen, C. Amza
{"title":"Adaptive Learning of Metric Correlations for Temperature-Aware Database Provisioning","authors":"Saeed Ghanbari, G. Soundararajan, Jin Chen, C. Amza","doi":"10.1109/ICAC.2007.3","DOIUrl":"https://doi.org/10.1109/ICAC.2007.3","url":null,"abstract":"This paper introduces a transparent self-configuring architecture for automatic scaling with temperature awareness in the database tier of a dynamic content Web server. We use a unified approach to achieving the joint objectives of performance, efficient resource usage and avoiding temperature hot-spots in a replicated database cluster. The key novelty in our approach is a lightweight on-line learning method for fast adaptations to bottleneck situations. Our approach is based on deriving a lightweight performance model of the replicated database cluster on the fly. The system trains its own model based on perceived correlations between various system and application metrics and the query latency for the application. The model adjusts itself dynamically to changes in the application workload mix. We use our performance model for query latency pre diction and determining the number of database replicas necessary to meet the incoming load. We adapt by adding the necessary replicas, pro-actively in anticipation of a bottleneck situation and we remove them automatically in underload. Finally, the system adjusts its query scheduling algorithm dynamically in order to avoid temperature hot- spots within the replicated database cluster. We investigate our transparent database provisioning mechanism in the database tier using the TPC-W industry- standard e-commerce benchmark. We demonstrate that our technique provides quality of service in terms of both performance and avoiding hot-spot machines under different load scenarios. We further show that our method is robust to dynamic changes in the workload mix of the application.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131737540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gabriela Jacques-Silva, J. Challenger, Lou Degenaro, J. Giles, R. Wagle
{"title":"Towards Autonomic Fault Recovery in System-S","authors":"Gabriela Jacques-Silva, J. Challenger, Lou Degenaro, J. Giles, R. Wagle","doi":"10.1109/ICAC.2007.40","DOIUrl":"https://doi.org/10.1109/ICAC.2007.40","url":null,"abstract":"System-S is a stream processing infrastructure which enables program fragments to be distributed and connected to form complex applications. There may be potentially tens of thousands of interdependent and heterogeneous program fragments running across thousands of nodes. While the scale and interconnection imply the need for automation to manage the program fragments, the need is intensified because the applications operate on live streaming data and thus need to be highly available. System-S has been designed with components that autonomically manage the program fragments, but the system components themselves are also susceptible to failures which can jeopardize the system and its applications. The work we present addresses the self healing nature of these management components in System-S. In particular, we show how one key component of System-S, the job management orchestrator, can be abruptly terminated and then recover without interrupting any of the running program fragments by reconciling with other autonomous system components. We also describe techniques that we have developed to validate that the system is able to autonomically respond to a wide variety of error conditions including the abrupt termination and recovery of key system components. Finally, we show the performance of the job management orchestrator recovery for a variety of workloads.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129946151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Server-Level Power Control","authors":"C. Lefurgy, Xiaorui Wang, Malcolm S. Allen-Ware","doi":"10.1109/ICAC.2007.35","DOIUrl":"https://doi.org/10.1109/ICAC.2007.35","url":null,"abstract":"We present a technique that controls the peak power consumption of a high-density server by implementing a feedback controller that uses precise, system-level power measurement to periodically select the highest performance state while keeping the system within a fixed power constraint. A control theoretic methodology is applied to systematically design this control loop with analytic assurances of system stability and controller performance, despite unpredictable workloads and running environments. In a real server we are able to control power over a 1 second period to within 1 W. Additionally, we have observed that power over an 8 second period can be controlled to within 0.1 W. We believe that we are the first to demonstrate such precise control of power in a real server. Conventional servers respond to power supply constraint situations by using simple open-loop policies to set a safe performance level in order to limit peak power consumption. We show that closed-loop control can provide higher performance under these conditions and test this technique on an IBM BladeCenter HS20 server. Experimental results demonstrate that closed-loop control provides up to 82% higher application performance compared to open-loop control and up to 17% higher performance compared to a widely used ad-hoc technique.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131225122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fault-Adaptive Control for Robust Performance Management of Computing Systems","authors":"S. Abdelwahed, Nagarajan Kandasamy","doi":"10.1109/ICAC.2007.17","DOIUrl":"https://doi.org/10.1109/ICAC.2007.17","url":null,"abstract":"This paper introduces a fault-adaptive control approach for the robust and reliable performance management of computing systems. Fault adaptation involves the detection and isolation of faults, and then taking appropriate control actions to mitigate the fault effects and maintain control.","PeriodicalId":179923,"journal":{"name":"Fourth International Conference on Autonomic Computing (ICAC'07)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124696891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}