Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift

Gaurav Saxena, Mohammad Rahman, Naresh Chainani, Chunbin Lin, George C. Caragea, Fahim Chowdhury, Ryan Marcus, Tim Kraska, I. Pandis, Balakrishnan Narayanaswamy
{"title":"Auto-WLM: Amazon Redshift中机器学习增强的工作负载管理","authors":"Gaurav Saxena, Mohammad Rahman, Naresh Chainani, Chunbin Lin, George C. Caragea, Fahim Chowdhury, Ryan Marcus, Tim Kraska, I. Pandis, Balakrishnan Narayanaswamy","doi":"10.1145/3555041.3589677","DOIUrl":null,"url":null,"abstract":"There has been a lot of excitement around using machine learning to improve the performance and usability of database systems. However, few of these techniques have actually been used in the critical path of customer-facing database services. In this paper, we describe Auto-WLM, a machine learning based automatic workload manager currently used in production in Amazon Redshift. Auto-WLM is an example of how machine learning can improve the performance of large data-warehouses in practice and at scale. Auto-WLM intelligently schedules workloads to maximize throughput and horizontally scales clusters in response to workload spikes. While traditional heuristic-based workload management requires a lot of manual tuning (e.g. of the concurrency level, memory allocated to queries etc.) for each specific workload, Auto-WLM does this tuning automatically and as a result is able to quickly adapt and react to workload changes and demand spikes. At its core, Auto-WLM uses locally-trained query performance models to predict the query execution time and memory needs for each query, and uses this to make intelligent scheduling decisions. Currently, Auto-WLM makes millions of decisions every day, and constantly optimizes the performance for each individual Amazon Redshift cluster. In this paper, we will describe the advantages and challenges of implementing and deploying Auto-WLM, as well as outline areas of research that may be of interest to those in the \"ML for systems'' community with an eye for practicality.","PeriodicalId":161812,"journal":{"name":"Companion of the 2023 International Conference on Management of Data","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift\",\"authors\":\"Gaurav Saxena, Mohammad Rahman, Naresh Chainani, Chunbin Lin, George C. Caragea, Fahim Chowdhury, Ryan Marcus, Tim Kraska, I. Pandis, Balakrishnan Narayanaswamy\",\"doi\":\"10.1145/3555041.3589677\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There has been a lot of excitement around using machine learning to improve the performance and usability of database systems. However, few of these techniques have actually been used in the critical path of customer-facing database services. In this paper, we describe Auto-WLM, a machine learning based automatic workload manager currently used in production in Amazon Redshift. Auto-WLM is an example of how machine learning can improve the performance of large data-warehouses in practice and at scale. Auto-WLM intelligently schedules workloads to maximize throughput and horizontally scales clusters in response to workload spikes. While traditional heuristic-based workload management requires a lot of manual tuning (e.g. of the concurrency level, memory allocated to queries etc.) for each specific workload, Auto-WLM does this tuning automatically and as a result is able to quickly adapt and react to workload changes and demand spikes. 
At its core, Auto-WLM uses locally-trained query performance models to predict the query execution time and memory needs for each query, and uses this to make intelligent scheduling decisions. Currently, Auto-WLM makes millions of decisions every day, and constantly optimizes the performance for each individual Amazon Redshift cluster. In this paper, we will describe the advantages and challenges of implementing and deploying Auto-WLM, as well as outline areas of research that may be of interest to those in the \\\"ML for systems'' community with an eye for practicality.\",\"PeriodicalId\":161812,\"journal\":{\"name\":\"Companion of the 2023 International Conference on Management of Data\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Companion of the 2023 International Conference on Management of Data\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3555041.3589677\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2023 International Conference on Management of Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3555041.3589677","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1

Abstract

There has been a lot of excitement around using machine learning to improve the performance and usability of database systems. However, few of these techniques have actually been used in the critical path of customer-facing database services. In this paper, we describe Auto-WLM, a machine learning based automatic workload manager currently used in production in Amazon Redshift. Auto-WLM is an example of how machine learning can improve the performance of large data-warehouses in practice and at scale. Auto-WLM intelligently schedules workloads to maximize throughput and horizontally scales clusters in response to workload spikes. While traditional heuristic-based workload management requires a lot of manual tuning (e.g. of the concurrency level, memory allocated to queries etc.) for each specific workload, Auto-WLM does this tuning automatically and as a result is able to quickly adapt and react to workload changes and demand spikes. At its core, Auto-WLM uses locally-trained query performance models to predict the query execution time and memory needs for each query, and uses this to make intelligent scheduling decisions. Currently, Auto-WLM makes millions of decisions every day, and constantly optimizes the performance for each individual Amazon Redshift cluster. In this paper, we will describe the advantages and challenges of implementing and deploying Auto-WLM, as well as outline areas of research that may be of interest to those in the "ML for systems" community with an eye for practicality.
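
To make the core idea concrete, below is a minimal, self-contained sketch of how model-predicted execution time and memory could drive scheduling decisions. It is purely illustrative and not Amazon Redshift's implementation: the Query and ToyScheduler names, the fixed memory budget, the greedy admission loop, and the shortest-predicted-runtime ordering are all assumptions made for this example; the production system also performs concurrency scaling and trains its performance models locally on each cluster, which the sketch omits.

# Illustrative sketch only (hypothetical names, not Redshift internals):
# model-predicted runtime and memory drive query admission decisions.
from dataclasses import dataclass, field
from typing import List
import heapq

@dataclass(order=True)
class Query:
    predicted_runtime_s: float                          # model-predicted execution time
    predicted_memory_mb: float = field(compare=False)   # model-predicted memory need
    sql: str = field(compare=False)

class ToyScheduler:
    """Admit queued queries while their predicted memory fits a fixed budget,
    preferring queries with the shortest predicted runtime."""

    def __init__(self, memory_budget_mb: float):
        self.memory_budget_mb = memory_budget_mb
        self._queue: List[Query] = []

    def submit(self, query: Query) -> None:
        heapq.heappush(self._queue, query)               # ordered by predicted runtime

    def admit_batch(self) -> List[Query]:
        admitted, deferred = [], []
        remaining_mb = self.memory_budget_mb
        while self._queue:
            q = heapq.heappop(self._queue)
            if q.predicted_memory_mb <= remaining_mb:
                admitted.append(q)                       # run now
                remaining_mb -= q.predicted_memory_mb
            else:
                deferred.append(q)                       # wait until memory frees up
        for q in deferred:
            heapq.heappush(self._queue, q)
        return admitted

# Example: the two small queries fit the 1000 MB budget, the 900 MB query is deferred.
wlm = ToyScheduler(memory_budget_mb=1000)
wlm.submit(Query(2.0, 300, "SELECT a FROM t1"))
wlm.submit(Query(0.5, 200, "SELECT b FROM t2"))
wlm.submit(Query(30.0, 900, "SELECT * FROM big_fact JOIN dims"))
print([q.sql for q in wlm.admit_batch()])

Ordering the queue by predicted runtime approximates shortest-job-first scheduling, which tends to improve overall throughput and short-query latency when runtime predictions are reasonably accurate; a badly mispredicted long query can still delay others, which is one reason accurate, locally-trained performance models matter in practice.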