{"title":"Reinforcement Learning-Based Adaptive Load Balancing for Dynamic Cloud Environments","authors":"Kavish Chawla","doi":"arxiv-2409.04896","DOIUrl":null,"url":null,"abstract":"Efficient load balancing is crucial in cloud computing environments to ensure\noptimal resource utilization, minimize response times, and prevent server\noverload. Traditional load balancing algorithms, such as round-robin or least\nconnections, are often static and unable to adapt to the dynamic and\nfluctuating nature of cloud workloads. In this paper, we propose a novel\nadaptive load balancing framework using Reinforcement Learning (RL) to address\nthese challenges. The RL-based approach continuously learns and improves the\ndistribution of tasks by observing real-time system performance and making\ndecisions based on traffic patterns and resource availability. Our framework is\ndesigned to dynamically reallocate tasks to minimize latency and ensure\nbalanced resource usage across servers. Experimental results show that the\nproposed RL-based load balancer outperforms traditional algorithms in terms of\nresponse time, resource utilization, and adaptability to changing workloads.\nThese findings highlight the potential of AI-driven solutions for enhancing the\nefficiency and scalability of cloud infrastructures.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.04896","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Efficient load balancing is crucial in cloud computing environments to ensure
optimal resource utilization, minimize response times, and prevent server
overload. Traditional load balancing algorithms, such as round-robin or least
connections, are often static and unable to adapt to the dynamic and
fluctuating nature of cloud workloads. In this paper, we propose a novel
adaptive load balancing framework using Reinforcement Learning (RL) to address
these challenges. The RL-based approach continuously learns and improves the
distribution of tasks by observing real-time system performance and making
decisions based on traffic patterns and resource availability. Our framework is
designed to dynamically reallocate tasks to minimize latency and ensure
balanced resource usage across servers. Experimental results show that the
proposed RL-based load balancer outperforms traditional algorithms in terms of
response time, resource utilization, and adaptability to changing workloads.
These findings highlight the potential of AI-driven solutions for enhancing the
efficiency and scalability of cloud infrastructures.