Balancing function performance and cluster load in serverless computing: A reinforcement learning solution

IF 8 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Menglin Zhou , Bingbing Zheng , Li Pan , Shijun Liu
{"title":"Balancing function performance and cluster load in serverless computing: A reinforcement learning solution","authors":"Menglin Zhou ,&nbsp;Bingbing Zheng ,&nbsp;Li Pan ,&nbsp;Shijun Liu","doi":"10.1016/j.jnca.2025.104299","DOIUrl":null,"url":null,"abstract":"<div><div>Serverless computing, as an emerging cloud computing service model, enables developers to focus on business logic without concerning underlying resource management by decomposing applications into fine-grained functions that execute on demand. However, in heterogeneous server cluster environments, the bursty and transient nature of function requests presents significant resource scheduling challenges. To ensure the performance of function execution, newly created function instances are often scheduled to nodes with abundant resources. This leads to resource allocation imbalances under high loads, which could potentially trigger node failures. In this paper we model function scheduling as an optimization problem that balances performance and load. We then propose a scheduling method based on the PPO algorithm, which guides decisions by analyzing node load and performance metrics in real time. For validation, we conducted experiments on the OpenFaaS platform using both real and simulated traces. The experimental results demonstrate that our method not only effectively reduces the risks associated with load imbalance but also achieves improvements in function performance.</div></div>","PeriodicalId":54784,"journal":{"name":"Journal of Network and Computer Applications","volume":"243 ","pages":"Article 104299"},"PeriodicalIF":8.0000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Network and Computer Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1084804525001961","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Serverless computing, as an emerging cloud computing service model, enables developers to focus on business logic without concerning underlying resource management by decomposing applications into fine-grained functions that execute on demand. However, in heterogeneous server cluster environments, the bursty and transient nature of function requests presents significant resource scheduling challenges. To ensure the performance of function execution, newly created function instances are often scheduled to nodes with abundant resources. This leads to resource allocation imbalances under high loads, which could potentially trigger node failures. In this paper we model function scheduling as an optimization problem that balances performance and load. We then propose a scheduling method based on the PPO algorithm, which guides decisions by analyzing node load and performance metrics in real time. For validation, we conducted experiments on the OpenFaaS platform using both real and simulated traces. The experimental results demonstrate that our method not only effectively reduces the risks associated with load imbalance but also achieves improvements in function performance.
无服务器计算中平衡功能性能和集群负载:一种强化学习解决方案
无服务器计算作为一种新兴的云计算服务模型,通过将应用程序分解为按需执行的细粒度功能,使开发人员能够专注于业务逻辑,而无需关注底层资源管理。然而,在异构服务器集群环境中,功能请求的突发和瞬态特性带来了重大的资源调度挑战。为了保证函数执行的性能,通常将新创建的函数实例调度到资源丰富的节点。这将导致高负载下的资源分配不平衡,从而可能触发节点故障。本文将函数调度建模为一个平衡性能和负载的优化问题。然后,我们提出了一种基于PPO算法的调度方法,该方法通过实时分析节点负载和性能指标来指导决策。为了验证,我们在OpenFaaS平台上使用真实和模拟痕迹进行了实验。实验结果表明,我们的方法不仅有效地降低了负载不平衡带来的风险,而且在功能性能上也得到了提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Network and Computer Applications
Journal of Network and Computer Applications 工程技术-计算机:跨学科应用
CiteScore
21.50
自引率
3.40%
发文量
142
审稿时长
37 days
期刊介绍: The Journal of Network and Computer Applications welcomes research contributions, surveys, and notes in all areas relating to computer networks and applications thereof. Sample topics include new design techniques, interesting or novel applications, components or standards; computer networks with tools such as WWW; emerging standards for internet protocols; Wireless networks; Mobile Computing; emerging computing models such as cloud computing, grid computing; applications of networked systems for remote collaboration and telemedicine, etc. The journal is abstracted and indexed in Scopus, Engineering Index, Web of Science, Science Citation Index Expanded and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信