Mustafa Daraghmeh , Yaser Jararweh , Anjali Agarwal
{"title":"Leveraging machine learning and feature engineering for optimal data-driven scaling decision in serverless computing","authors":"Mustafa Daraghmeh , Yaser Jararweh , Anjali Agarwal","doi":"10.1016/j.simpat.2025.103090","DOIUrl":null,"url":null,"abstract":"<div><div>Serverless computing offers scalability and cost-efficiency, but balancing performance and cost remains challenging, particularly in scaling decisions that can lead to cold starts or resource misallocation. This research is motivated by the need to minimize the impact of cold starts and optimize resource utilization in serverless applications by developing intelligent, data-driven scaling decisions. We delve into using machine learning and feature engineering to model and simulate predictions for optimal scaling decisions for Azure Function Apps (AFA). Our focus lies in predicting the ideal timing for provisioning or de-provisioning the Function App’s environment. Using historical invocation data, we applied a sliding window to transform the time-series data into patterns categorized as load or unload classes, considering various target periods. To identify the most effective model, we compared the performance of various baseline models with and without calibration (isotonic and sigmoid) to enhance precision. In addition, we assess multiple feature extraction methods in invocation patterns and explore the use of Principal Component Analysis (PCA) for dimensionality reduction to reduce computation costs. Using the best-identified configurations, we model and simulate the class patterns over time to compare the actual classes with the predicted ones, focusing on memory usage and the costs associated with cold starts. The proposed model is thoroughly evaluated using various metrics under different setups, revealing notable improvements in scaling decisions achieved by applying calibration and feature engineering methods. These findings demonstrate the potential of machine learning for intelligent, data-driven scaling decisions in serverless computing, offering valuable insights for cloud providers to optimize resource allocation and for developers to build more efficient and responsive serverless applications. Specifically, the proposed method can be integrated into serverless platforms to automatically adjust resource provisioning based on predicted workload demands, reducing cold start latency and improving cost-effectiveness.</div></div>","PeriodicalId":49518,"journal":{"name":"Simulation Modelling Practice and Theory","volume":"140 ","pages":"Article 103090"},"PeriodicalIF":3.5000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simulation Modelling Practice and Theory","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569190X25000255","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Serverless computing offers scalability and cost-efficiency, but balancing performance and cost remains challenging, particularly in scaling decisions that can lead to cold starts or resource misallocation. This research is motivated by the need to minimize the impact of cold starts and optimize resource utilization in serverless applications by developing intelligent, data-driven scaling decisions. We delve into using machine learning and feature engineering to model and simulate predictions for optimal scaling decisions for Azure Function Apps (AFA). Our focus lies in predicting the ideal timing for provisioning or de-provisioning the Function App’s environment. Using historical invocation data, we applied a sliding window to transform the time-series data into patterns categorized as load or unload classes, considering various target periods. To identify the most effective model, we compared the performance of various baseline models with and without calibration (isotonic and sigmoid) to enhance precision. In addition, we assess multiple feature extraction methods in invocation patterns and explore the use of Principal Component Analysis (PCA) for dimensionality reduction to reduce computation costs. Using the best-identified configurations, we model and simulate the class patterns over time to compare the actual classes with the predicted ones, focusing on memory usage and the costs associated with cold starts. The proposed model is thoroughly evaluated using various metrics under different setups, revealing notable improvements in scaling decisions achieved by applying calibration and feature engineering methods. These findings demonstrate the potential of machine learning for intelligent, data-driven scaling decisions in serverless computing, offering valuable insights for cloud providers to optimize resource allocation and for developers to build more efficient and responsive serverless applications. Specifically, the proposed method can be integrated into serverless platforms to automatically adjust resource provisioning based on predicted workload demands, reducing cold start latency and improving cost-effectiveness.
期刊介绍:
The journal Simulation Modelling Practice and Theory provides a forum for original, high-quality papers dealing with any aspect of systems simulation and modelling.
The journal aims at being a reference and a powerful tool to all those professionally active and/or interested in the methods and applications of simulation. Submitted papers will be peer reviewed and must significantly contribute to modelling and simulation in general or use modelling and simulation in application areas.
Paper submission is solicited on:
• theoretical aspects of modelling and simulation including formal modelling, model-checking, random number generators, sensitivity analysis, variance reduction techniques, experimental design, meta-modelling, methods and algorithms for validation and verification, selection and comparison procedures etc.;
• methodology and application of modelling and simulation in any area, including computer systems, networks, real-time and embedded systems, mobile and intelligent agents, manufacturing and transportation systems, management, engineering, biomedical engineering, economics, ecology and environment, education, transaction handling, etc.;
• simulation languages and environments including those, specific to distributed computing, grid computing, high performance computers or computer networks, etc.;
• distributed and real-time simulation, simulation interoperability;
• tools for high performance computing simulation, including dedicated architectures and parallel computing.