{"title":"Cold Start Prediction and Provisioning Optimization in Serverless Computing Using Deep Learning","authors":"N. Saravana Kumar, S. Selvakumara Samy","doi":"10.1002/cpe.8392","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Serverless computing has emerged as a significant framework for application development, offering benefits such as simplified deployment and enhanced developer productivity. Serverless designs accelerate app development, but user experience and performance are put at risk through the delay during a cold start. In this paper, an optimized concurrent provisioning methodology for the AWS Lambda environment has been proposed along with a cold start prediction technique based on deep learning. Employing historical data and real-time features like timestamp, invoke frequency, cold start indicator, duration, previous cold starts, event type, historical cold starts, time since the last cold start, and consecutive cold starts, an attention-based bi-directional gated recurrent unit (ABiGRU) is used to predict the cold start occurrences with high precision. In the AWS Lambda environment, the proposed DL model was able to predict the cold start likelihood for incoming Lambda invocations very accurately. In addition, the performance of the ABiGRU model is enhanced by hyperparameter tuning using the RMSProp optimizer. The findings of the experiments establish the proposed DL model to perform in the reduction of cold starts compared to the existing approach. Further, the ODL-CSP technique achieves an accuracy of 90.36%, a precision of 91.87%, a recall of 90.42%, an F1_score of 90.28%, and an MCC of 82.28% when applied to the testing dataset. Additionally, the proposed paradigm optimizes Lambdas using provisioned concurrency, similar to a function warmer. The proposed DL paradigm will eliminate cold start times by early deployment of Lambdas so that the ice age of the serverless architecture is eliminated.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 4-5","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8392","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Serverless computing has emerged as a significant framework for application development, offering benefits such as simplified deployment and enhanced developer productivity. Serverless designs accelerate app development, but user experience and performance are put at risk through the delay during a cold start. In this paper, an optimized concurrent provisioning methodology for the AWS Lambda environment has been proposed along with a cold start prediction technique based on deep learning. Employing historical data and real-time features like timestamp, invoke frequency, cold start indicator, duration, previous cold starts, event type, historical cold starts, time since the last cold start, and consecutive cold starts, an attention-based bi-directional gated recurrent unit (ABiGRU) is used to predict the cold start occurrences with high precision. In the AWS Lambda environment, the proposed DL model was able to predict the cold start likelihood for incoming Lambda invocations very accurately. In addition, the performance of the ABiGRU model is enhanced by hyperparameter tuning using the RMSProp optimizer. The findings of the experiments establish the proposed DL model to perform in the reduction of cold starts compared to the existing approach. Further, the ODL-CSP technique achieves an accuracy of 90.36%, a precision of 91.87%, a recall of 90.42%, an F1_score of 90.28%, and an MCC of 82.28% when applied to the testing dataset. Additionally, the proposed paradigm optimizes Lambdas using provisioned concurrency, similar to a function warmer. The proposed DL paradigm will eliminate cold start times by early deployment of Lambdas so that the ice age of the serverless architecture is eliminated.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.