Cold Start Prediction and Provisioning Optimization in Serverless Computing Using Deep Learning

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Concurrency and Computation-Practice & Experience Pub Date : 2025-02-11 DOI:10.1002/cpe.8392

N. Saravana Kumar, S. Selvakumara Samy

{"title":"Cold Start Prediction and Provisioning Optimization in Serverless Computing Using Deep Learning","authors":"N. Saravana Kumar, S. Selvakumara Samy","doi":"10.1002/cpe.8392","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Serverless computing has emerged as a significant framework for application development, offering benefits such as simplified deployment and enhanced developer productivity. Serverless designs accelerate app development, but user experience and performance are put at risk through the delay during a cold start. In this paper, an optimized concurrent provisioning methodology for the AWS Lambda environment has been proposed along with a cold start prediction technique based on deep learning. Employing historical data and real-time features like timestamp, invoke frequency, cold start indicator, duration, previous cold starts, event type, historical cold starts, time since the last cold start, and consecutive cold starts, an attention-based bi-directional gated recurrent unit (ABiGRU) is used to predict the cold start occurrences with high precision. In the AWS Lambda environment, the proposed DL model was able to predict the cold start likelihood for incoming Lambda invocations very accurately. In addition, the performance of the ABiGRU model is enhanced by hyperparameter tuning using the RMSProp optimizer. The findings of the experiments establish the proposed DL model to perform in the reduction of cold starts compared to the existing approach. Further, the ODL-CSP technique achieves an accuracy of 90.36%, a precision of 91.87%, a recall of 90.42%, an F1_score of 90.28%, and an MCC of 82.28% when applied to the testing dataset. Additionally, the proposed paradigm optimizes Lambdas using provisioned concurrency, similar to a function warmer. The proposed DL paradigm will eliminate cold start times by early deployment of Lambdas so that the ice age of the serverless architecture is eliminated.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 4-5","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8392","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Serverless computing has emerged as a significant framework for application development, offering benefits such as simplified deployment and enhanced developer productivity. Serverless designs accelerate app development, but user experience and performance are put at risk through the delay during a cold start. In this paper, an optimized concurrent provisioning methodology for the AWS Lambda environment has been proposed along with a cold start prediction technique based on deep learning. Employing historical data and real-time features like timestamp, invoke frequency, cold start indicator, duration, previous cold starts, event type, historical cold starts, time since the last cold start, and consecutive cold starts, an attention-based bi-directional gated recurrent unit (ABiGRU) is used to predict the cold start occurrences with high precision. In the AWS Lambda environment, the proposed DL model was able to predict the cold start likelihood for incoming Lambda invocations very accurately. In addition, the performance of the ABiGRU model is enhanced by hyperparameter tuning using the RMSProp optimizer. The findings of the experiments establish the proposed DL model to perform in the reduction of cold starts compared to the existing approach. Further, the ODL-CSP technique achieves an accuracy of 90.36%, a precision of 91.87%, a recall of 90.42%, an F1_score of 90.28%, and an MCC of 82.28% when applied to the testing dataset. Additionally, the proposed paradigm optimizes Lambdas using provisioned concurrency, similar to a function warmer. The proposed DL paradigm will eliminate cold start times by early deployment of Lambdas so that the ice age of the serverless architecture is eliminated.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Concurrency and Computation-Practice & Experience 工程技术-计算机：理论方法

CiteScore

5.00

自引率

10.00%

发文量

664

审稿时长

9.6 months

期刊介绍： Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.