{"title":"Evaluating Serverless Machine Learning Performance on Google Cloud Run","authors":"Prerana Khatiwada, Pranjal Dhakal","doi":"arxiv-2406.16250","DOIUrl":null,"url":null,"abstract":"End-users can get functions-as-a-service from serverless platforms, which\npromise lower hosting costs, high availability, fault tolerance, and dynamic\nflexibility for hosting individual functions known as microservices. Machine\nlearning tools are seen to be reliably useful, and the services created using\nthese tools are in increasing demand on a large scale. The serverless platforms\nare uniquely suited for hosting these machine learning services to be used for\nlarge-scale applications. These platforms are well known for their cost\nefficiency, fault tolerance, resource scaling, robust APIs for communication,\nand global reach. However, machine learning services are different from the\nweb-services in that these serverless platforms were originally designed to\nhost web services. We aimed to understand how these serverless platforms handle\nmachine learning workloads with our study. We examine machine learning\nperformance on one of the serverless platforms - Google Cloud Run, which is a\nGPU-less infrastructure that is not designed for machine learning application\ndeployment.","PeriodicalId":501333,"journal":{"name":"arXiv - CS - Operating Systems","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Operating Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.16250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
End-users can get functions-as-a-service from serverless platforms, which
promise lower hosting costs, high availability, fault tolerance, and dynamic
flexibility for hosting individual functions known as microservices. Machine
learning tools are seen to be reliably useful, and the services created using
these tools are in increasing demand on a large scale. The serverless platforms
are uniquely suited for hosting these machine learning services to be used for
large-scale applications. These platforms are well known for their cost
efficiency, fault tolerance, resource scaling, robust APIs for communication,
and global reach. However, machine learning services are different from the
web-services in that these serverless platforms were originally designed to
host web services. We aimed to understand how these serverless platforms handle
machine learning workloads with our study. We examine machine learning
performance on one of the serverless platforms - Google Cloud Run, which is a
GPU-less infrastructure that is not designed for machine learning application
deployment.