{"title":"VECA: Reliable and Confidential Resource Clustering for Volunteer Edge-Cloud Computing","authors":"Hemanth Sai Yeddulapalli, Mauro Lemus Alarcon, Upasana Roy, Roshan Lal Neupane, Durbek Gafurov, Motahare Mounesan, Saptarshi Debroy, Prasad Calyam","doi":"arxiv-2409.03057","DOIUrl":null,"url":null,"abstract":"Volunteer Edge-Cloud (VEC) computing has a significant potential to support\nscientific workflows in user communities contributing volunteer edge nodes.\nHowever, managing heterogeneous and intermittent resources to support\nmachine/deep learning (ML/DL) based workflows poses challenges in resource\ngovernance for reliability, and confidentiality for model/data privacy\nprotection. There is a need for approaches to handle the volatility of\nvolunteer edge node availability, and also to scale the confidential\ndata-intensive workflow execution across a large number of VEC nodes. In this\npaper, we present VECA, a reliable and confidential VEC resource clustering\nsolution featuring three-fold methods tailored for executing ML/DL-based\nscientific workflows on VEC resources. Firstly, a capacity-based clustering\napproach enhances system reliability and minimizes VEC node search latency.\nSecondly, a novel two-phase, globally distributed scheduling scheme optimizes\njob allocation based on node attributes and using time-series-based Recurrent\nNeural Networks. Lastly, the integration of confidential computing ensures\nprivacy preservation of the scientific workflows, where model and data\ninformation are not shared with VEC resources providers. We evaluate VECA in a\nFunction-as-a-Service (FaaS) cloud testbed that features OpenFaaS and MicroK8S\nto support two ML/DL-based scientific workflows viz., G2P-Deep (bioinformatics)\nand PAS-ML (health informatics). Results from tested experiments demonstrate\nthat our proposed VECA approach outperforms state-of-the-art methods;\nespecially VECA exhibits a two-fold reduction in VEC node search latency and\nover 20% improvement in productivity rates following execution failures\ncompared to the next best method.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":"59 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Volunteer Edge-Cloud (VEC) computing has a significant potential to support
scientific workflows in user communities contributing volunteer edge nodes.
However, managing heterogeneous and intermittent resources to support
machine/deep learning (ML/DL) based workflows poses challenges in resource
governance for reliability, and confidentiality for model/data privacy
protection. There is a need for approaches to handle the volatility of
volunteer edge node availability, and also to scale the confidential
data-intensive workflow execution across a large number of VEC nodes. In this
paper, we present VECA, a reliable and confidential VEC resource clustering
solution featuring three-fold methods tailored for executing ML/DL-based
scientific workflows on VEC resources. Firstly, a capacity-based clustering
approach enhances system reliability and minimizes VEC node search latency.
Secondly, a novel two-phase, globally distributed scheduling scheme optimizes
job allocation based on node attributes and using time-series-based Recurrent
Neural Networks. Lastly, the integration of confidential computing ensures
privacy preservation of the scientific workflows, where model and data
information are not shared with VEC resources providers. We evaluate VECA in a
Function-as-a-Service (FaaS) cloud testbed that features OpenFaaS and MicroK8S
to support two ML/DL-based scientific workflows viz., G2P-Deep (bioinformatics)
and PAS-ML (health informatics). Results from tested experiments demonstrate
that our proposed VECA approach outperforms state-of-the-art methods;
especially VECA exhibits a two-fold reduction in VEC node search latency and
over 20% improvement in productivity rates following execution failures
compared to the next best method.