Smruti Rekha Swain;Deepika Saxena;Jatinder Kumar;Ashutosh Kumar Singh;Chung-Nan Lee
{"title":"面向可持续云环境的智能滞留者流量管理框架","authors":"Smruti Rekha Swain;Deepika Saxena;Jatinder Kumar;Ashutosh Kumar Singh;Chung-Nan Lee","doi":"10.1109/TSUSC.2024.3393357","DOIUrl":null,"url":null,"abstract":"Large-scale computing systems in the modern era distribute tasks into smaller units that can be executed simultaneously to speed up job completion and decrease energy usage. However, cloud computing systems encounter a significant challenge called the Long Tail problem, where a small subset of slow-performing tasks hinders the overall progress of parallel job execution. This behavior leads to longer service response times and reduced system efficiency. This paper introduces a novel approach called Stochastic Gradient Descent with Momentum-driven Neural Network to analyze and classify heterogeneous tasks as either stragglers or non-stragglers. The straggler tasks are further categorized into Resource Hunter and Long-Tail stragglers based on their specific resource requirements. A traffic management policy is implemented to schedule and assign resources among user job requests, considering the task category, to achieve parallelism and improve sustainability within the cloud infrastructure. Extensive simulations are conducted using the Google Cluster Dataset (GCD) to assess the effectiveness of the proposed framework. The results obtained from these simulations are then compared to state-of-the-art techniques. The experimental findings demonstrate significant reductions in power consumption, carbon emissions, active servers, conflicting servers, and VM migration up to 55.16%, 49.76%, 35%, 25.7%, and 87.29%, respectively. Moreover, there has been an enhancement in resource utilization by up to 78.31%, accompanied by a decrease in execution time of up to 67.74%.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"10 1","pages":"82-94"},"PeriodicalIF":3.0000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Intelligent Straggler Traffic Management Framework for Sustainable Cloud Environments\",\"authors\":\"Smruti Rekha Swain;Deepika Saxena;Jatinder Kumar;Ashutosh Kumar Singh;Chung-Nan Lee\",\"doi\":\"10.1109/TSUSC.2024.3393357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large-scale computing systems in the modern era distribute tasks into smaller units that can be executed simultaneously to speed up job completion and decrease energy usage. However, cloud computing systems encounter a significant challenge called the Long Tail problem, where a small subset of slow-performing tasks hinders the overall progress of parallel job execution. This behavior leads to longer service response times and reduced system efficiency. This paper introduces a novel approach called Stochastic Gradient Descent with Momentum-driven Neural Network to analyze and classify heterogeneous tasks as either stragglers or non-stragglers. The straggler tasks are further categorized into Resource Hunter and Long-Tail stragglers based on their specific resource requirements. A traffic management policy is implemented to schedule and assign resources among user job requests, considering the task category, to achieve parallelism and improve sustainability within the cloud infrastructure. Extensive simulations are conducted using the Google Cluster Dataset (GCD) to assess the effectiveness of the proposed framework. The results obtained from these simulations are then compared to state-of-the-art techniques. The experimental findings demonstrate significant reductions in power consumption, carbon emissions, active servers, conflicting servers, and VM migration up to 55.16%, 49.76%, 35%, 25.7%, and 87.29%, respectively. Moreover, there has been an enhancement in resource utilization by up to 78.31%, accompanied by a decrease in execution time of up to 67.74%.\",\"PeriodicalId\":13268,\"journal\":{\"name\":\"IEEE Transactions on Sustainable Computing\",\"volume\":\"10 1\",\"pages\":\"82-94\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Sustainable Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10508125/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Sustainable Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10508125/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
An Intelligent Straggler Traffic Management Framework for Sustainable Cloud Environments
Large-scale computing systems in the modern era distribute tasks into smaller units that can be executed simultaneously to speed up job completion and decrease energy usage. However, cloud computing systems encounter a significant challenge called the Long Tail problem, where a small subset of slow-performing tasks hinders the overall progress of parallel job execution. This behavior leads to longer service response times and reduced system efficiency. This paper introduces a novel approach called Stochastic Gradient Descent with Momentum-driven Neural Network to analyze and classify heterogeneous tasks as either stragglers or non-stragglers. The straggler tasks are further categorized into Resource Hunter and Long-Tail stragglers based on their specific resource requirements. A traffic management policy is implemented to schedule and assign resources among user job requests, considering the task category, to achieve parallelism and improve sustainability within the cloud infrastructure. Extensive simulations are conducted using the Google Cluster Dataset (GCD) to assess the effectiveness of the proposed framework. The results obtained from these simulations are then compared to state-of-the-art techniques. The experimental findings demonstrate significant reductions in power consumption, carbon emissions, active servers, conflicting servers, and VM migration up to 55.16%, 49.76%, 35%, 25.7%, and 87.29%, respectively. Moreover, there has been an enhancement in resource utilization by up to 78.31%, accompanied by a decrease in execution time of up to 67.74%.