{"title":"异构云计算框架中具有最小任务故障感知的任务划分模型的能量截止日期优化","authors":"KN Divyaprabha, TSB Sudarshan","doi":"10.1016/j.compeleceng.2025.110438","DOIUrl":null,"url":null,"abstract":"<div><div>The central processing Unit (CPU) and graphical processing unit (GPU) will be used in high-performance computing (HPC) to provide scalable and effective computing paradigms for data-intensive scientific workloads. Nonetheless, energy use is a significant aspect that should be considered due to rising operational costs and green computing standards. Scientific workload scheduling is a challenging task since heterogeneous cloud computing (HCC) infrastructures consume more energy, which raises carbon emissions and lowers the reliability of the infrastructures. Although using the dynamic voltage-frequency scaling (DVFS) approach can improve the energy management of cloud infrastructure, it also decreases dependability and increases the error rate of workload scheduling on a CPU-GPU HCC architecture; thus, reducing task failure and minimizing energy are core issues that the current work addresses. The work first introduces the energy-deadline-aware task scheduling optimization (EDATSO) technique; secondly, it introduces the task-failure minimization-aware optimal scheduling (TFMOS) technique for the execution of scientific workflows. Simulation study demonstrates EDATSO reduces energy usage by 40.3 %, and 33.12 %, reduces makespan by 90.35 %, and 53.56 %, and overhead of additional energies used due to task failures by 95.56 %, 87.59 % as compared to energy minimized scheduling (EMS), multi-objective prioritized workflow scheduling through deep reinforcement learning (MOPWSDRL) for realistic scientific workloads, respectively. Further, TFMOS reduces energy usage by 40.33 %, and 46.4 %, reduces makespan by 90.4 %, and 53.95 %, and overhead of additional energies used due to task failures by 95.58 %, 87.61 % as compared to EMS, MOPWSDRL for realistic scientific workloads, respectively.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"125 ","pages":"Article 110438"},"PeriodicalIF":4.0000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Energy-deadline optimization with minimal task failure aware task partitioning model in heterogeneous cloud computing framework\",\"authors\":\"KN Divyaprabha, TSB Sudarshan\",\"doi\":\"10.1016/j.compeleceng.2025.110438\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The central processing Unit (CPU) and graphical processing unit (GPU) will be used in high-performance computing (HPC) to provide scalable and effective computing paradigms for data-intensive scientific workloads. Nonetheless, energy use is a significant aspect that should be considered due to rising operational costs and green computing standards. Scientific workload scheduling is a challenging task since heterogeneous cloud computing (HCC) infrastructures consume more energy, which raises carbon emissions and lowers the reliability of the infrastructures. Although using the dynamic voltage-frequency scaling (DVFS) approach can improve the energy management of cloud infrastructure, it also decreases dependability and increases the error rate of workload scheduling on a CPU-GPU HCC architecture; thus, reducing task failure and minimizing energy are core issues that the current work addresses. The work first introduces the energy-deadline-aware task scheduling optimization (EDATSO) technique; secondly, it introduces the task-failure minimization-aware optimal scheduling (TFMOS) technique for the execution of scientific workflows. Simulation study demonstrates EDATSO reduces energy usage by 40.3 %, and 33.12 %, reduces makespan by 90.35 %, and 53.56 %, and overhead of additional energies used due to task failures by 95.56 %, 87.59 % as compared to energy minimized scheduling (EMS), multi-objective prioritized workflow scheduling through deep reinforcement learning (MOPWSDRL) for realistic scientific workloads, respectively. Further, TFMOS reduces energy usage by 40.33 %, and 46.4 %, reduces makespan by 90.4 %, and 53.95 %, and overhead of additional energies used due to task failures by 95.58 %, 87.61 % as compared to EMS, MOPWSDRL for realistic scientific workloads, respectively.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"125 \",\"pages\":\"Article 110438\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625003817\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003817","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Energy-deadline optimization with minimal task failure aware task partitioning model in heterogeneous cloud computing framework
The central processing Unit (CPU) and graphical processing unit (GPU) will be used in high-performance computing (HPC) to provide scalable and effective computing paradigms for data-intensive scientific workloads. Nonetheless, energy use is a significant aspect that should be considered due to rising operational costs and green computing standards. Scientific workload scheduling is a challenging task since heterogeneous cloud computing (HCC) infrastructures consume more energy, which raises carbon emissions and lowers the reliability of the infrastructures. Although using the dynamic voltage-frequency scaling (DVFS) approach can improve the energy management of cloud infrastructure, it also decreases dependability and increases the error rate of workload scheduling on a CPU-GPU HCC architecture; thus, reducing task failure and minimizing energy are core issues that the current work addresses. The work first introduces the energy-deadline-aware task scheduling optimization (EDATSO) technique; secondly, it introduces the task-failure minimization-aware optimal scheduling (TFMOS) technique for the execution of scientific workflows. Simulation study demonstrates EDATSO reduces energy usage by 40.3 %, and 33.12 %, reduces makespan by 90.35 %, and 53.56 %, and overhead of additional energies used due to task failures by 95.56 %, 87.59 % as compared to energy minimized scheduling (EMS), multi-objective prioritized workflow scheduling through deep reinforcement learning (MOPWSDRL) for realistic scientific workloads, respectively. Further, TFMOS reduces energy usage by 40.33 %, and 46.4 %, reduces makespan by 90.4 %, and 53.95 %, and overhead of additional energies used due to task failures by 95.58 %, 87.61 % as compared to EMS, MOPWSDRL for realistic scientific workloads, respectively.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.