{"title":"PPAS-MiCs: Peak-power-aware scheduling of fault-tolerant mixed-criticality systems","authors":"Shayan Shokri , Sepideh Safari , Shaahin Hessabi , Mohsen Ansari","doi":"10.1016/j.suscom.2025.101156","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-core platforms have become the dominant trend in designing Mixed-Criticality Systems (MCSs). The most well-known MCS is the dual-criticality system, which consists of high and low-criticality tasks. With the increase in the number of cores, the occurrence rate of faults has also increased in MCSs. For this reason, employing fault-tolerant techniques has become crucial. Although exploiting fault-tolerant techniques can improve system reliability, it might lead to increasing the temperature of the system beyond safe limits. In this paper, we present peak-power-aware scheduling for MCSs that employs the checkpointing technique while guaranteeing the timing, reliability, and thermal design power (TDP) constraints. In the proposed method, first, the minimum number of checkpoints for each task is calculated and assigned to the different execution sections of the tasks. Afterward, the cores are divided into safety-critical and non-safety-critical pairs, and tasks are mapped to cores and scheduled. It should be noted that this is a preliminary division and does not mean isolating the cores from each other. At each dedicated point in the schedule, if the TDP is violated, tasks are shifted from the last checkpoint until this constraint is not violated. Finally, the existing slack times are exploited to improve the QoS and reduce the average power consumption of the system. The proposed method is compared with the state-of-the-art fault-tolerant techniques, resulting in 35.6% and 36.5% improvement in all scenarios and in feasible scenarios, respectively, while the TDP constraint is not violated.</div></div>","PeriodicalId":48686,"journal":{"name":"Sustainable Computing-Informatics & Systems","volume":"47 ","pages":"Article 101156"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sustainable Computing-Informatics & Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210537925000770","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-core platforms have become the dominant trend in designing Mixed-Criticality Systems (MCSs). The most well-known MCS is the dual-criticality system, which consists of high and low-criticality tasks. With the increase in the number of cores, the occurrence rate of faults has also increased in MCSs. For this reason, employing fault-tolerant techniques has become crucial. Although exploiting fault-tolerant techniques can improve system reliability, it might lead to increasing the temperature of the system beyond safe limits. In this paper, we present peak-power-aware scheduling for MCSs that employs the checkpointing technique while guaranteeing the timing, reliability, and thermal design power (TDP) constraints. In the proposed method, first, the minimum number of checkpoints for each task is calculated and assigned to the different execution sections of the tasks. Afterward, the cores are divided into safety-critical and non-safety-critical pairs, and tasks are mapped to cores and scheduled. It should be noted that this is a preliminary division and does not mean isolating the cores from each other. At each dedicated point in the schedule, if the TDP is violated, tasks are shifted from the last checkpoint until this constraint is not violated. Finally, the existing slack times are exploited to improve the QoS and reduce the average power consumption of the system. The proposed method is compared with the state-of-the-art fault-tolerant techniques, resulting in 35.6% and 36.5% improvement in all scenarios and in feasible scenarios, respectively, while the TDP constraint is not violated.
期刊介绍:
Sustainable computing is a rapidly expanding research area spanning the fields of computer science and engineering, electrical engineering as well as other engineering disciplines. The aim of Sustainable Computing: Informatics and Systems (SUSCOM) is to publish the myriad research findings related to energy-aware and thermal-aware management of computing resource. Equally important is a spectrum of related research issues such as applications of computing that can have ecological and societal impacts. SUSCOM publishes original and timely research papers and survey articles in current areas of power, energy, temperature, and environment related research areas of current importance to readers. SUSCOM has an editorial board comprising prominent researchers from around the world and selects competitively evaluated peer-reviewed papers.