Santiago Lozano, Javier Fernandez, Jesus Carretero
Title: Applying hypervisor-based fault tolerance techniques to safety-critical embedded systems
Journal: Microprocessors and Microsystems, vol. 121, Article 105255
DOI: 10.1016/j.micpro.2026.105255
Published: 2026-03-01 (Epub 2026-02-14)
Impact factor: 2.6; JCR: Q3 (Computer Science, Hardware & Architecture)
URL: https://www.sciencedirect.com/science/article/pii/S0141933126000128
Citation count: 0
Abstract
The main objective of this work is to design and implement a space use case that applies Hypervisor-Based Fault Tolerance (HBFT) mechanisms to redundant software applications running in independent, mutually isolated virtual machines, and to study the effect of the HBFT mechanism on system safety and reliability. To test the developed fault tolerance mechanism, we use a real use case from space systems: the ESA Near InfraRed (NIR) HAWAII 2-RG Data Processing Algorithms benchmarking software. After an exhaustive fault injection campaign, the evaluation results show that our HBFT for critical real-time embedded systems detects and covers all failures observed in critical real-time tasks, and recovers failed virtual machines or containers from a degraded state to full operation.
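The abstract does not say how the redundant virtual machines are compared or recovered, so the following is only a minimal illustrative sketch of one common pattern: run the same task on several isolated replicas, take a majority vote, and report which replicas disagreed or crashed so that a supervisor could restart them. The function names `vote` and `run_redundant`, the use of majority voting, and the modelling of replicas as plain Python callables are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical model: each "replica" stands in for one isolated virtual machine.
Replica = Callable[[Sequence[int]], int]


def vote(outputs: List[Optional[int]]) -> Tuple[Optional[int], List[int]]:
    """Return (majority value, indices of replicas that failed or disagreed).

    A value of None in `outputs` means the replica crashed.
    If no strict majority exists, every replica is reported as suspect.
    """
    valid = [o for o in outputs if o is not None]
    if not valid:
        return None, list(range(len(outputs)))
    value, count = Counter(valid).most_common(1)[0]
    if count <= len(outputs) // 2:
        return None, list(range(len(outputs)))
    failed = [i for i, o in enumerate(outputs) if o != value]
    return value, failed


def run_redundant(replicas: List[Replica], data: Sequence[int]) -> Tuple[Optional[int], List[int]]:
    """Run the same task on every replica and vote on the results."""
    outputs: List[Optional[int]] = []
    for replica in replicas:
        try:
            outputs.append(replica(data))
        except Exception:
            # A crash is treated as a detected failure of that replica.
            outputs.append(None)
    return vote(outputs)


def healthy(x: Sequence[int]) -> int:
    return sum(x)


def corrupted(x: Sequence[int]) -> int:
    # Simulates an injected fault that silently alters the result.
    return sum(x) + 1


def crashing(x: Sequence[int]) -> int:
    # Simulates an injected fault that brings the replica down.
    raise RuntimeError("replica down")
```

In this sketch, the list of failed indices is what a supervising layer would use to decide which replica to restart; the restart itself is outside the scope of the example.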
About the journal:
Microprocessors and Microsystems: Embedded Hardware Design (MICPRO) is a journal covering all design and architectural aspects related to embedded systems hardware. This includes embedded system hardware platforms ranging from custom hardware, through reconfigurable systems and application-specific processors, to general-purpose embedded processors. Special emphasis is put on novel complex embedded architectures, such as systems on chip (SoC), systems on a programmable/reconfigurable chip (SoPC) and multi-processor systems on a chip (MPSoC), as well as their memory and communication methods and structures, such as network-on-chip (NoC).
Design automation of such systems, including methodologies, techniques, flows and tools for their design, as well as novel designs of hardware components, falls within the scope of this journal. Novel cyber-physical applications that use embedded systems are also central to this journal. While software is not the main focus of this journal, methods of hardware/software co-design, as well as application restructuring and mapping to embedded hardware platforms that consider the interplay between software and hardware components with emphasis on hardware, are also within the journal's scope.