Mario Di Mauro;Walter Cerroni;Fabio Postiglione;Massimo Tornatore;Kishor S. Trivedi
{"title":"Reliability and Availability in Virtualized Networks: A Survey on Standards, Modeling Approaches, and Research Challenges","authors":"Mario Di Mauro;Walter Cerroni;Fabio Postiglione;Massimo Tornatore;Kishor S. Trivedi","doi":"10.1109/COMST.2026.3670039","DOIUrl":null,"url":null,"abstract":"Virtualized networks are built on the principle of replacing bulky and rather static hardware-based functions with software-based, virtualized instances of those functions, enabling more agile and cost-effective communication infrastructures. However, this shift brings new challenges for ensuring reliability and availability due to increased dependencies among system components introduced by virtualization technologies. Reliability, i.e., the ability of a system to perform regularly under specified conditions, and availability, i.e., the probability of a system of being ready to use, are critical requirements that must be guaranteed to maintain seamless network operations. Accurate modeling of these aspects is crucial for designing robust, fault-tolerant virtualized systems that can withstand service disruptions, ensuring continuous user access. Accordingly, this survey focuses on reliability and availability attributes of virtualized networks from a modeling perspective. We first introduce the Network Function Virtualization (NFV) architecture and relevant definitions, followed by a review of the European Telecommunications Standards Institute (ETSI) standardization efforts. We then explore key modeling formalisms and illustrate their use in characterizing failure and repair behaviors. A survey of related literature and supporting software tools is provided, along with a discussion on lessons learned and open research challenges to guide future work in designing fault-tolerant NFV systems.","PeriodicalId":55029,"journal":{"name":"IEEE Communications Surveys and Tutorials","volume":"28 ","pages":"5121-5158"},"PeriodicalIF":34.4000,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11418773","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Surveys and Tutorials","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11418773/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Virtualized networks are built on the principle of replacing bulky and rather static hardware-based functions with software-based, virtualized instances of those functions, enabling more agile and cost-effective communication infrastructures. However, this shift brings new challenges for ensuring reliability and availability due to increased dependencies among system components introduced by virtualization technologies. Reliability, i.e., the ability of a system to perform regularly under specified conditions, and availability, i.e., the probability of a system of being ready to use, are critical requirements that must be guaranteed to maintain seamless network operations. Accurate modeling of these aspects is crucial for designing robust, fault-tolerant virtualized systems that can withstand service disruptions, ensuring continuous user access. Accordingly, this survey focuses on reliability and availability attributes of virtualized networks from a modeling perspective. We first introduce the Network Function Virtualization (NFV) architecture and relevant definitions, followed by a review of the European Telecommunications Standards Institute (ETSI) standardization efforts. We then explore key modeling formalisms and illustrate their use in characterizing failure and repair behaviors. A survey of related literature and supporting software tools is provided, along with a discussion on lessons learned and open research challenges to guide future work in designing fault-tolerant NFV systems.
期刊介绍:
IEEE Communications Surveys & Tutorials is an online journal published by the IEEE Communications Society for tutorials and surveys covering all aspects of the communications field. Telecommunications technology is progressing at a rapid pace, and the IEEE Communications Society is committed to providing researchers and other professionals the information and tools to stay abreast. IEEE Communications Surveys and Tutorials focuses on integrating and adding understanding to the existing literature on communications, putting results in context. Whether searching for in-depth information about a familiar area or an introduction into a new area, IEEE Communications Surveys & Tutorials aims to be the premier source of peer-reviewed, comprehensive tutorials and surveys, and pointers to further sources. IEEE Communications Surveys & Tutorials publishes only articles exclusively written for IEEE Communications Surveys & Tutorials and go through a rigorous review process before their publication in the quarterly issues.
A tutorial article in the IEEE Communications Surveys & Tutorials should be designed to help the reader to become familiar with and learn something specific about a chosen topic. In contrast, the term survey, as applied here, is defined to mean a survey of the literature. A survey article in IEEE Communications Surveys & Tutorials should provide a comprehensive review of developments in a selected area, covering its development from its inception to its current state and beyond, and illustrating its development through liberal citations from the literature. Both tutorials and surveys should be tutorial in nature and should be written in a style comprehensible to readers outside the specialty of the article.