{"title":"NetRevert: rollback recovery in SDN","authors":"Ying Zhang, N. Beheshti, Ravi Manghirmalani","doi":"10.1145/2620728.2620779","DOIUrl":null,"url":null,"abstract":"Recognizing the inevitability of human error and hardware failures, which can significantly hamper the computer system performance, the capability of checkpointing and rollback recovery has been proposed and widely used in servers and distributed systems. These facilities allow fast recovery from failures, software bugs, and misconfigurations. Though promising, they are rarely, if ever, provided for networking systems, thus the network operators still rely on ad-hoc and error-prone processes to recover from errors. In this work, we propose a checkpoint and rollback-recovery system for Software-Define Networking (SDN) systems, exploiting SDN's simple abstraction, network-wide view, and direct control properties. We propose an approach to efficiently identify the network-wide consistent states for rollback while preserving correctness.","PeriodicalId":309136,"journal":{"name":"Proceedings of the third workshop on Hot topics in software defined networking","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the third workshop on Hot topics in software defined networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2620728.2620779","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
Recognizing the inevitability of human error and hardware failures, which can significantly hamper the computer system performance, the capability of checkpointing and rollback recovery has been proposed and widely used in servers and distributed systems. These facilities allow fast recovery from failures, software bugs, and misconfigurations. Though promising, they are rarely, if ever, provided for networking systems, thus the network operators still rely on ad-hoc and error-prone processes to recover from errors. In this work, we propose a checkpoint and rollback-recovery system for Software-Define Networking (SDN) systems, exploiting SDN's simple abstraction, network-wide view, and direct control properties. We propose an approach to efficiently identify the network-wide consistent states for rollback while preserving correctness.