{"title":"An Approach to Improving Reliability for Distributed Video-Based Monitoring Systems","authors":"M. Jiang, Yaping Liu, Xinzheng Gu","doi":"10.1109/SSIRI.2009.34","DOIUrl":null,"url":null,"abstract":"A large-scale distributed system may experience software or hardware failures that lead to undesirable down-time of the system. While the failure of a hardware node is common for large distributed systems, the reliability of software can also be a significant factor. System reliability can be improved by integrating both hardware and software based reliability techniques. We presented a combined fault-tolerant approach to improve reliability for a large monitoring system through failure detection, isolation, and recovery. The proposed approach was applied to real-time distributed monitoring system and preliminary experiments showed substantial improvement on reliability. Experiments also showed that our approach is scaleable to meet the needs of large-scale monitoring systems.","PeriodicalId":196276,"journal":{"name":"2009 Third IEEE International Conference on Secure Software Integration and Reliability Improvement","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third IEEE International Conference on Secure Software Integration and Reliability Improvement","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSIRI.2009.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A large-scale distributed system may experience software or hardware failures that lead to undesirable down-time of the system. While the failure of a hardware node is common for large distributed systems, the reliability of software can also be a significant factor. System reliability can be improved by integrating both hardware and software based reliability techniques. We presented a combined fault-tolerant approach to improve reliability for a large monitoring system through failure detection, isolation, and recovery. The proposed approach was applied to real-time distributed monitoring system and preliminary experiments showed substantial improvement on reliability. Experiments also showed that our approach is scaleable to meet the needs of large-scale monitoring systems.