{"title":"关键软件系统分布式容错服务体系结构研究","authors":"Ioan Cristian Schuszter, M. Cioca","doi":"10.1109/SACI51354.2021.9465574","DOIUrl":null,"url":null,"abstract":"Reliable systems have been the subject of much research in the past years, with societal dependency on computer systems becoming more and more apparent. With more and more organizations embracing DevOps culture, there is a persistent need to understand how these systems are built and what their trade-offs are. This paper discusses and benchmarks the components of a modern fault tolerant and easily scalable system, designed to maximize up-time. The paper also describes the techniques used in the development of such a system. The system architecture described is implemented through several services deployed for a new critical single sign-on system deployed at CERN (The European Laboratory for Particle Physics). A case study of several critical components of the system is also performed, identifying several trade-offs, issues and future research directions.","PeriodicalId":321907,"journal":{"name":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Study on Distributed Fault-Tolerant Service Architectures for Critical Software Systems\",\"authors\":\"Ioan Cristian Schuszter, M. Cioca\",\"doi\":\"10.1109/SACI51354.2021.9465574\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reliable systems have been the subject of much research in the past years, with societal dependency on computer systems becoming more and more apparent. With more and more organizations embracing DevOps culture, there is a persistent need to understand how these systems are built and what their trade-offs are. This paper discusses and benchmarks the components of a modern fault tolerant and easily scalable system, designed to maximize up-time. The paper also describes the techniques used in the development of such a system. The system architecture described is implemented through several services deployed for a new critical single sign-on system deployed at CERN (The European Laboratory for Particle Physics). A case study of several critical components of the system is also performed, identifying several trade-offs, issues and future research directions.\",\"PeriodicalId\":321907,\"journal\":{\"name\":\"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SACI51354.2021.9465574\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI51354.2021.9465574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Study on Distributed Fault-Tolerant Service Architectures for Critical Software Systems
Reliable systems have been the subject of much research in the past years, with societal dependency on computer systems becoming more and more apparent. With more and more organizations embracing DevOps culture, there is a persistent need to understand how these systems are built and what their trade-offs are. This paper discusses and benchmarks the components of a modern fault tolerant and easily scalable system, designed to maximize up-time. The paper also describes the techniques used in the development of such a system. The system architecture described is implemented through several services deployed for a new critical single sign-on system deployed at CERN (The European Laboratory for Particle Physics). A case study of several critical components of the system is also performed, identifying several trade-offs, issues and future research directions.