The Extended Finite State Machine and Fault Tolerant Mechanism in Distributed Systems
Authors: Shengfa Gao, Xin Li, Ruihua Zhang
Published in: 2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications
Publication date: 2009-12-02
DOI: 10.1109/SERA.2009.33 (https://doi.org/10.1109/SERA.2009.33)
Citations: 4
Abstract
Process synchronization and fault tolerance are central topics in distributed systems research, yet few studies address the mathematical models that underlie them. This paper treats a distributed system as an event-driven system, classifies the events that cause system state changes into four classes, and proposes an Extended Finite State Machine (EFSM) with synchronization and fault-tolerant messages for distributed systems. A checkpointing algorithm based on this EFSM is then proposed. While a checkpoint is being established, its consistency can be determined by counting the messages sent and received. If a message is lost, the sending and receiving processes involved can be identified by comparing these counts, and the lost message can be retransmitted and received. This simplifies establishing the global state of a distributed system.
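The abstract does not spell out the algorithm, so the following is only a minimal sketch of the counting idea. It assumes that each process records, at checkpoint time, how many messages it has sent to and received from each peer. The names `Checkpoint`, `find_mismatches`, and `is_consistent` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class Checkpoint:
    """Hypothetical per-process checkpoint record (not the paper's structure)."""
    pid: str
    sent: dict = field(default_factory=dict)      # peer pid -> messages sent to that peer
    received: dict = field(default_factory=dict)  # peer pid -> messages received from that peer


def find_mismatches(checkpoints):
    """Return (sender, receiver, sent - received) for every channel whose counts differ.

    A positive difference means messages were sent but not yet received
    (in transit or lost); a negative difference means the receiver recorded
    a message whose send is missing from the sender's checkpoint.
    """
    by_pid = {c.pid: c for c in checkpoints}
    mismatches = []
    for sender in by_pid.values():
        for receiver in by_pid.values():
            if sender.pid == receiver.pid:
                continue
            n_sent = sender.sent.get(receiver.pid, 0)
            n_received = receiver.received.get(sender.pid, 0)
            if n_sent != n_received:
                mismatches.append((sender.pid, receiver.pid, n_sent - n_received))
    return mismatches


def is_consistent(checkpoints):
    """Assumed criterion: consistent when every channel's counts match."""
    return not find_mismatches(checkpoints)


# Example: p1 sent three messages to p2, but p2 recorded only two of them.
p1 = Checkpoint("p1", sent={"p2": 3}, received={"p2": 1})
p2 = Checkpoint("p2", sent={"p1": 1}, received={"p1": 2})
example_mismatches = find_mismatches([p1, p2])
```

In this example the only mismatched channel is p1 to p2, with one message unaccounted for. That channel identifies the sending and receiving processes that would take part in a retransmission.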