The Extended Finite State Machine and Fault Tolerant Mechanism in Distributed Systems

Shengfa Gao, Xin Li, Ruihua Zhang
Published in: 2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications
Publication date: 2009-12-02
DOI: 10.1109/SERA.2009.33 (https://doi.org/10.1109/SERA.2009.33)
Citations: 4

Abstract

Process synchronization and fault tolerance are central topics in distributed systems research, yet few studies address the mathematical models underlying them. This paper treats a distributed system as an event-driven system, classifies the events that cause changes in system state into four classes, and proposes an Extended Finite State Machine (EFSM) with synchronization and fault-tolerance messages for distributed systems. Based on this EFSM, a checkpoint-establishment algorithm is proposed. While a checkpoint is being established, its consistency can be determined by counting the messages sent and received. If a message is lost, the sending and receiving processes involved can be identified by checking these counts, and the lost message can then be retransmitted and received. This simplifies the establishment of the global state of a distributed system.
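The abstract does not spell out the checkpoint algorithm, so the following is only a minimal sketch of the general idea of checking checkpoint consistency by comparing send and receive counts. The `Checkpoint` layout, the per-peer counters, and the function names are all assumptions made for illustration; they are not taken from the paper. The sketch treats a pair of processes as inconsistent when the receiver has recorded more messages than the sender recorded sending, and reports a retransmission candidate when the sender has recorded more than the receiver.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Checkpoint:
    """Local checkpoint of one process (hypothetical layout, not from the paper)."""
    sent: Dict[str, int] = field(default_factory=dict)      # peer -> messages sent to that peer
    received: Dict[str, int] = field(default_factory=dict)  # peer -> messages received from that peer


Mismatch = Tuple[str, str, int]  # (sender, receiver, difference in message count)


def compare_counters(checkpoints: Dict[str, Checkpoint]) -> Tuple[List[Mismatch], List[Mismatch]]:
    """Compare send and receive counters for every ordered pair of processes.

    Returns (orphans, missing):
      orphans: receiver recorded more messages than the sender recorded sending,
               so the set of checkpoints is inconsistent.
      missing: sender recorded more messages than the receiver recorded,
               so those messages are candidates for retransmission.
    """
    orphans: List[Mismatch] = []
    missing: List[Mismatch] = []
    for sender, cp_sender in checkpoints.items():
        for receiver, cp_receiver in checkpoints.items():
            if sender == receiver:
                continue
            n_sent = cp_sender.sent.get(receiver, 0)
            n_recv = cp_receiver.received.get(sender, 0)
            if n_recv > n_sent:
                orphans.append((sender, receiver, n_recv - n_sent))
            elif n_sent > n_recv:
                missing.append((sender, receiver, n_sent - n_recv))
    return orphans, missing


def is_consistent(checkpoints: Dict[str, Checkpoint]) -> bool:
    """A set of checkpoints is treated as consistent here when it has no orphan messages."""
    orphans, _ = compare_counters(checkpoints)
    return not orphans


# Example: P1 recorded sending 3 messages to P2, but P2 recorded receiving only 2.
example = {
    "P1": Checkpoint(sent={"P2": 3}, received={"P2": 1}),
    "P2": Checkpoint(sent={"P1": 1}, received={"P1": 2}),
}
example_orphans, example_missing = compare_counters(example)
```

In this example the counters identify P1 as the sender and P2 as the receiver of one unaccounted message, which is the pair that would need to retransmit and receive it. Whether the paper uses per-peer counters or a different bookkeeping scheme is not stated in the abstract.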