Coordinate Checkpoint Mechanism on Real-Time Messaging System in Kafka Pipeline Architecture
Authors: Thandar Aung, Hla Yin Min, A. Maw
Published in: 2019 International Conference on Advanced Information Technologies (ICAIT), 2019-11-01
DOI: 10.1109/AITC.2019.8921392 (https://doi.org/10.1109/AITC.2019.8921392)
Real-time messaging systems are central to time-critical decision making in many organizations. In such systems, fault tolerance is the key challenge in meeting reliability requirements. Apache Kafka is a popular framework for feeding data streams into processing platforms. However, a server failure in Apache Kafka poses many challenges for the replication process. To improve fault tolerance in Apache Kafka, this paper defines a fixed checkpoint interval to reduce recovery time and the number of messages lost when a server fails. We then measure the cost of checkpointing, the cost of rollback, and the total time cost of the overheads incurred by the fixed checkpoint interval. The system shows that the drawback for real-time processing depends on the number of lost messages across partitions and on the server failure process. The experimental results indicate that the checkpoint interval method reduces recovery time and lost messages. According to the experiments, the total time cost with a checkpoint interval is approximately 30% lower than without one.
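The three quantities named in the abstract (cost of checkpointing, cost of rollback, and total overhead) can be illustrated with a toy model. The abstract does not state the paper's actual cost model, so the following Python sketch is an assumption made for illustration only: a checkpoint is taken after every `interval` messages, a single failure occurs after `fail_at` messages, and every message processed since the last checkpoint counts as lost and must be reprocessed. The names `Overhead` and `overhead`, and the constants `per_message`, `per_checkpoint`, and `restart`, are hypothetical and do not come from the paper or from Kafka.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Overhead:
    checkpoint_cost: float  # time spent writing checkpoints
    rollback_cost: float    # time spent restarting and reprocessing lost messages
    lost_messages: int      # messages processed since the last checkpoint

    @property
    def total(self) -> float:
        # Total overhead is the sum of both cost components.
        return self.checkpoint_cost + self.rollback_cost


def overhead(
    n_messages: int,
    fail_at: int,
    interval: Optional[int],
    per_message: float = 1.0,     # assumed time to (re)process one message
    per_checkpoint: float = 2.0,  # assumed time to write one checkpoint
    restart: float = 5.0,         # assumed fixed restart time after a failure
) -> Overhead:
    """Toy overhead model for one failure; interval=None means no checkpointing."""
    if not 0 <= fail_at <= n_messages:
        raise ValueError("fail_at must lie between 0 and n_messages")
    if interval is None:
        checkpoints = 0
        last_saved = 0  # nothing was saved, so recovery starts from the beginning
    else:
        if interval <= 0:
            raise ValueError("interval must be positive")
        checkpoints = n_messages // interval
        # Position of the most recent checkpoint at the moment of failure.
        last_saved = (fail_at // interval) * interval
    lost = fail_at - last_saved
    return Overhead(
        checkpoint_cost=checkpoints * per_checkpoint,
        rollback_cost=restart + lost * per_message,
        lost_messages=lost,
    )


if __name__ == "__main__":
    with_cp = overhead(n_messages=1000, fail_at=250, interval=100)
    without_cp = overhead(n_messages=1000, fail_at=250, interval=None)
    print("with checkpoints:   ", with_cp, with_cp.total)
    print("without checkpoints:", without_cp, without_cp.total)
```

The constants are arbitrary, so the numbers this sketch prints say nothing about the roughly 30% saving reported in the abstract. It only shows the trade-off: a shorter interval raises the checkpointing cost but lowers the number of lost messages and the rollback cost.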