{"title":"动态检查点机制下的Apache Hadoop基准性能评估","authors":"Paulo Vinicius Cardoso, P. Barcelos","doi":"10.1109/SCCC.2018.8705152","DOIUrl":null,"url":null,"abstract":"The increasingly robust High Performance Computing (HPC) systems have promoted many researches on fault tolerance mechanisms. However, the efficiency of these techniques is an important challenge, as the data and resources demand is also getting higher. in this work, we presents how the framework Apache Hadoop implements the Checkpoint and Recovery technique for fault tolerance providing on its distributed file system (Hadoop Distributed File System). The demand of large amounts of data from all applications supported by Hadoop framework can compromise the mechanism efficiency, once its configurations attributes are defined statically. So we present a dynamic configuration mechanism to checkpoint for Apache Hadoop through a resource monitoring, whose goal is to make the Hadoop checkpoint adaptable. The proposed mechanism is then evaluated and submitted to performance tests.","PeriodicalId":235495,"journal":{"name":"2018 37th International Conference of the Chilean Computer Science Society (SCCC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance Evaluation of Apache Hadoop Benchmarks under a Dynamic Checkpointing Mechanism\",\"authors\":\"Paulo Vinicius Cardoso, P. Barcelos\",\"doi\":\"10.1109/SCCC.2018.8705152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasingly robust High Performance Computing (HPC) systems have promoted many researches on fault tolerance mechanisms. However, the efficiency of these techniques is an important challenge, as the data and resources demand is also getting higher. in this work, we presents how the framework Apache Hadoop implements the Checkpoint and Recovery technique for fault tolerance providing on its distributed file system (Hadoop Distributed File System). The demand of large amounts of data from all applications supported by Hadoop framework can compromise the mechanism efficiency, once its configurations attributes are defined statically. So we present a dynamic configuration mechanism to checkpoint for Apache Hadoop through a resource monitoring, whose goal is to make the Hadoop checkpoint adaptable. The proposed mechanism is then evaluated and submitted to performance tests.\",\"PeriodicalId\":235495,\"journal\":{\"name\":\"2018 37th International Conference of the Chilean Computer Science Society (SCCC)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 37th International Conference of the Chilean Computer Science Society (SCCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCCC.2018.8705152\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 37th International Conference of the Chilean Computer Science Society (SCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCCC.2018.8705152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance Evaluation of Apache Hadoop Benchmarks under a Dynamic Checkpointing Mechanism
The increasingly robust High Performance Computing (HPC) systems have promoted many researches on fault tolerance mechanisms. However, the efficiency of these techniques is an important challenge, as the data and resources demand is also getting higher. in this work, we presents how the framework Apache Hadoop implements the Checkpoint and Recovery technique for fault tolerance providing on its distributed file system (Hadoop Distributed File System). The demand of large amounts of data from all applications supported by Hadoop framework can compromise the mechanism efficiency, once its configurations attributes are defined statically. So we present a dynamic configuration mechanism to checkpoint for Apache Hadoop through a resource monitoring, whose goal is to make the Hadoop checkpoint adaptable. The proposed mechanism is then evaluated and submitted to performance tests.