{"title":"Performance comparison of Apache Hadoop and Apache Spark","authors":"Amritpal Singh, A. Khamparia, A. K. Luhach","doi":"10.1145/3339311.3339329","DOIUrl":null,"url":null,"abstract":"The term 'Big Data' is a broad term used for the data sets, which is enormous and traditional data processing applications find it hard to process. Both Apache Spark and Apache Hadoop are one of the significant parts of the big data family. Some of the researchers view both frameworks as the rivals but it is not that easy to compare these two as they perform numerous things same, but there are also some areas where both work differently. Still both Apache Hadoop and Apache Spark are comparable on different parameters. This research intends to compare these two popular frameworks and figure out their strengths, weaknesses, unique characteristics and try to answer whether Spark can replace hadoop or not.","PeriodicalId":206653,"journal":{"name":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3339311.3339329","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The term 'Big Data' is a broad term used for the data sets, which is enormous and traditional data processing applications find it hard to process. Both Apache Spark and Apache Hadoop are one of the significant parts of the big data family. Some of the researchers view both frameworks as the rivals but it is not that easy to compare these two as they perform numerous things same, but there are also some areas where both work differently. Still both Apache Hadoop and Apache Spark are comparable on different parameters. This research intends to compare these two popular frameworks and figure out their strengths, weaknesses, unique characteristics and try to answer whether Spark can replace hadoop or not.