Authors: Amritpal Singh, A. Khamparia, A. K. Luhach
Published in: Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19, 2019-06-15
DOI: 10.1145/3339311.3339329 (https://doi.org/10.1145/3339311.3339329)
Performance comparison of Apache Hadoop and Apache Spark
The term 'Big Data' broadly refers to data sets so large that traditional data processing applications struggle to process them. Apache Spark and Apache Hadoop are both significant members of the big data family. Some researchers view the two frameworks as rivals, but comparing them is not straightforward: they do many things the same way, yet work differently in other areas. Even so, Apache Hadoop and Apache Spark can be compared on a number of parameters. This research compares these two popular frameworks, identifies their strengths, weaknesses, and unique characteristics, and tries to answer whether Spark can replace Hadoop.