Research on performance optimization and visualization tool of Hadoop

Yan Xu, Wei Zhou, Binyue Cui, Lingyun Lu
Published in: 2015 10th International Conference on Computer Science & Education (ICCSE), 2015-07-22. DOI: 10.1109/ICCSE.2015.7250233 (https://doi.org/10.1109/ICCSE.2015.7250233)
Citations: 5

Abstract

Hadoop, a distributed system infrastructure developed by the Apache Software Foundation, has become a mainstream cloud-computing platform. Improving the performance of MapReduce, one of its core frameworks, has become a hot topic, yet achieving better computational performance remains a major challenge for programmers. Because research on displaying and analyzing program performance with the aid of visualization technology is receiving growing attention, many visualization tools for performance analysis and optimization have appeared. This paper analyzes sorting performance in the Map phase of Hadoop and proposes a method to optimize that sorting performance dynamically. It also collects and analyzes ten of the most popular visualization tools worldwide, finds through comparison that the R language is the tool suited to Hadoop, and introduces the combination of R and Hadoop. In future work, we will apply RHadoop to MapReduce performance optimization.
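The abstract does not describe the proposed optimization itself, so the following is only a rough illustration of why map-side sorting is sensitive to tuning, not the authors' method. It assumes the standard Hadoop 2.x properties `mapreduce.task.io.sort.mb` (sort buffer size, default 100 MB) and `mapreduce.map.sort.spill.percent` (spill threshold, default 0.80), and it ignores per-record metadata overhead. The function names `estimated_spills` and `suggest_sort_mb` are hypothetical helpers introduced here for illustration.

```python
import math


def estimated_spills(map_output_mb: float,
                     sort_mb: int = 100,
                     spill_percent: float = 0.80) -> int:
    """Roughly estimate how many spill files one map task writes.

    Assumption: a spill is triggered each time the in-memory sort
    buffer fills to sort_mb * spill_percent. Record metadata overhead
    is ignored, so real tasks may spill more often than this.
    """
    if map_output_mb <= 0:
        return 0
    threshold_mb = sort_mb * spill_percent
    return math.ceil(map_output_mb / threshold_mb)


def suggest_sort_mb(map_output_mb: float,
                    spill_percent: float = 0.80,
                    max_sort_mb: int = 1024) -> int:
    """Suggest a buffer size (MB) aimed at a single spill per map task.

    The result is capped at max_sort_mb, a hypothetical limit standing
    in for whatever the task's heap can actually accommodate.
    """
    needed = math.ceil(map_output_mb / spill_percent)
    return min(max(needed, 1), max_sort_mb)


if __name__ == "__main__":
    # Compare the default buffer with a suggested one for a map task
    # that emits about 400 MB of intermediate data.
    output_mb = 400
    print("spills with defaults:", estimated_spills(output_mb))
    tuned = suggest_sort_mb(output_mb)
    print("suggested buffer (MB):", tuned)
    print("spills after tuning:", estimated_spills(output_mb, sort_mb=tuned))
```

Fewer spills mean fewer on-disk merge passes at the end of the map task, which is the usual motivation for adjusting these two properties per job rather than relying on cluster-wide defaults.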