Apache Storm的性能评估与编写脚本

Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn
{"title":"Apache Storm的性能评估与编写脚本","authors":"Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn","doi":"10.23919/ICACT.2019.8701904","DOIUrl":null,"url":null,"abstract":"With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.","PeriodicalId":226261,"journal":{"name":"2019 21st International Conference on Advanced Communication Technology (ICACT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Evaluation of Apache Storm With Writing Scripts\",\"authors\":\"Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn\",\"doi\":\"10.23919/ICACT.2019.8701904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.\",\"PeriodicalId\":226261,\"journal\":{\"name\":\"2019 21st International Conference on Advanced Communication Technology (ICACT)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 21st International Conference on Advanced Communication Technology (ICACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICACT.2019.8701904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 21st International Conference on Advanced Communication Technology (ICACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICACT.2019.8701904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着各种来源的流数据呈指数级增长,大数据呈现出数据探索和使用的新时代。在考虑任何部署时,对实时流数据处理技术性能的理解已经成为一个关键的先决条件。尽管存在许多流数据分析技术,但编写脚本对其性能的影响知之甚少。在伪集群上使用MapReduce编程模型,对Apache Storm使用的五种编写脚本进行了字数统计、实验评估;英语,阿拉伯语,印地语,中文和日语。我们将单词计数定义为一个单词在正文中出现的时间。由300个英语句子组成的静态和结构化数据被翻译成正在研究的脚本并加载到Apache Storm中。结果表明,与中文和日文句子相比,Apache Storm更容易分析英语、阿拉伯语和印地语句子。Apache Storm还可以执行和区分单个单词,并对英语、阿拉伯语和印地语句子进行单词计数,速度比用中文和日语写的句子要快。然而,在处理方面,情况正好相反,用汉语和日语写的句子比用英语、阿拉伯语和印地语写的句子处理得快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Performance Evaluation of Apache Storm With Writing Scripts
With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信