Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn
{"title":"Apache Storm的性能评估与编写脚本","authors":"Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn","doi":"10.23919/ICACT.2019.8701904","DOIUrl":null,"url":null,"abstract":"With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.","PeriodicalId":226261,"journal":{"name":"2019 21st International Conference on Advanced Communication Technology (ICACT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Evaluation of Apache Storm With Writing Scripts\",\"authors\":\"Lemi Isaac Yoseke Laku, A. F. Y. Mohammed, F. Hazemi, Chan-Hyun Youn\",\"doi\":\"10.23919/ICACT.2019.8701904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.\",\"PeriodicalId\":226261,\"journal\":{\"name\":\"2019 21st International Conference on Advanced Communication Technology (ICACT)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 21st International Conference on Advanced Communication Technology (ICACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICACT.2019.8701904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 21st International Conference on Advanced Communication Technology (ICACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICACT.2019.8701904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance Evaluation of Apache Storm With Writing Scripts
With the exponential growth of stream data emanating from a variety of sources, today, big data presents a new era in data exploration and usage. The understanding of the performance of real-time stream data processing technologies has become a key pre-requisite while considering any deployment. Although many technologies for stream data analytics exist, little is known about the impact of writing scripts on their performance. Using a MapReduce programming model on a pseudo cluster, we conduct a word count, experimental evaluation of Apache Storm using five writing scripts; English, Arabic, Hindi, Chinese and Japanese. We define our word count as the number of time a word is written in a body of text. Static and structured data made up of 300 English sentences is translated into the scripts under study and loaded into Apache Storm. The results show that Apache Storm analyzes, English, Arabic and Hindi script sentences with ease as compared to those written in Chinese and Japanese script. Apache Storm also executes and distinguishes individual words and performs a word count on English, Arabic and Hindi sentences faster than those written in Chinese and Japanese. However, in terms of processing, the reverse is true, sentences written in Chinese and Japanese are processed faster than those written in English, Arabic and Hindi scripts.