基于Hadoop的ClickStream数据洞察发现机制的新方法

P. Anand, G. Vamsi, P. R. Kumar
{"title":"基于Hadoop的ClickStream数据洞察发现机制的新方法","authors":"P. Anand, G. Vamsi, P. R. Kumar","doi":"10.1109/ICICCT.2018.8473232","DOIUrl":null,"url":null,"abstract":"In today's world, there is huge importance for analyzing large data sets in a short span of time. Hadoop is one of such framework that is used to store and process huge unstructured or semi structured data in a distributed manner. The main theme of this paper is to analyze clickstream data that has been gathered from online retail e-commerce website using Hadoop framework. In this process, we are going to use many tools like Pig, Hive, Sqoop which works based on map-reduce algorithm in order to process big data in efficient way. The Insight finding mechanism used to tell us day wise sales report, hourly sales report and top sold item reports based on the clickstream dataset. In the end, the output visualization plots will give the detailed insights based on the clickstream data that we have processed.","PeriodicalId":334934,"journal":{"name":"2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Novel Approach for Insight Finding Mechanism on ClickStream Data Using Hadoop\",\"authors\":\"P. Anand, G. Vamsi, P. R. Kumar\",\"doi\":\"10.1109/ICICCT.2018.8473232\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today's world, there is huge importance for analyzing large data sets in a short span of time. Hadoop is one of such framework that is used to store and process huge unstructured or semi structured data in a distributed manner. The main theme of this paper is to analyze clickstream data that has been gathered from online retail e-commerce website using Hadoop framework. In this process, we are going to use many tools like Pig, Hive, Sqoop which works based on map-reduce algorithm in order to process big data in efficient way. The Insight finding mechanism used to tell us day wise sales report, hourly sales report and top sold item reports based on the clickstream dataset. In the end, the output visualization plots will give the detailed insights based on the clickstream data that we have processed.\",\"PeriodicalId\":334934,\"journal\":{\"name\":\"2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICCT.2018.8473232\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICCT.2018.8473232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

在当今世界,在短时间内分析大型数据集是非常重要的。Hadoop就是这样一个框架,用于以分布式方式存储和处理大量非结构化或半结构化数据。本文的主题是利用Hadoop框架对在线零售电子商务网站收集的点击流数据进行分析。在这个过程中,我们将使用Pig, Hive, Sqoop等基于map-reduce算法的工具来高效地处理大数据。Insight查找机制用于根据点击流数据集告诉我们每日销售报告、每小时销售报告和最畅销商品报告。最后,输出的可视化图将根据我们处理的点击流数据给出详细的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Novel Approach for Insight Finding Mechanism on ClickStream Data Using Hadoop
In today's world, there is huge importance for analyzing large data sets in a short span of time. Hadoop is one of such framework that is used to store and process huge unstructured or semi structured data in a distributed manner. The main theme of this paper is to analyze clickstream data that has been gathered from online retail e-commerce website using Hadoop framework. In this process, we are going to use many tools like Pig, Hive, Sqoop which works based on map-reduce algorithm in order to process big data in efficient way. The Insight finding mechanism used to tell us day wise sales report, hourly sales report and top sold item reports based on the clickstream dataset. In the end, the output visualization plots will give the detailed insights based on the clickstream data that we have processed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信