{"title":"Beyond Hadoop for e-commerce Big Data Analysis through Amazon","authors":"Ankush Verma, N. Sethi, Neelesh Jai","doi":"10.1109/ICACAT.2018.8933660","DOIUrl":null,"url":null,"abstract":"Analysis of big data is a challenging task as it involves large distributed file systems. The infrastructure require for analyzing big data is different from Amazon analysis technology and data mining on various types of data. Mapreduce is widely popular for analysis of big data. Mapreduce is working with mapping, sorting, shuffling and reducing using Master/Slave architecture. Similarly Amazon MapReduce programming model over large data set is introduced by Amazon, on the web especially used for ecommerce. In this paper Amazon EC2 cloud computing model used for central part of designed web and for collection and storing of large data Amazon uses S3. Amazon clusters is a group of servers which is working together to perform any type of tasks on distributed database on different servers in parallel. Amazon services are used in analysis of big data and to increase business efficiency","PeriodicalId":6575,"journal":{"name":"2018 International Conference on Advanced Computation and Telecommunication (ICACAT)","volume":"84 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Advanced Computation and Telecommunication (ICACAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACAT.2018.8933660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Analysis of big data is a challenging task as it involves large distributed file systems. The infrastructure require for analyzing big data is different from Amazon analysis technology and data mining on various types of data. Mapreduce is widely popular for analysis of big data. Mapreduce is working with mapping, sorting, shuffling and reducing using Master/Slave architecture. Similarly Amazon MapReduce programming model over large data set is introduced by Amazon, on the web especially used for ecommerce. In this paper Amazon EC2 cloud computing model used for central part of designed web and for collection and storing of large data Amazon uses S3. Amazon clusters is a group of servers which is working together to perform any type of tasks on distributed database on different servers in parallel. Amazon services are used in analysis of big data and to increase business efficiency