{"title":"Retail Analytics to anticipate Covid-19 effects Using Big Data Technologies","authors":"Jessica Sharma, Deepikesh Sharma, Krishneel Sharma","doi":"10.1109/CSDE53843.2021.9718390","DOIUrl":null,"url":null,"abstract":"Retail analytics helps a company gain a deeper understanding of customer demand, making shopping more relevant, personalized, and convenient and boosting sales using optimal pricing. This paper aims to demonstrate retail analytics through a prototype that uses big data technologies. Using the big data technologies, the raw data is stored, analyzed and visualized to get valuable decision-making insights. The project objective is to help companies get retail analytics from which they can make decisions to anticipate the Covid-19 effects. The design for the system includes Hadoop Distributed File System (HDFS), Apache Pig, Apache Hive, SparkSQL, Spark MLLib, and Apache Zeppelin. The prototype uses a dataset that contains information for the transactions in the United Kingdom. Therefore it does not relate to covid-19 retail data but helps answer relevant questions. The dataset is used to investigate revenue aggregate by the country for the top 5 countries, daily sales activity, hourly sales activity, basket size distribution, top 20 Items sold by frequency, and market basket analysis. This paper can be used to produce a production possibility curve, reduce shortage, avoid surplus, illustrate demand and supply curves, and detect current economic conditions. All these would help the decision-makers to develop strategies to help them anticipate the impacts of Covid-19.","PeriodicalId":166950,"journal":{"name":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"156 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE53843.2021.9718390","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Retail analytics helps a company gain a deeper understanding of customer demand, making shopping more relevant, personalized, and convenient and boosting sales using optimal pricing. This paper aims to demonstrate retail analytics through a prototype that uses big data technologies. Using the big data technologies, the raw data is stored, analyzed and visualized to get valuable decision-making insights. The project objective is to help companies get retail analytics from which they can make decisions to anticipate the Covid-19 effects. The design for the system includes Hadoop Distributed File System (HDFS), Apache Pig, Apache Hive, SparkSQL, Spark MLLib, and Apache Zeppelin. The prototype uses a dataset that contains information for the transactions in the United Kingdom. Therefore it does not relate to covid-19 retail data but helps answer relevant questions. The dataset is used to investigate revenue aggregate by the country for the top 5 countries, daily sales activity, hourly sales activity, basket size distribution, top 20 Items sold by frequency, and market basket analysis. This paper can be used to produce a production possibility curve, reduce shortage, avoid surplus, illustrate demand and supply curves, and detect current economic conditions. All these would help the decision-makers to develop strategies to help them anticipate the impacts of Covid-19.