{"title":"利用大数据技术预测新冠肺炎影响的零售分析","authors":"Jessica Sharma, Deepikesh Sharma, Krishneel Sharma","doi":"10.1109/CSDE53843.2021.9718390","DOIUrl":null,"url":null,"abstract":"Retail analytics helps a company gain a deeper understanding of customer demand, making shopping more relevant, personalized, and convenient and boosting sales using optimal pricing. This paper aims to demonstrate retail analytics through a prototype that uses big data technologies. Using the big data technologies, the raw data is stored, analyzed and visualized to get valuable decision-making insights. The project objective is to help companies get retail analytics from which they can make decisions to anticipate the Covid-19 effects. The design for the system includes Hadoop Distributed File System (HDFS), Apache Pig, Apache Hive, SparkSQL, Spark MLLib, and Apache Zeppelin. The prototype uses a dataset that contains information for the transactions in the United Kingdom. Therefore it does not relate to covid-19 retail data but helps answer relevant questions. The dataset is used to investigate revenue aggregate by the country for the top 5 countries, daily sales activity, hourly sales activity, basket size distribution, top 20 Items sold by frequency, and market basket analysis. This paper can be used to produce a production possibility curve, reduce shortage, avoid surplus, illustrate demand and supply curves, and detect current economic conditions. All these would help the decision-makers to develop strategies to help them anticipate the impacts of Covid-19.","PeriodicalId":166950,"journal":{"name":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"156 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Retail Analytics to anticipate Covid-19 effects Using Big Data Technologies\",\"authors\":\"Jessica Sharma, Deepikesh Sharma, Krishneel Sharma\",\"doi\":\"10.1109/CSDE53843.2021.9718390\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Retail analytics helps a company gain a deeper understanding of customer demand, making shopping more relevant, personalized, and convenient and boosting sales using optimal pricing. This paper aims to demonstrate retail analytics through a prototype that uses big data technologies. Using the big data technologies, the raw data is stored, analyzed and visualized to get valuable decision-making insights. The project objective is to help companies get retail analytics from which they can make decisions to anticipate the Covid-19 effects. The design for the system includes Hadoop Distributed File System (HDFS), Apache Pig, Apache Hive, SparkSQL, Spark MLLib, and Apache Zeppelin. The prototype uses a dataset that contains information for the transactions in the United Kingdom. Therefore it does not relate to covid-19 retail data but helps answer relevant questions. The dataset is used to investigate revenue aggregate by the country for the top 5 countries, daily sales activity, hourly sales activity, basket size distribution, top 20 Items sold by frequency, and market basket analysis. This paper can be used to produce a production possibility curve, reduce shortage, avoid surplus, illustrate demand and supply curves, and detect current economic conditions. All these would help the decision-makers to develop strategies to help them anticipate the impacts of Covid-19.\",\"PeriodicalId\":166950,\"journal\":{\"name\":\"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"volume\":\"156 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSDE53843.2021.9718390\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE53843.2021.9718390","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Retail Analytics to anticipate Covid-19 effects Using Big Data Technologies
Retail analytics helps a company gain a deeper understanding of customer demand, making shopping more relevant, personalized, and convenient and boosting sales using optimal pricing. This paper aims to demonstrate retail analytics through a prototype that uses big data technologies. Using the big data technologies, the raw data is stored, analyzed and visualized to get valuable decision-making insights. The project objective is to help companies get retail analytics from which they can make decisions to anticipate the Covid-19 effects. The design for the system includes Hadoop Distributed File System (HDFS), Apache Pig, Apache Hive, SparkSQL, Spark MLLib, and Apache Zeppelin. The prototype uses a dataset that contains information for the transactions in the United Kingdom. Therefore it does not relate to covid-19 retail data but helps answer relevant questions. The dataset is used to investigate revenue aggregate by the country for the top 5 countries, daily sales activity, hourly sales activity, basket size distribution, top 20 Items sold by frequency, and market basket analysis. This paper can be used to produce a production possibility curve, reduce shortage, avoid surplus, illustrate demand and supply curves, and detect current economic conditions. All these would help the decision-makers to develop strategies to help them anticipate the impacts of Covid-19.