{"title":"Online Retail Pattern Quality Improvement: From Frequent Sequential Pattern to High-Utility Sequential Pattern","authors":"Ridowati Gunawan","doi":"10.1109/ISRITI54043.2021.9702782","DOIUrl":null,"url":null,"abstract":"There has been a change in people's shopping behavior, especially during the Covid-19 pandemic, from what traditionally requires direct face-to-face meetings between sellers and buyers, to virtual face-to-face through various shopping media. All activities carried out by customers, click streams performed, items purchased, the number of items including the price will be recorded in a log. Activity records in the log are very useful to be able to find out the pattern of activity sequences from customers, especially the order of items purchased by customers. However, the management certainly needs more knowledge, not just the order of goods that are often purchased by customers. Does the order of items purchased also provide maximum profit? There have been many methods to get frequent sequential patterns from customer activities, but getting a pattern that chooses more quality by adding utility value needs to be considered. In this research, the method used to obtain frequent sequential patterns is using PrefixSpan (Prefix-projected Sequential PAtterN) and the method used to obtain a high-utility sequential pattern is the USpan (Utility Sequential PAtterN) method. USpan is applied to the BMS (Blue Martini DataSet) dataset, which is the dataset used in KDD (Knowledge Discovery in Databases) CUP 2000 which consists of clickstream data from an e-commerce. The experimental results show that the frequent sequential pattern will always appear in the high-utility sequential pattern but not vice versa. It is certain that a high-utility pattern must be sequential, but a sequential pattern is not necessarily a high-utility sequential. From the results of the high-utility sequential pattern, it can be used as input to provide recommendations to customers to carry out the shopping process on items that can provide greater profits. The conclusion of the research conducted is that the high-utility sequential pattern mining can produce a higher quality pattern than just getting a frequent sequential pattern.","PeriodicalId":156265,"journal":{"name":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI54043.2021.9702782","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
There has been a change in people's shopping behavior, especially during the Covid-19 pandemic, from what traditionally requires direct face-to-face meetings between sellers and buyers, to virtual face-to-face through various shopping media. All activities carried out by customers, click streams performed, items purchased, the number of items including the price will be recorded in a log. Activity records in the log are very useful to be able to find out the pattern of activity sequences from customers, especially the order of items purchased by customers. However, the management certainly needs more knowledge, not just the order of goods that are often purchased by customers. Does the order of items purchased also provide maximum profit? There have been many methods to get frequent sequential patterns from customer activities, but getting a pattern that chooses more quality by adding utility value needs to be considered. In this research, the method used to obtain frequent sequential patterns is using PrefixSpan (Prefix-projected Sequential PAtterN) and the method used to obtain a high-utility sequential pattern is the USpan (Utility Sequential PAtterN) method. USpan is applied to the BMS (Blue Martini DataSet) dataset, which is the dataset used in KDD (Knowledge Discovery in Databases) CUP 2000 which consists of clickstream data from an e-commerce. The experimental results show that the frequent sequential pattern will always appear in the high-utility sequential pattern but not vice versa. It is certain that a high-utility pattern must be sequential, but a sequential pattern is not necessarily a high-utility sequential. From the results of the high-utility sequential pattern, it can be used as input to provide recommendations to customers to carry out the shopping process on items that can provide greater profits. The conclusion of the research conducted is that the high-utility sequential pattern mining can produce a higher quality pattern than just getting a frequent sequential pattern.
特别是在新冠肺炎疫情期间,人们的购物行为发生了变化,从传统上需要卖家和买家直接面对面的交流,转变为通过各种购物媒体进行虚拟面对面的交流。客户进行的所有活动,执行的点击流,购买的物品,包括价格在内的物品数量将被记录在日志中。日志中的活动记录非常有用,可以从客户那里找到活动序列的模式,特别是客户购买物品的顺序。然而,管理当然需要更多的知识,而不仅仅是客户经常购买的商品的顺序。购买物品的顺序是否也能提供最大的利润?已经有许多方法可以从客户活动中获得频繁的顺序模式,但是需要考虑通过增加实用价值来获得选择更高质量的模式。在本研究中,获得频繁序列模式的方法是使用PrefixSpan(前缀投影序列模式),获得高效用序列模式的方法是使用USpan(效用序列模式)方法。USpan应用于BMS (Blue Martini DataSet)数据集,该数据集是KDD (Knowledge Discovery in Databases) CUP 2000中使用的数据集,该数据集由来自电子商务的点击流数据组成。实验结果表明,频繁序列模式总是出现在高效用序列模式中,而非相反。可以肯定的是,高效用模式必须是顺序的,但顺序模式不一定是高效用序列。从高效用序列模式的结果来看,可以作为输入,向顾客提供建议,对能够提供更大利润的商品进行购物过程。研究结果表明,高效用序列模式挖掘比仅仅获得频繁序列模式能产生更高质量的模式。