{"title":"基于销售点数据的零售商店销售绩效分析的顺序聚类与分类方法","authors":"Chao-Lung Yang, Nguyen Thi Phuong Quyen","doi":"10.1142/s0219622022500079","DOIUrl":null,"url":null,"abstract":"Point-of-Sale (POS) data analysis is usually used to explore sales performance in business commence. This manuscript aims to combine unsupervised clustering and supervised classification methods in an integrated data analysis framework to analyze the real-world POS data. Clustering method, which is performed on sales dataset, is used to cluster the stores into several groups. The clustering results, data labels, are then combined with other information in store features dataset as the inputs of the classification model which classifies the clustering labels by using store features dataset. Non-dominated sorting generic algorithm-II (NSGA-II) is applied in the framework to employ the multi-objective of clustering and classification. The experimental case study shows clustering results can reveal the hidden structure of sales performance of retail stores while classification can reveal the major factors that effect to the sales performance under different group of retail stores. The correlations between sales clusters and the store information can be obtained sequentially under a series of data analysis with the proposed framework.","PeriodicalId":13527,"journal":{"name":"Int. J. Inf. Technol. Decis. Mak.","volume":"127 1","pages":"885-910"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sequential Clustering and Classification Approach to Analyze Sales Performance of Retail Stores Based on Point-of-Sale Data\",\"authors\":\"Chao-Lung Yang, Nguyen Thi Phuong Quyen\",\"doi\":\"10.1142/s0219622022500079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Point-of-Sale (POS) data analysis is usually used to explore sales performance in business commence. This manuscript aims to combine unsupervised clustering and supervised classification methods in an integrated data analysis framework to analyze the real-world POS data. Clustering method, which is performed on sales dataset, is used to cluster the stores into several groups. The clustering results, data labels, are then combined with other information in store features dataset as the inputs of the classification model which classifies the clustering labels by using store features dataset. Non-dominated sorting generic algorithm-II (NSGA-II) is applied in the framework to employ the multi-objective of clustering and classification. The experimental case study shows clustering results can reveal the hidden structure of sales performance of retail stores while classification can reveal the major factors that effect to the sales performance under different group of retail stores. The correlations between sales clusters and the store information can be obtained sequentially under a series of data analysis with the proposed framework.\",\"PeriodicalId\":13527,\"journal\":{\"name\":\"Int. J. Inf. Technol. Decis. Mak.\",\"volume\":\"127 1\",\"pages\":\"885-910\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Inf. Technol. Decis. Mak.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219622022500079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Inf. Technol. Decis. Mak.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219622022500079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
销售点(POS)数据分析通常用于商业活动中对销售绩效的研究。本文旨在将无监督聚类和监督分类方法结合在一个集成的数据分析框架中,对真实POS数据进行分析。在销售数据集上执行聚类方法,将商店聚为几组。然后将聚类结果(数据标签)与存储特征数据集中的其他信息相结合,作为分类模型的输入,该模型利用存储特征数据集对聚类标签进行分类。该框架采用非支配排序通用算法- ii (NSGA-II),实现多目标聚类和分类。实验案例研究表明,聚类结果可以揭示零售商店销售绩效的隐藏结构,而分类可以揭示影响不同零售商店销售绩效的主要因素。在此框架下,通过一系列的数据分析,可以依次获得销售集群与商店信息之间的相关性。
Sequential Clustering and Classification Approach to Analyze Sales Performance of Retail Stores Based on Point-of-Sale Data
Point-of-Sale (POS) data analysis is usually used to explore sales performance in business commence. This manuscript aims to combine unsupervised clustering and supervised classification methods in an integrated data analysis framework to analyze the real-world POS data. Clustering method, which is performed on sales dataset, is used to cluster the stores into several groups. The clustering results, data labels, are then combined with other information in store features dataset as the inputs of the classification model which classifies the clustering labels by using store features dataset. Non-dominated sorting generic algorithm-II (NSGA-II) is applied in the framework to employ the multi-objective of clustering and classification. The experimental case study shows clustering results can reveal the hidden structure of sales performance of retail stores while classification can reveal the major factors that effect to the sales performance under different group of retail stores. The correlations between sales clusters and the store information can be obtained sequentially under a series of data analysis with the proposed framework.