基于RFM聚类回归模型的电子商务公司销售预测方案

Nithya Chalapathy, Helen Josephine V.L.
{"title":"基于RFM聚类回归模型的电子商务公司销售预测方案","authors":"Nithya Chalapathy, Helen Josephine V.L.","doi":"10.1145/3590837.3590937","DOIUrl":null,"url":null,"abstract":"Machine learning models are being used for better insights and decision making across many industries today. It shows to be quite useful for businesses in the ecommerce industry as well due to the vast amount of data generated and its potential. This research aimed to find insights on future sales of an ecommerce company [1]. The vast number of variables including both categorical and continuous variables under product data, customer information, transaction information, led us to implement a prediction model using regressors rather than just time series forecasting techniques. First an RFM (Recency, Frequency and Monetary) based clustering algorithm was used to get customer related information and then integrate those results into a regressor to achieve the desired goal of prediction of sales. Two schemes were tested one being predictions on individual clusters and the other where the clusters were one hot encoded back into the main data. Results show quite high accuracy of prediction. The high R-squared also indicated that our hypothesis of including the variables contributed significantly to the predicted sales values was correct in this case. This research fulfills an identified need to understand how machine learning algorithms can be implemented by multiple algorithms being integrated in sequential and logical orders thus helping derive business specific strategies rather than making it a mere technical process by providing empirical results about how the predicted sales values along with given inputs can contribute in business decision making relating to marketing, inventory management, dynamic pricing or many more such strategies.","PeriodicalId":112926,"journal":{"name":"Proceedings of the 4th International Conference on Information Management & Machine Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sales Prediction Scheme Using RFM based Clustering and Regressor Model for Ecommerce Company\",\"authors\":\"Nithya Chalapathy, Helen Josephine V.L.\",\"doi\":\"10.1145/3590837.3590937\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning models are being used for better insights and decision making across many industries today. It shows to be quite useful for businesses in the ecommerce industry as well due to the vast amount of data generated and its potential. This research aimed to find insights on future sales of an ecommerce company [1]. The vast number of variables including both categorical and continuous variables under product data, customer information, transaction information, led us to implement a prediction model using regressors rather than just time series forecasting techniques. First an RFM (Recency, Frequency and Monetary) based clustering algorithm was used to get customer related information and then integrate those results into a regressor to achieve the desired goal of prediction of sales. Two schemes were tested one being predictions on individual clusters and the other where the clusters were one hot encoded back into the main data. Results show quite high accuracy of prediction. The high R-squared also indicated that our hypothesis of including the variables contributed significantly to the predicted sales values was correct in this case. This research fulfills an identified need to understand how machine learning algorithms can be implemented by multiple algorithms being integrated in sequential and logical orders thus helping derive business specific strategies rather than making it a mere technical process by providing empirical results about how the predicted sales values along with given inputs can contribute in business decision making relating to marketing, inventory management, dynamic pricing or many more such strategies.\",\"PeriodicalId\":112926,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Information Management & Machine Intelligence\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Information Management & Machine Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3590837.3590937\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Information Management & Machine Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3590837.3590937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

如今,机器学习模型在许多行业被用于更好的洞察和决策。由于产生的大量数据及其潜力,它对电子商务行业的企业也非常有用。这项研究旨在发现对电子商务公司未来销售的见解[1]。大量的变量,包括产品数据、客户信息、交易信息下的分类变量和连续变量,导致我们使用回归量而不仅仅是时间序列预测技术实现预测模型。首先使用基于RFM (Recency, Frequency and Monetary)的聚类算法获取客户相关信息,然后将这些结果整合到回归量中,以实现预期的销售预测目标。测试了两种方案,一种是对单个集群的预测,另一种是将集群编码回主数据。结果表明,预测精度较高。高r平方也表明,在这种情况下,我们包含变量对预测销售额有重大贡献的假设是正确的。本研究满足了一种明确的需求,即了解机器学习算法如何通过按顺序和逻辑顺序集成的多种算法来实现,从而帮助推导出特定的业务策略,而不是通过提供有关预测销售额以及给定输入如何有助于与营销、库存管理、动态定价或者更多类似的策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sales Prediction Scheme Using RFM based Clustering and Regressor Model for Ecommerce Company
Machine learning models are being used for better insights and decision making across many industries today. It shows to be quite useful for businesses in the ecommerce industry as well due to the vast amount of data generated and its potential. This research aimed to find insights on future sales of an ecommerce company [1]. The vast number of variables including both categorical and continuous variables under product data, customer information, transaction information, led us to implement a prediction model using regressors rather than just time series forecasting techniques. First an RFM (Recency, Frequency and Monetary) based clustering algorithm was used to get customer related information and then integrate those results into a regressor to achieve the desired goal of prediction of sales. Two schemes were tested one being predictions on individual clusters and the other where the clusters were one hot encoded back into the main data. Results show quite high accuracy of prediction. The high R-squared also indicated that our hypothesis of including the variables contributed significantly to the predicted sales values was correct in this case. This research fulfills an identified need to understand how machine learning algorithms can be implemented by multiple algorithms being integrated in sequential and logical orders thus helping derive business specific strategies rather than making it a mere technical process by providing empirical results about how the predicted sales values along with given inputs can contribute in business decision making relating to marketing, inventory management, dynamic pricing or many more such strategies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信