Linh Nguyen Tran Ngoc, Vu Hong Quan, Le Hoang Ngan, Tran Duy Phu, Hoang-Quynh Le
{"title":"An incremental ensemble learning system for Vietnamese e-commerce product classification","authors":"Linh Nguyen Tran Ngoc, Vu Hong Quan, Le Hoang Ngan, Tran Duy Phu, Hoang-Quynh Le","doi":"10.1109/KSE53942.2021.9648642","DOIUrl":null,"url":null,"abstract":"With the booming of e-commerce platforms, text classification models play an increasingly important role in businesses. Major challenges that businesses would face include dataset imbalance, continuously added data, language specificity and product specificity. In this paper, we propose a scalable incremental machine learning system for industrial-scale deployment in real-world business. The system also includes steps to optimize e-commerce product specifics. The proposal tactics including keyword dictionary mapping, sampling technique and ensemble learning delivered better performance when compared to models without them. Our experiments also showed that minibatch SVM produced good results and might be considerable in a lighter system.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the booming of e-commerce platforms, text classification models play an increasingly important role in businesses. Major challenges that businesses would face include dataset imbalance, continuously added data, language specificity and product specificity. In this paper, we propose a scalable incremental machine learning system for industrial-scale deployment in real-world business. The system also includes steps to optimize e-commerce product specifics. The proposal tactics including keyword dictionary mapping, sampling technique and ensemble learning delivered better performance when compared to models without them. Our experiments also showed that minibatch SVM produced good results and might be considerable in a lighter system.