Linh Nguyen Tran Ngoc, Vu Hong Quan, Le Hoang Ngan, Tran Duy Phu, Hoang-Quynh Le
{"title":"越南电子商务产品分类的增量集成学习系统","authors":"Linh Nguyen Tran Ngoc, Vu Hong Quan, Le Hoang Ngan, Tran Duy Phu, Hoang-Quynh Le","doi":"10.1109/KSE53942.2021.9648642","DOIUrl":null,"url":null,"abstract":"With the booming of e-commerce platforms, text classification models play an increasingly important role in businesses. Major challenges that businesses would face include dataset imbalance, continuously added data, language specificity and product specificity. In this paper, we propose a scalable incremental machine learning system for industrial-scale deployment in real-world business. The system also includes steps to optimize e-commerce product specifics. The proposal tactics including keyword dictionary mapping, sampling technique and ensemble learning delivered better performance when compared to models without them. Our experiments also showed that minibatch SVM produced good results and might be considerable in a lighter system.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An incremental ensemble learning system for Vietnamese e-commerce product classification\",\"authors\":\"Linh Nguyen Tran Ngoc, Vu Hong Quan, Le Hoang Ngan, Tran Duy Phu, Hoang-Quynh Le\",\"doi\":\"10.1109/KSE53942.2021.9648642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the booming of e-commerce platforms, text classification models play an increasingly important role in businesses. Major challenges that businesses would face include dataset imbalance, continuously added data, language specificity and product specificity. In this paper, we propose a scalable incremental machine learning system for industrial-scale deployment in real-world business. The system also includes steps to optimize e-commerce product specifics. The proposal tactics including keyword dictionary mapping, sampling technique and ensemble learning delivered better performance when compared to models without them. Our experiments also showed that minibatch SVM produced good results and might be considerable in a lighter system.\",\"PeriodicalId\":130986,\"journal\":{\"name\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"63 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE53942.2021.9648642\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An incremental ensemble learning system for Vietnamese e-commerce product classification
With the booming of e-commerce platforms, text classification models play an increasingly important role in businesses. Major challenges that businesses would face include dataset imbalance, continuously added data, language specificity and product specificity. In this paper, we propose a scalable incremental machine learning system for industrial-scale deployment in real-world business. The system also includes steps to optimize e-commerce product specifics. The proposal tactics including keyword dictionary mapping, sampling technique and ensemble learning delivered better performance when compared to models without them. Our experiments also showed that minibatch SVM produced good results and might be considerable in a lighter system.