MASCOT: A Quantization Framework for Efficient Matrix Factorization in Recommender Systems

2021 IEEE International Conference on Data Mining (ICDM) Pub Date : 2021-12-01 DOI:10.1109/ICDM51629.2021.00039

Yunyong Ko, Jae-Seo Yu, Hong-Kyun Bae, Y. Park, Dongwon Lee, Sang-Wook Kim

{"title":"MASCOT: A Quantization Framework for Efficient Matrix Factorization in Recommender Systems","authors":"Yunyong Ko, Jae-Seo Yu, Hong-Kyun Bae, Y. Park, Dongwon Lee, Sang-Wook Kim","doi":"10.1109/ICDM51629.2021.00039","DOIUrl":null,"url":null,"abstract":"In recent years, quantization methods have successfully accelerated the training of large deep neural network (DNN) models by reducing the level of precision in computing operations (e.g., forward/backward passes) without sacrificing its accuracy. In this work, therefore, we attempt to apply such a quantization idea to the popular Matrix factorization (MF) methods to deal with the growing scale of models and datasets in recommender systems. However, to our dismay, we observe that the state-of-the-art quantization methods are not effective in the training of MF models, unlike their successes in the training of DNN models. To this phenomenon, we posit that two distinctive features in training MF models could explain the difference: (i) the training of MF models is much more memory-intensive than that of DNN models, and (ii) the quantization errors across users and items in recommendation are not uniform. From these observations, we develop a quantization framework for MF models, named MASCOT, employing novel strategies (i.e., m-quantization and g-switching) to successfully address the aforementioned limitations of quantization in the training of MF models. The comprehensive evaluation using four real-world datasets demonstrates that MASCOT improves the training performance of MF models by about 45%, compared to the training without quantization, while maintaining low model errors, and the strategies and implementation optimizations of MASCOT are quite effective in the training of MF models. For the detailed information about MASCOT, we release the code of MASCOT and the datasets at: https://github.com/Yujaeseo/lCDM-2021_MASCOT.","PeriodicalId":320970,"journal":{"name":"2021 IEEE International Conference on Data Mining (ICDM)","volume":"165 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Data Mining (ICDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM51629.2021.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

In recent years, quantization methods have successfully accelerated the training of large deep neural network (DNN) models by reducing the level of precision in computing operations (e.g., forward/backward passes) without sacrificing its accuracy. In this work, therefore, we attempt to apply such a quantization idea to the popular Matrix factorization (MF) methods to deal with the growing scale of models and datasets in recommender systems. However, to our dismay, we observe that the state-of-the-art quantization methods are not effective in the training of MF models, unlike their successes in the training of DNN models. To this phenomenon, we posit that two distinctive features in training MF models could explain the difference: (i) the training of MF models is much more memory-intensive than that of DNN models, and (ii) the quantization errors across users and items in recommendation are not uniform. From these observations, we develop a quantization framework for MF models, named MASCOT, employing novel strategies (i.e., m-quantization and g-switching) to successfully address the aforementioned limitations of quantization in the training of MF models. The comprehensive evaluation using four real-world datasets demonstrates that MASCOT improves the training performance of MF models by about 45%, compared to the training without quantization, while maintaining low model errors, and the strategies and implementation optimizations of MASCOT are quite effective in the training of MF models. For the detailed information about MASCOT, we release the code of MASCOT and the datasets at: https://github.com/Yujaeseo/lCDM-2021_MASCOT.

查看原文本刊更多论文

推荐系统中高效矩阵分解的量化框架

近年来，量化方法在不牺牲其准确性的情况下，通过降低计算操作(例如向前/向后传递)的精度水平，成功地加速了大型深度神经网络(DNN)模型的训练。因此，在这项工作中，我们试图将这种量化思想应用于流行的矩阵分解(MF)方法，以处理推荐系统中不断增长的模型和数据集。然而，令我们沮丧的是，我们观察到最先进的量化方法在MF模型的训练中并不有效，不像它们在DNN模型的训练中所取得的成功。对于这一现象，我们认为训练MF模型的两个显著特征可以解释这一差异:(i) MF模型的训练比DNN模型的训练内存密集得多，(ii)推荐中用户和项目之间的量化误差不均匀。根据这些观察结果，我们开发了一个MF模型的量化框架，名为MASCOT，采用新颖的策略(即m量化和g切换)来成功解决上述量化在MF模型训练中的局限性。利用4个真实数据集进行的综合评价表明，与未量化训练相比，MASCOT在保持较低模型误差的同时，将MF模型的训练性能提高了约45%，并且MASCOT的策略和实现优化在MF模型的训练中是非常有效的。关于吉祥物的详细信息，我们在:https://github.com/Yujaeseo/lCDM-2021_MASCOT上发布了吉祥物的代码和数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Conference on Data Mining (ICDM)

自引率

0.00%

发文量