获得深度推荐器拟合:稀疏二进制输入/输出网络的Bloom嵌入

Proceedings of the Eleventh ACM Conference on Recommender Systems Pub Date : 2017-06-13 DOI:10.1145/3109859.3109876

J. Serrà, Alexandros Karatzoglou

{"title":"获得深度推荐器拟合:稀疏二进制输入/输出网络的Bloom嵌入","authors":"J. Serrà, Alexandros Karatzoglou","doi":"10.1145/3109859.3109876","DOIUrl":null,"url":null,"abstract":"Recommendation algorithms that incorporate techniques from deep learning are becoming increasingly popular. Due to the structure of the data coming from recommendation domains (i.e., one-hot-encoded vectors of item preferences), these algorithms tend to have large input and output dimensionalities that dominate their overall size. This makes them difficult to train, due to the limited memory of graphical processing units, and difficult to deploy on mobile devices with limited hardware. To address these difficulties, we propose Bloom embeddings, a compression technique that can be applied to the input and output of neural network models dealing with sparse high-dimensional binary-coded instances. Bloom embeddings are computationally efficient, and do not seriously compromise the accuracy of the model up to 1/5 compression ratios. In some cases, they even improve over the original accuracy, with relative increases up to 12%. We evaluate Bloom embeddings on 7 data sets and compare it against 4 alternative methods, obtaining favorable results. We also discuss a number of further advantages of Bloom embeddings, such as 'on-the-fly' constant-time operation, zero or marginal space requirements, training time speedups, or the fact that they do not require any change to the core model architecture or training configuration.","PeriodicalId":417173,"journal":{"name":"Proceedings of the Eleventh ACM Conference on Recommender Systems","volume":"10 8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"52","resultStr":"{\"title\":\"Getting Deep Recommenders Fit: Bloom Embeddings for Sparse Binary Input/Output Networks\",\"authors\":\"J. Serrà, Alexandros Karatzoglou\",\"doi\":\"10.1145/3109859.3109876\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recommendation algorithms that incorporate techniques from deep learning are becoming increasingly popular. Due to the structure of the data coming from recommendation domains (i.e., one-hot-encoded vectors of item preferences), these algorithms tend to have large input and output dimensionalities that dominate their overall size. This makes them difficult to train, due to the limited memory of graphical processing units, and difficult to deploy on mobile devices with limited hardware. To address these difficulties, we propose Bloom embeddings, a compression technique that can be applied to the input and output of neural network models dealing with sparse high-dimensional binary-coded instances. Bloom embeddings are computationally efficient, and do not seriously compromise the accuracy of the model up to 1/5 compression ratios. In some cases, they even improve over the original accuracy, with relative increases up to 12%. We evaluate Bloom embeddings on 7 data sets and compare it against 4 alternative methods, obtaining favorable results. We also discuss a number of further advantages of Bloom embeddings, such as 'on-the-fly' constant-time operation, zero or marginal space requirements, training time speedups, or the fact that they do not require any change to the core model architecture or training configuration.\",\"PeriodicalId\":417173,\"journal\":{\"name\":\"Proceedings of the Eleventh ACM Conference on Recommender Systems\",\"volume\":\"10 8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"52\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Eleventh ACM Conference on Recommender Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3109859.3109876\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Eleventh ACM Conference on Recommender Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3109859.3109876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 52

摘要

结合深度学习技术的推荐算法正变得越来越流行。由于来自推荐域的数据结构(即，单热编码的项目偏好向量)，这些算法倾向于具有较大的输入和输出维度，这些维度支配着它们的总体规模。由于图形处理单元的内存有限，这使得它们难以训练，并且难以在硬件有限的移动设备上部署。为了解决这些困难，我们提出了Bloom嵌入，这是一种压缩技术，可以应用于处理稀疏高维二进制编码实例的神经网络模型的输入和输出。布隆嵌入的计算效率很高，并且不会严重损害模型的精度，高达1/5的压缩比。在某些情况下，它们甚至比原始精度提高了12%。我们在7个数据集上评估了Bloom嵌入，并与4种替代方法进行了比较，获得了良好的结果。我们还讨论了Bloom嵌入的许多进一步的优点，例如“即时”恒定时间操作，零或边际空间要求，训练时间加速，或者它们不需要对核心模型体系结构或训练配置进行任何更改的事实。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Getting Deep Recommenders Fit: Bloom Embeddings for Sparse Binary Input/Output Networks

Recommendation algorithms that incorporate techniques from deep learning are becoming increasingly popular. Due to the structure of the data coming from recommendation domains (i.e., one-hot-encoded vectors of item preferences), these algorithms tend to have large input and output dimensionalities that dominate their overall size. This makes them difficult to train, due to the limited memory of graphical processing units, and difficult to deploy on mobile devices with limited hardware. To address these difficulties, we propose Bloom embeddings, a compression technique that can be applied to the input and output of neural network models dealing with sparse high-dimensional binary-coded instances. Bloom embeddings are computationally efficient, and do not seriously compromise the accuracy of the model up to 1/5 compression ratios. In some cases, they even improve over the original accuracy, with relative increases up to 12%. We evaluate Bloom embeddings on 7 data sets and compare it against 4 alternative methods, obtaining favorable results. We also discuss a number of further advantages of Bloom embeddings, such as 'on-the-fly' constant-time operation, zero or marginal space requirements, training time speedups, or the fact that they do not require any change to the core model architecture or training configuration.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Eleventh ACM Conference on Recommender Systems

自引率

0.00%

发文量