多核fm:一种用于CTR预测的多嵌入核分解机器框架

Yijun Wang, Kaibo Xu
{"title":"多核fm:一种用于CTR预测的多嵌入核分解机器框架","authors":"Yijun Wang, Kaibo Xu","doi":"10.1109/ICKG55886.2022.00043","DOIUrl":null,"url":null,"abstract":"Click-Through Rate (CTR) Prediction is one of the most critical components in recommender systems, where the task is to estimate the probability that a user clicks an item. In CTR models, embedding methods are widely used in feature representation to map categorical features into lower dimensional vectors, and thus those representations can be further exploited by various machine learning algorithms such as Factorization Machines (FMs) for CTR prediction. However, in the literature, most existing embedding models can only extract one latent vector for each individual feature as they calculate the feature interaction based on simple element product or inner product, limiting its ability to model user-item interactions in a high-dimensional space. It may miss some deep and complex interacted latent features, and therefore lead to a less proper representation as well as an inaccurate prediction. Motivated by the status quo, in this paper, we therefore propose a novel Multi-Kernel-FM (MKFM) framework for the task of CTR prediction. First of all, an embedding-based approach called Multi-FM (MFM) is proposed. It uses multiple embedding strategy and considers multiple representation sub-spaces for representing user-item features. After that, we construct a MKFM framework which combines kernel function and MFM to capture non-linear feature interactions. Then, the concept of kernel function is introduced and employed for capturing more high-dimensional feature interactions to further improve prediction accuracy. The results of our experiments on four public datasets demonstrate the superiorities of the proposed framework to some existing methods with respect to both prediction accuracy and training cost.","PeriodicalId":278067,"journal":{"name":"2022 IEEE International Conference on Knowledge Graph (ICKG)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Kernel-FM: A Multi-Embedding & Kernelization Factorization Machine Framework for CTR Prediction\",\"authors\":\"Yijun Wang, Kaibo Xu\",\"doi\":\"10.1109/ICKG55886.2022.00043\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Click-Through Rate (CTR) Prediction is one of the most critical components in recommender systems, where the task is to estimate the probability that a user clicks an item. In CTR models, embedding methods are widely used in feature representation to map categorical features into lower dimensional vectors, and thus those representations can be further exploited by various machine learning algorithms such as Factorization Machines (FMs) for CTR prediction. However, in the literature, most existing embedding models can only extract one latent vector for each individual feature as they calculate the feature interaction based on simple element product or inner product, limiting its ability to model user-item interactions in a high-dimensional space. It may miss some deep and complex interacted latent features, and therefore lead to a less proper representation as well as an inaccurate prediction. Motivated by the status quo, in this paper, we therefore propose a novel Multi-Kernel-FM (MKFM) framework for the task of CTR prediction. First of all, an embedding-based approach called Multi-FM (MFM) is proposed. It uses multiple embedding strategy and considers multiple representation sub-spaces for representing user-item features. After that, we construct a MKFM framework which combines kernel function and MFM to capture non-linear feature interactions. Then, the concept of kernel function is introduced and employed for capturing more high-dimensional feature interactions to further improve prediction accuracy. The results of our experiments on four public datasets demonstrate the superiorities of the proposed framework to some existing methods with respect to both prediction accuracy and training cost.\",\"PeriodicalId\":278067,\"journal\":{\"name\":\"2022 IEEE International Conference on Knowledge Graph (ICKG)\",\"volume\":\"86 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Knowledge Graph (ICKG)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICKG55886.2022.00043\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Knowledge Graph (ICKG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKG55886.2022.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

点击率(CTR)预测是推荐系统中最关键的组成部分之一,其任务是估计用户点击某项内容的概率。在CTR模型中,嵌入方法被广泛用于特征表示,将分类特征映射到较低维向量中,因此这些表征可以进一步被各种机器学习算法(如Factorization Machines (FMs))用于CTR预测。然而,在文献中,大多数现有的嵌入模型只能为每个单独的特征提取一个潜在向量,因为它们基于简单的元素积或内积计算特征交互,限制了其在高维空间中模拟用户-物品交互的能力。它可能会遗漏一些深层复杂的相互作用的潜在特征,从而导致表征不恰当,预测不准确。因此,本文提出了一种新的多核调频(MKFM)框架,用于CTR预测任务。首先,提出了一种基于嵌入的多调频(MFM)方法。它采用多重嵌入策略,并考虑多个表示子空间来表示用户项特征。在此基础上,我们构建了一个结合核函数和MFM的MKFM框架来捕获非线性特征交互。然后,引入核函数的概念,利用核函数捕获更多高维特征交互,进一步提高预测精度。我们在四个公共数据集上的实验结果表明,所提出的框架在预测精度和训练成本方面优于现有的一些方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Kernel-FM: A Multi-Embedding & Kernelization Factorization Machine Framework for CTR Prediction
Click-Through Rate (CTR) Prediction is one of the most critical components in recommender systems, where the task is to estimate the probability that a user clicks an item. In CTR models, embedding methods are widely used in feature representation to map categorical features into lower dimensional vectors, and thus those representations can be further exploited by various machine learning algorithms such as Factorization Machines (FMs) for CTR prediction. However, in the literature, most existing embedding models can only extract one latent vector for each individual feature as they calculate the feature interaction based on simple element product or inner product, limiting its ability to model user-item interactions in a high-dimensional space. It may miss some deep and complex interacted latent features, and therefore lead to a less proper representation as well as an inaccurate prediction. Motivated by the status quo, in this paper, we therefore propose a novel Multi-Kernel-FM (MKFM) framework for the task of CTR prediction. First of all, an embedding-based approach called Multi-FM (MFM) is proposed. It uses multiple embedding strategy and considers multiple representation sub-spaces for representing user-item features. After that, we construct a MKFM framework which combines kernel function and MFM to capture non-linear feature interactions. Then, the concept of kernel function is introduced and employed for capturing more high-dimensional feature interactions to further improve prediction accuracy. The results of our experiments on four public datasets demonstrate the superiorities of the proposed framework to some existing methods with respect to both prediction accuracy and training cost.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信