Recipe Popularity Prediction with Deep Visual-Semantic Fusion

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management Pub Date : 2017-11-06 DOI:10.1145/3132847.3133137

Satoshi Sanjo, Marie Katsurai

引用次数: 23

Abstract

Predicting the popularity of user-created recipes has great potential to be adopted in several applications on recipe-sharing websites. To ensure timely prediction when a recipe is uploaded, a prediction model needs to be trained based on the recipe's content features (i.e., its visual and semantic features). This paper presents a novel approach to predicting recipe popularity using deep visual-semantic fusion. We first pre-train a deep model that predicts the popularity of recipes based on each single modality. We insert additional layers to the two models and concatenate their activations. Finally, we train a network comprising fully connected (FC) layers on the fused features to learn more powerful features, which are used for training a regressor. Based on experiments conducted on more than 150K recipes collected from the Cookpad website, we present a comprehensive comparison with several baselines to verify the effectiveness of our method. The best practice for the proposed method is also described.

查看原文本刊更多论文

基于深度视觉语义融合的配方流行度预测

预测用户创建的食谱的受欢迎程度在食谱分享网站上的几个应用程序中具有很大的潜力。为了确保在菜谱上传时及时预测，需要根据菜谱的内容特征(即其视觉和语义特征)训练预测模型。本文提出了一种基于深度视觉语义融合的食谱流行度预测方法。我们首先预训练一个深度模型，该模型基于每种单一模态预测食谱的受欢迎程度。我们向两个模型插入额外的层，并连接它们的激活。最后，我们在融合的特征上训练一个由全连接层(FC)组成的网络，以学习更强大的特征，这些特征用于训练回归器。通过对Cookpad网站上收集的15万多份食谱进行实验，我们与几个基线进行了全面的比较，以验证我们方法的有效性。本文还描述了该方法的最佳实践。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

自引率

0.00%

发文量