Mining Recipes in Microblog

2013 International Conference on Asian Language Processing Pub Date : 2013-08-17 DOI:10.1109/IALP.2013.13

Shengyu Liu, Qingcai Chen, Shanshan Guan, Xiaolong Wang, Huimiao Shi

引用次数: 2

Abstract

Microblog, as an online communication platform, is becoming more and more popular. Users generate volumes of data everyday and the user generated content contains a lot of useful knowledge such as practical skills and technical expertise. This paper proposes a cross-data method to mine recipes in Microblog. In the proposed method, snippets of text relevant to recipes are firstly extracted from Baidu Encyclopedia. Secondly, the extracted snippets of text are used to train a domain-specific unigram language model. Thirdly, candidate recipes in Microblog are mined based on the unigram language model. Finally, some heuristic rules are used to identify real recipes from the candidate recipes. Experimental results show the effectiveness of the proposed method.

查看原文本刊更多论文

挖掘微博秘方

微博作为一种在线交流平台，正变得越来越受欢迎。用户每天生成大量的数据，用户生成的内容包含许多有用的知识，如实用技能和技术专长。提出了一种跨数据挖掘微博菜谱的方法。该方法首先从百度百科中提取食谱相关的文本片段。其次，将提取的文本片段用于训练特定领域的一元语言模型。第三，基于一元语言模型挖掘微博候选菜谱。最后，利用启发式规则从候选菜谱中识别出真实菜谱。实验结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 International Conference on Asian Language Processing

自引率

0.00%

发文量