Word Sense Discrimination on tweets: A graph-based approach

2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K) Pub Date : 2015-11-12 DOI:10.5220/0005640501380146

F. M. Cecchini, E. Fersini, E. Messina

引用次数: 6

Abstract

In this paper we are going to detail an unsupervised, graph-based approach for word sense discrimination on tweets. We deal with this problem by constructing a word graph of co-occurrences. By defining a distance on this graph, we obtain a word metric space, on which we can apply an aggregative algorithm for word clustering. As a result, we will get word clusters representing contexts that discriminate the possible senses of a term. We present some experimental results both on a data set consisting of tweets we collected and on the data set of task 14 at SemEval-2010.

查看原文本刊更多论文

推文的词义辨析:基于图的方法

在本文中，我们将详细介绍一种无监督的、基于图的方法，用于tweet上的词义辨别。我们通过构建共现词图来处理这个问题。通过在这个图上定义一个距离，我们得到一个词度量空间，在这个空间上我们可以应用聚类算法进行词聚类。因此，我们将得到表示上下文的词簇，这些上下文区分了一个术语的可能含义。我们在一个由我们收集的tweet组成的数据集和SemEval-2010上的任务14的数据集上展示了一些实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K)

自引率

0.00%

发文量