Identifying Informational vs. Conversational Questions on Community Question Answering Archives

Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining Pub Date : 2018-02-02 DOI:10.1145/3159652.3159733

Ido Guy, Victor Makarenkov, Niva Hazon, L. Rokach, Bracha Shapira

引用次数: 13

Abstract

Questions on community question answering websites usually reflect one of two intents: learning information or starting a conversation. In this paper, we revisit this fundamental classification task of informational versus conversational questions, which was originally introduced and studied in 2009. We use a substantially larger dataset of archived questions from Yahoo Answers, which includes the question»s title, description, answers, and votes. We replicate the original experiments over this dataset, point out the common and different from the original results, and present a broad set of characteristics that distinguish the two question types. We also develop new classifiers that make use of additional data types, advanced machine learning, and a large dataset of unlabeled data, which achieve enhanced performance.

查看原文本刊更多论文

在社区问答档案中识别信息问题与会话问题

社区问答网站上的问题通常反映了两种目的之一:学习信息或开始对话。在本文中，我们重新审视了2009年首次引入和研究的信息与会话问题的基本分类任务。我们使用了一个更大的来自雅虎答案的存档问题数据集，其中包括问题的标题、描述、答案和投票。我们在这个数据集上复制了原始实验，指出了与原始结果的共同点和不同点，并提出了一套广泛的特征来区分这两种问题类型。我们还开发了新的分类器，这些分类器利用了额外的数据类型、高级机器学习和大量未标记数据集，从而提高了性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

自引率

0.00%

发文量