Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata

NUT@EMNLP Pub Date : 2018-11-01 DOI:10.18653/v1/W18-6119

Steven Xu, Andrew Bennett, D. Hoogeveen, Jey Han Lau, Timothy Baldwin

引用次数: 4

Abstract

Community question answering (cQA) forums provide a rich source of data for facilitating non-factoid question answering over many technical domains. Given this, there is considerable interest in answer retrieval from these kinds of forums. However this is a difficult task as the structure of these forums is very rich, and both metadata and text features are important for successful retrieval. While there has recently been a lot of work on solving this problem using deep learning models applied to question/answer text, this work has not looked at how to make use of the rich metadata available in cQA forums. We propose an attention-based model which achieves state-of-the-art results for text-based answer selection alone, and by making use of complementary meta-data, achieves a substantially higher result over two reference datasets novel to this work.

查看原文本刊更多论文

堆栈溢出中的首选答案选择:更好的文本表示…元数据，元数据，元数据

社区问答(cQA)论坛为促进许多技术领域的非事实性问答提供了丰富的数据源。鉴于此，人们对从这类论坛中检索答案非常感兴趣。然而，这是一项艰巨的任务，因为这些论坛的结构非常丰富，元数据和文本特征对于成功检索都很重要。虽然最近有很多工作是通过将深度学习模型应用于问答文本来解决这个问题，但这些工作并没有关注如何利用cQA论坛中可用的丰富元数据。我们提出了一个基于注意力的模型，该模型仅在基于文本的答案选择中获得了最先进的结果，并且通过使用互补的元数据，在两个参考数据集上获得了更高的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NUT@EMNLP

自引率

0.00%

发文量