A Pointwise Approach to Training Dependency Parsers from Partially Annotated Corpora

Q4 Computer Science

Journal of Information Processing Pub Date : 2012-09-30 DOI:10.5715/JNLP.19.167

Daniel Flannery, Yusuke Miyao, Graham Neubig, Shinsuke Mori

引用次数: 9

Abstract

We introduce a word-based dependency parser for Japanese that can be trained from partially annotated corpora, allowing for effective use of available linguistic resources and reduction of the costs of preparing new training data. This is especially important for domain adaptation in a real-world situation. We use a pointwise approach where each edge in the dependency tree for a sentence is estimated independently. Experiments on Japanese dependency parsing show that this approach allows for rapid training and achieves accuracy comparable to state-of-the-art dependency parsers trained on fully annotated data.

查看原文本刊更多论文

从部分标注语料库中训练依赖解析器的点向方法

我们为日语引入了一个基于单词的依赖解析器，它可以从部分注释的语料库中进行训练，从而有效地利用可用的语言资源并减少准备新训练数据的成本。这对于现实世界中的领域适应尤其重要。我们使用逐点方法，其中句子依赖树中的每个边都是独立估计的。在日语依赖项解析上的实验表明，这种方法允许快速训练，并且达到与在完全注释数据上训练的最先进的依赖项解析器相当的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Information Processing Computer Science-Computer Science (all)

CiteScore

1.20

自引率

0.00%

发文量