Vietnamese Noun Phrase Chunking Based on Conditional Random Fields

2009 International Conference on Knowledge and Systems Engineering. Pub Date: 2009-10-13. DOI: 10.1109/KSE.2009.43

Nguyen Thi Huong Thao, N. Thai, Nguyen Le Minh, Hà Quang Thụy

Citations: 11

Abstract

Noun phrase chunking is an important and useful task in many natural language processing applications. It has been well studied for English; for Vietnamese, however, it remains an open problem. This paper presents an approach to Vietnamese noun phrase chunking based on conditional random field (CRF) models. We also describe a method for building a Vietnamese corpus from a set of hand-annotated sentences. For evaluation, we perform several experiments using different feature settings. Results on our corpus show high performance, with average recall and precision of 82.72% and 82.62%, respectively.
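
The recall and precision figures above are chunk-level scores. As a minimal sketch of how such scores are commonly computed, the Python below assumes a B-NP / I-NP / O tag encoding and exact-match scoring of noun phrase spans. The abstract does not state the tagging scheme or the evaluation script, so both are assumptions, and the function names `np_spans` and `precision_recall` are hypothetical.

```python
from typing import List, Set, Tuple

Span = Tuple[int, int]  # (start, end) token indices, end exclusive


def np_spans(tags: List[str]) -> Set[Span]:
    """Collect noun phrase spans from a B-NP / I-NP / O tag sequence.

    Assumption: a stray I-NP that does not follow a chunk is treated
    leniently as the start of a new chunk.
    """
    spans: Set[Span] = set()
    start = None
    for i, tag in enumerate(tags):
        # Any tag other than I-NP closes the chunk that is currently open.
        if tag != "I-NP" and start is not None:
            spans.add((start, i))
            start = None
        # B-NP always opens a chunk; I-NP opens one only if none is open.
        if tag == "B-NP" or (tag == "I-NP" and start is None):
            start = i
    if start is not None:
        spans.add((start, len(tags)))
    return spans


def precision_recall(gold: List[str], pred: List[str]) -> Tuple[float, float]:
    """Exact-match chunk precision and recall for one tag sequence."""
    gold_spans, pred_spans = np_spans(gold), np_spans(pred)
    correct = len(gold_spans & pred_spans)
    precision = correct / len(pred_spans) if pred_spans else 0.0
    recall = correct / len(gold_spans) if gold_spans else 0.0
    return precision, recall


# Illustrative tag sequences only, not data from the paper.
gold = ["B-NP", "I-NP", "O", "B-NP", "I-NP", "O"]
pred = ["B-NP", "I-NP", "O", "B-NP", "O", "B-NP"]
p, r = precision_recall(gold, pred)
print(f"precision={p:.3f} recall={r:.3f}")
```

A predicted chunk counts as correct here only if both of its boundaries match a gold chunk, so a partially overlapping chunk lowers precision and recall alike.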

