A Grammatically Structured Noun Phrase Extractor for Vietnamese

Hoai-Duc Tuan-Nguyen, Bao-Quoc Ho, Trung Bui, M. Hoang
{"title":"A Grammatically Structured Noun Phrase Extractor for Vietnamese","authors":"Hoai-Duc Tuan-Nguyen, Bao-Quoc Ho, Trung Bui, M. Hoang","doi":"10.1109/rivf.2012.6169837","DOIUrl":null,"url":null,"abstract":"Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.","PeriodicalId":115212,"journal":{"name":"2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future","volume":"PP 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/rivf.2012.6169837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.
越南语语法结构名词短语提取器
名词短语(NP)提取是任何自然语言处理(NLP)系统的重要组成部分。然而,如果系统还能解析提取的np的语法结构,那就更好了。语法结构化NP (GSNP)在概念索引、句法变体生成、嵌套NP识别等研究领域具有重要意义。本文介绍了一个从越南语文档中提取NP的系统,并将每个NP解析成代表其语法结构的树。一方面,这些树可以保存为XML文档,另一方面,一些特定的Java类可以从这些XML文档中加载这些树。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信