Hoai-Duc Tuan-Nguyen, Bao-Quoc Ho, Trung Bui, M. Hoang
{"title":"A Grammatically Structured Noun Phrase Extractor for Vietnamese","authors":"Hoai-Duc Tuan-Nguyen, Bao-Quoc Ho, Trung Bui, M. Hoang","doi":"10.1109/rivf.2012.6169837","DOIUrl":null,"url":null,"abstract":"Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.","PeriodicalId":115212,"journal":{"name":"2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future","volume":"PP 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/rivf.2012.6169837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.