{"title":"Head/modifier pairs for everyone","authors":"C. Koster","doi":"10.1145/860435.860557","DOIUrl":null,"url":null,"abstract":"The “English Phrases for IR” (EP4IR) grammar is a grammar of English concentrating on the description of the noun phrase and the verb phrase. The grammar is provided with a large lexicon, providing detailed Part-Of-Speech information. It is quite robust against badly formed input and unknown words and generates only the most probable analysis. The EP4IR grammar and lexicon are released along with the AGFL system [1], which is the first parser-generator for linguistic applications available under the GNU Public License. The parsers generated from it fall under the LGPL, so that they can be used for scientific and even commercial applications. From the EP4IR grammar and lexicon, an English parser can be generated automatically using the AGFL system, which produces as its output not parse trees but Head/ Modifier trees, binary dependency trees that can be unnested to Head/Modifier pairs. In this transduction process, the HM trees are syntactically normalized. The pairs generated when parsing some text represent only the major relations expressed in the text [2]:","PeriodicalId":209809,"journal":{"name":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/860435.860557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The “English Phrases for IR” (EP4IR) grammar is a grammar of English concentrating on the description of the noun phrase and the verb phrase. The grammar is provided with a large lexicon, providing detailed Part-Of-Speech information. It is quite robust against badly formed input and unknown words and generates only the most probable analysis. The EP4IR grammar and lexicon are released along with the AGFL system [1], which is the first parser-generator for linguistic applications available under the GNU Public License. The parsers generated from it fall under the LGPL, so that they can be used for scientific and even commercial applications. From the EP4IR grammar and lexicon, an English parser can be generated automatically using the AGFL system, which produces as its output not parse trees but Head/ Modifier trees, binary dependency trees that can be unnested to Head/Modifier pairs. In this transduction process, the HM trees are syntactically normalized. The pairs generated when parsing some text represent only the major relations expressed in the text [2]: