{"title":"Enriching non-taxonomic relations extracted from domain texts","authors":"N. Nabila, Ali Mamat, M. Azmi-Murad, N. Mustapha","doi":"10.1109/STAIR.2011.5995772","DOIUrl":null,"url":null,"abstract":"Extracting non-taxonomic relations is one of the important tasks in the construction of ontology from the text. Most of current methods on identification and extraction of non-taxonomic relations is based on predicate representing relationships between two concepts, namely the relation between subject and object that occurs in a sentence. However, the number of relations that has been identified does not properly represent the domain as the methods only identify a portion of the total relations from domain texts. In this paper, we present a method that increases the number of relations extracted and thus properly represent the domain. In this method, all potential relations are first generated and then less significant ones, based on their frequency, are removed. The method has been tested on a collection of texts that described electronic voting machine and the result is encouraging.","PeriodicalId":376671,"journal":{"name":"2011 International Conference on Semantic Technology and Information Retrieval","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Semantic Technology and Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STAIR.2011.5995772","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Extracting non-taxonomic relations is one of the important tasks in the construction of ontology from the text. Most of current methods on identification and extraction of non-taxonomic relations is based on predicate representing relationships between two concepts, namely the relation between subject and object that occurs in a sentence. However, the number of relations that has been identified does not properly represent the domain as the methods only identify a portion of the total relations from domain texts. In this paper, we present a method that increases the number of relations extracted and thus properly represent the domain. In this method, all potential relations are first generated and then less significant ones, based on their frequency, are removed. The method has been tested on a collection of texts that described electronic voting machine and the result is encouraging.