Phrase-based clause extraction for open information extraction system
A. Romadhony, D. H. Widyantoro, A. Purwarianti
2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)
Published: 2015-10-01
DOI: 10.1109/ICACSIS.2015.7415184 (https://doi.org/10.1109/ICACSIS.2015.7415184)
Citation count: 4
Abstract
The recent growth in the variety and volume of information circulating on the Internet has prompted the emergence of a new paradigm in information extraction, namely Open Information Extraction (Open IE). An evaluation of several existing Open IE systems shows good precision, but recall still needs improvement. Open IE systems detect a relation between an entity pair more easily in a simple sentence than in a complex one. In this paper, we propose a clause extraction approach that employs phrase features, requires no learning, and focuses on the entity pair. The proposed approach has a lower computational cost than previous work that employs deep parse features or requires learning. The experimental results show that extracting simpler clauses improves the performance of an Open IE system. The average of the best F-measures achieved in the evaluation on three benchmark datasets is 0.62, which outperforms the previous work.
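
The abstract reports an "average of best F-measure" over three benchmark datasets without saying how it is computed. A minimal sketch of one plausible reading follows, assuming the balanced F-measure (F1, the harmonic mean of precision and recall) and a plain arithmetic mean of the best score per dataset. The function names `f_measure` and `average_best_f` are hypothetical, and none of this is confirmed as the authors' exact procedure.

```python
from typing import Iterable, Sequence, Tuple


def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall.

    beta = 1.0 gives the balanced F1 score (assumed here; the abstract
    does not state which beta was used).
    """
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denom = beta ** 2 * precision + recall
    if denom == 0:
        # Both precision and recall are zero: define the score as 0.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denom


def average_best_f(
    per_dataset: Sequence[Iterable[Tuple[float, float]]],
) -> float:
    """Average, over datasets, of the best F-measure in each dataset.

    per_dataset holds one collection of (precision, recall) pairs per
    dataset, e.g. one pair per system configuration (an assumption).
    """
    best = [max(f_measure(p, r) for p, r in configs) for configs in per_dataset]
    return sum(best) / len(best)
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a system with high precision but low recall still gets a modest F-measure, which is why improving recall matters for the overall score.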