Reducing semantic drift in bootstrapping for entity relation extraction
Chen Sijia, Li Yan, Chen Guang
Proceedings 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC), 2013-12-01
DOI: https://doi.org/10.1109/MEC.2013.6885371
Citations: 1
Abstract
This paper presents a novel bootstrapping algorithm for entity relation extraction. The shortest dependency patterns connecting entity pairs in sentences are captured first and then applied to extract new binary relations. The patterns are evaluated through correlation detection. In addition, we prevent semantic drift by co-training with trigger words. Slot-filling experiments on the Knowledge Base Population (KBP) newspaper corpora show that our enhanced bootstrapping system achieves an 11% F1-score improvement over a traditional bootstrapping algorithm.
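The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that the shortest dependency path between each entity pair has already been extracted by a parser and is given as a tuple of words. The pattern score (the fraction of a pattern's matches that are already-known pairs) is a simple stand-in for the paper's "correlation detection", and the trigger-word check is a plain filter rather than full co-training. All names (`MENTIONS`, `pattern_score`, `bootstrap`, the thresholds) are hypothetical.

```python
# Each mention is (entity_a, entity_b, path), where path is the tuple of words
# on the shortest dependency path between the two entities. The paths are
# assumed to be precomputed; this toy data is invented for illustration.
MENTIONS = [
    ("Alice", "Acme", ("works", "for")),
    ("Bob", "Initech", ("works", "for")),
    ("Carol", "Globex", ("employed", "by")),
    ("Bob", "Initech", ("employed", "by")),
    ("Alice", "Paris", ("lives", "in")),
    ("Alice", "Acme", ("lives", "in")),  # noisy mention that invites drift
    ("Dave", "Berlin", ("lives", "in")),
]


def pattern_score(pattern, known_pairs, mentions):
    """Fraction of the pattern's matches that are already-known pairs.

    Assumption: a stand-in for the paper's correlation-based evaluation.
    """
    matched = [(a, b) for a, b, p in mentions if p == pattern]
    if not matched:
        return 0.0
    hits = sum(1 for pair in matched if pair in known_pairs)
    return hits / len(matched)


def bootstrap(mentions, seeds, triggers=None, threshold=0.3, rounds=3):
    """Alternate between learning patterns and extracting new entity pairs.

    If `triggers` is given, a pattern is kept only when it contains at least
    one trigger word (a simplification of co-training with trigger words).
    """
    known = set(seeds)
    accepted = set()
    for _ in range(rounds):
        # Patterns that connect at least one known pair are candidates.
        candidates = {p for a, b, p in mentions if (a, b) in known}
        new_patterns = set()
        for p in candidates - accepted:
            if triggers is not None and not (set(p) & triggers):
                continue  # no trigger word: likely a different relation
            if pattern_score(p, known, mentions) >= threshold:
                new_patterns.add(p)
        if not new_patterns:
            break
        accepted |= new_patterns
        # Every pair matched by an accepted pattern becomes a known pair.
        known |= {(a, b) for a, b, p in mentions if p in accepted}
    return known, accepted


seeds = {("Alice", "Acme")}
pairs, patterns = bootstrap(MENTIONS, seeds, triggers={"works", "employed"})
drifted_pairs, _ = bootstrap(MENTIONS, seeds)  # no trigger filter
```

In this toy run, the unfiltered loop accepts the `("lives", "in")` pattern because one noisy sentence links it to the seed pair, so residence pairs such as `("Alice", "Paris")` leak into an employment relation. With the trigger filter, only the employment patterns survive.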