Jian Wang, Qian Xu, Hongfei Lin, Zhihao Yang, Yanpeng Li
{"title":"Combining labeled and unlabeled data for biomédical event extraction","authors":"Jian Wang, Qian Xu, Hongfei Lin, Zhihao Yang, Yanpeng Li","doi":"10.1109/BIBMW.2012.6470206","DOIUrl":null,"url":null,"abstract":"In biomédical event extraction domain, there is a small amount of labeled data along with a large pool of unlabeled data. Many supervised learning algorithms for bio-event extraction have been affected by the data sparseness. In this paper, we present a new solution to perform biomédical event extraction from scientific documents, applying a semi-supervised approach to extract features from unlabeled data using labeled data features as a reference. This strategy is evaluated via experiments in which the data from the BioNLP2011 and PubMed are applied. To the best of our knowledge, it is the first time that the combination of labeled and unlabeled data are used for biomédical event extraction and our experimental results demonstrate the state-of-the-art performance in this task.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2012.6470206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In biomédical event extraction domain, there is a small amount of labeled data along with a large pool of unlabeled data. Many supervised learning algorithms for bio-event extraction have been affected by the data sparseness. In this paper, we present a new solution to perform biomédical event extraction from scientific documents, applying a semi-supervised approach to extract features from unlabeled data using labeled data features as a reference. This strategy is evaluated via experiments in which the data from the BioNLP2011 and PubMed are applied. To the best of our knowledge, it is the first time that the combination of labeled and unlabeled data are used for biomédical event extraction and our experimental results demonstrate the state-of-the-art performance in this task.