Learning Relations and Information Extraction Rules for Protein Annotation
Jee Hyub Kim, M. Hilario
21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07), 2007-05-21
DOI: 10.1109/AINAW.2007.220
Abstract
Protein annotation is the task of describing protein X in terms of topic Y. Until now, most protein annotation work has been done manually by human annotators. However, as the number of biomedical papers grows ever more rapidly, manual annotation becomes difficult, and there is an increasing need to automate the protein annotation process. Recently, Information Extraction (IE) has been used to solve this problem. Typically, IE requires pre-defined relations and either hand-crafted IE rules or annotated corpora, and these requirements are difficult to satisfy in real-world domains such as the biomedical domain. In this paper, we describe an IE system that requires only sentences labeled by domain experts as relevant or not relevant to a given topic.