Kwangmin Kim, Sejoon Lee, Kyunghyun Park, Dongjin Jang, Doheon Lee
{"title":"Mining Context-Specific Rules from the Literature for Virtual Human Model Simulation","authors":"Kwangmin Kim, Sejoon Lee, Kyunghyun Park, Dongjin Jang, Doheon Lee","doi":"10.1145/2665970.2665987","DOIUrl":null,"url":null,"abstract":"Computer-based virtual human model is believed to be the promising solution for drug response identification. Literature mining is competitive method to extract those biological rules for human model simulation, since existing public databases provide only limited amount of information applicable for the simulation. Here we propose the method for mining context-specific rules from the literature, for future application to virtual human model simulation. Integrating the existing biological databases, we have constructed formalized ontology. From the PubMed literature, we have tagged 11 distinct types of biological entities using both of conditional random field (CRF) and dictionary based Named Entity Recognition (NER). Recognized named entities were normalized and mapped to formalized ontology. Context-specific biological rules between named entities, characterized by increase/decrease features, were extracted by pattern-based method utilizing regular expression. As the result, we have obtained the organ-context specific biological rules. Further researches on enhanced rule and context extraction will be followed.","PeriodicalId":143937,"journal":{"name":"Data and Text Mining in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and Text Mining in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2665970.2665987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Computer-based virtual human model is believed to be the promising solution for drug response identification. Literature mining is competitive method to extract those biological rules for human model simulation, since existing public databases provide only limited amount of information applicable for the simulation. Here we propose the method for mining context-specific rules from the literature, for future application to virtual human model simulation. Integrating the existing biological databases, we have constructed formalized ontology. From the PubMed literature, we have tagged 11 distinct types of biological entities using both of conditional random field (CRF) and dictionary based Named Entity Recognition (NER). Recognized named entities were normalized and mapped to formalized ontology. Context-specific biological rules between named entities, characterized by increase/decrease features, were extracted by pattern-based method utilizing regular expression. As the result, we have obtained the organ-context specific biological rules. Further researches on enhanced rule and context extraction will be followed.