生物医学文摘中蛋白质功能信息提取的语义规则

K. Taha
{"title":"生物医学文摘中蛋白质功能信息提取的语义规则","authors":"K. Taha","doi":"10.1109/BIBM.2015.7359749","DOIUrl":null,"url":null,"abstract":"We present a classifier system called SRPFP that predicts the functions of un-annotated proteins. SRPFP aims at enhancing the state of the art of biological text mining. It analyzes biomedical texts in order to discover protein function information that is difficult to retrieve. It employs semantic rules for extracting proteins functions information from biomedical abstracts. It applies a novel model and linguistic computational techniques for extracting the functional relationship from different structural forms of terms in the sentences of biological abstracts. Specifically, SRPFP extracts phrases that represent functional relationships between proteins and molecules. These molecules usually bind to the proteins and are highly predictive of the functions of these proteins. The proposed semantic rules can identify the semantic relationship between each co-occurrence of a protein-molecule pair using the syntactic structures of sentences and linguistics theories. SRPFP represents each protein by the molecules that have high co-occurrences with the protein in biomedical abstracts. This is because such molecules are good characteristics and indicators of the functions of proteins. SRPFP measures the semantic similarity between the molecules representing an un-annotated protein p and the molecules representing annotated proteins and assigns p the functions of annotated proteins that are similar to p.","PeriodicalId":186217,"journal":{"name":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"438 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic rules for extracting proteins functions information from biomedical abstracts\",\"authors\":\"K. Taha\",\"doi\":\"10.1109/BIBM.2015.7359749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a classifier system called SRPFP that predicts the functions of un-annotated proteins. SRPFP aims at enhancing the state of the art of biological text mining. It analyzes biomedical texts in order to discover protein function information that is difficult to retrieve. It employs semantic rules for extracting proteins functions information from biomedical abstracts. It applies a novel model and linguistic computational techniques for extracting the functional relationship from different structural forms of terms in the sentences of biological abstracts. Specifically, SRPFP extracts phrases that represent functional relationships between proteins and molecules. These molecules usually bind to the proteins and are highly predictive of the functions of these proteins. The proposed semantic rules can identify the semantic relationship between each co-occurrence of a protein-molecule pair using the syntactic structures of sentences and linguistics theories. SRPFP represents each protein by the molecules that have high co-occurrences with the protein in biomedical abstracts. This is because such molecules are good characteristics and indicators of the functions of proteins. SRPFP measures the semantic similarity between the molecules representing an un-annotated protein p and the molecules representing annotated proteins and assigns p the functions of annotated proteins that are similar to p.\",\"PeriodicalId\":186217,\"journal\":{\"name\":\"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"volume\":\"438 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM.2015.7359749\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2015.7359749","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们提出了一个称为SRPFP的分类器系统,用于预测未注释蛋白的功能。SRPFP旨在提高生物文本挖掘的技术水平。它分析生物医学文本,以发现难以检索的蛋白质功能信息。它采用语义规则从生物医学摘要中提取蛋白质功能信息。它采用一种新颖的模型和语言计算技术,从生物摘要句子中不同结构形式的术语中提取功能关系。具体来说,SRPFP提取的短语表示蛋白质和分子之间的功能关系。这些分子通常与蛋白质结合,并高度预测这些蛋白质的功能。所提出的语义规则可以利用句子的句法结构和语言学理论来识别蛋白质-分子对每一个共现现象之间的语义关系。SRPFP通过在生物医学摘要中与蛋白质高度共现的分子来表示每种蛋白质。这是因为这些分子是蛋白质功能的良好特征和指示器。SRPFP测量代表未注释蛋白p和代表注释蛋白的分子之间的语义相似性,并赋予p与p相似的注释蛋白的功能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Semantic rules for extracting proteins functions information from biomedical abstracts
We present a classifier system called SRPFP that predicts the functions of un-annotated proteins. SRPFP aims at enhancing the state of the art of biological text mining. It analyzes biomedical texts in order to discover protein function information that is difficult to retrieve. It employs semantic rules for extracting proteins functions information from biomedical abstracts. It applies a novel model and linguistic computational techniques for extracting the functional relationship from different structural forms of terms in the sentences of biological abstracts. Specifically, SRPFP extracts phrases that represent functional relationships between proteins and molecules. These molecules usually bind to the proteins and are highly predictive of the functions of these proteins. The proposed semantic rules can identify the semantic relationship between each co-occurrence of a protein-molecule pair using the syntactic structures of sentences and linguistics theories. SRPFP represents each protein by the molecules that have high co-occurrences with the protein in biomedical abstracts. This is because such molecules are good characteristics and indicators of the functions of proteins. SRPFP measures the semantic similarity between the molecules representing an un-annotated protein p and the molecules representing annotated proteins and assigns p the functions of annotated proteins that are similar to p.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信