基于二级结构元素几何和生物学特征显著性的蛋白质基序检索

V. Cantoni, M. Ferretti, Nicola Pellicanò, Jennifer Vandoni, M. Musci, Nahumi Nugrahaningsih
{"title":"基于二级结构元素几何和生物学特征显著性的蛋白质基序检索","authors":"V. Cantoni, M. Ferretti, Nicola Pellicanò, Jennifer Vandoni, M. Musci, Nahumi Nugrahaningsih","doi":"10.1109/DEXA.2014.22","DOIUrl":null,"url":null,"abstract":"This paper presents an approach to detect the presence of a given motif in proteins or in protein data bank (PDB). The approach is based on the secondary structure elements (SSEs) geometrical arrangement in 3D space. A motif is represented as a set of SSEs in their specific positions related to a local reference system (LRS). We propose, exploiting the SSE biological feature saliency in the motif LRS construction stage, a planning strategy to speed-up the motif retrieval process. The experimentation has been carried out on a set of 20 proteins selected from the PDB. In detail we tested five different cases: (i) performances on searching a motif within single proteins, (ii) searching motifs on a set of proteins belonging to the same biological family, (iii) searching into single symmetric proteins, (iv) searching on a set of symmetric proteins from the same family, and finally (v) a general motif retrieval from the entire protein dataset. The experimental results showed good motif recognition performances on each test category, and, by exploiting the basic biological features saliency in motif construction, comparing to a previous approach of SSEs block geometrical retrieval based on the Generalized Hough Transform, it was revealed a significant decrease of the time/space computational complexity. It is worth to point out that the computation time for the case of motif absence is significantly lower than the case of motif present.","PeriodicalId":291899,"journal":{"name":"2014 25th International Workshop on Database and Expert Systems Applications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Protein Motif Retrieval by Secondary Structure Element Geometry and Biological Features Saliency\",\"authors\":\"V. Cantoni, M. Ferretti, Nicola Pellicanò, Jennifer Vandoni, M. Musci, Nahumi Nugrahaningsih\",\"doi\":\"10.1109/DEXA.2014.22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an approach to detect the presence of a given motif in proteins or in protein data bank (PDB). The approach is based on the secondary structure elements (SSEs) geometrical arrangement in 3D space. A motif is represented as a set of SSEs in their specific positions related to a local reference system (LRS). We propose, exploiting the SSE biological feature saliency in the motif LRS construction stage, a planning strategy to speed-up the motif retrieval process. The experimentation has been carried out on a set of 20 proteins selected from the PDB. In detail we tested five different cases: (i) performances on searching a motif within single proteins, (ii) searching motifs on a set of proteins belonging to the same biological family, (iii) searching into single symmetric proteins, (iv) searching on a set of symmetric proteins from the same family, and finally (v) a general motif retrieval from the entire protein dataset. The experimental results showed good motif recognition performances on each test category, and, by exploiting the basic biological features saliency in motif construction, comparing to a previous approach of SSEs block geometrical retrieval based on the Generalized Hough Transform, it was revealed a significant decrease of the time/space computational complexity. It is worth to point out that the computation time for the case of motif absence is significantly lower than the case of motif present.\",\"PeriodicalId\":291899,\"journal\":{\"name\":\"2014 25th International Workshop on Database and Expert Systems Applications\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 25th International Workshop on Database and Expert Systems Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEXA.2014.22\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 25th International Workshop on Database and Expert Systems Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.2014.22","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

本文提出了一种检测蛋白质或蛋白质数据库(PDB)中给定基序存在的方法。该方法基于三维空间中二级结构元素的几何排列。一个基序被表示为一组与局部参考系统(LRS)相关的特定位置的sse。我们提出了一种利用基序LRS构建阶段SSE生物特征显著性来加快基序检索过程的规划策略。实验是在从PDB中选择的20种蛋白质上进行的。我们详细测试了五种不同的情况:(i)在单个蛋白质中搜索基序的性能,(ii)在属于同一生物家族的一组蛋白质上搜索基序,(iii)在单个对称蛋白质中搜索,(iv)在同一家族的一组对称蛋白质上搜索,最后(v)从整个蛋白质数据集中检索一般基序。实验结果表明,该方法在每个测试类别上都具有良好的基序识别性能,并且利用基序构建的基本生物学特征显著性,与以往基于广义霍夫变换的SSEs块几何检索方法相比,显著降低了时间/空间计算复杂度。值得指出的是,母基缺失情况下的计算时间明显低于母基存在情况下的计算时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Protein Motif Retrieval by Secondary Structure Element Geometry and Biological Features Saliency
This paper presents an approach to detect the presence of a given motif in proteins or in protein data bank (PDB). The approach is based on the secondary structure elements (SSEs) geometrical arrangement in 3D space. A motif is represented as a set of SSEs in their specific positions related to a local reference system (LRS). We propose, exploiting the SSE biological feature saliency in the motif LRS construction stage, a planning strategy to speed-up the motif retrieval process. The experimentation has been carried out on a set of 20 proteins selected from the PDB. In detail we tested five different cases: (i) performances on searching a motif within single proteins, (ii) searching motifs on a set of proteins belonging to the same biological family, (iii) searching into single symmetric proteins, (iv) searching on a set of symmetric proteins from the same family, and finally (v) a general motif retrieval from the entire protein dataset. The experimental results showed good motif recognition performances on each test category, and, by exploiting the basic biological features saliency in motif construction, comparing to a previous approach of SSEs block geometrical retrieval based on the Generalized Hough Transform, it was revealed a significant decrease of the time/space computational complexity. It is worth to point out that the computation time for the case of motif absence is significantly lower than the case of motif present.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信