Rapid protein fragment search using hash functions based on the Fourier transform.

T Akutsu, K Onizuka, M Ishikawa
{"title":"Rapid protein fragment search using hash functions based on the Fourier transform.","authors":"T Akutsu,&nbsp;K Onizuka,&nbsp;M Ishikawa","doi":"10.1093/bioinformatics/13.4.357","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.</p><p><strong>Results: </strong>This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.</p>","PeriodicalId":77081,"journal":{"name":"Computer applications in the biosciences : CABIOS","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/bioinformatics/13.4.357","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer applications in the biosciences : CABIOS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/13.4.357","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Motivation: Since the protein structure database has been growing very rapidly in recent years, the development of efficient methods for searching for similar structures is very important.

Results: This paper presents a novel method for searching for similar fragments of proteins. In this method, a hash vector (a vector of real numbers) is associated with each fixed-length fragment of three-dimensional protein structure. Each vector consists of low-frequency components of the Fourier-like spectrum for the distances between C alpha atoms and the centroid. Then, we can analyze the similarity between fragments by evaluating the difference between hash vectors. The novel aspect of the method is that the following property is proved theoretically: if the root mean square distance between two fragments is small, then the distance between the hash vectors is small. Several variants of this method were compared with a naive method and a previous method using PDB data. The results show that the fastest one among the variants is 18-80 times faster than the naive method, and 3-10 times faster than the previous method.

基于傅里叶变换的哈希函数快速蛋白质片段搜索。
动机:由于近年来蛋白质结构数据库的增长非常迅速,因此开发有效的方法来搜索相似结构是非常重要的。结果:本文提出了一种寻找相似蛋白片段的新方法。在这种方法中,一个哈希向量(实数向量)与三维蛋白质结构的每个定长片段相关联。每个向量由C原子和质心之间距离的类傅立叶谱的低频分量组成。然后,我们可以通过计算哈希向量之间的差异来分析片段之间的相似性。该方法的新颖之处在于从理论上证明了以下性质:如果两个片段之间的均方根距离较小,则哈希向量之间的距离较小。将该方法的几种变体与原始方法和先前使用PDB数据的方法进行了比较。结果表明,其中速度最快的方法比朴素方法快18 ~ 80倍,比原方法快3 ~ 10倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信