A linear time algorithm for finding all maximal scoring subsequences.

Proceedings. International Conference on Intelligent Systems for Molecular Biology Pub Date : 1999-01-01

W L Ruzzo, M Tompa

引用次数: 0

Abstract

Given a sequence of real numbers ("scores"), we present a practical linear time algorithm to find those nonoverlapping, contiguous subsequences having greatest total scores. This improves on the best previously known algorithm, which requires quadratic time in the worst case. The problem arises in biological sequence analysis, where the high-scoring subsequences correspond to regions of unusual composition in a nucleic acid or protein sequence. For instance, Altschul, Karlin, and others have used this approach to identify transmembrane regions, DNA binding domains, and regions of high charge in proteins.

本刊更多论文

寻找所有最大得分子序列的线性时间算法。

给定一个实数序列(“分数”)，我们提出了一个实用的线性时间算法来找到那些具有最大总分的非重叠、连续子序列。这改进了之前已知的最佳算法，在最坏的情况下需要二次的时间。问题出现在生物序列分析中，其中高分子序列对应于核酸或蛋白质序列中不寻常组成的区域。例如，Altschul, Karlin和其他人已经使用这种方法来识别跨膜区域，DNA结合域和蛋白质中的高电荷区域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. International Conference on Intelligent Systems for Molecular Biology

自引率

0.00%

发文量