超级气旋再现。

IF 1.5 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS
Algorithms for Molecular Biology Pub Date : 2018-12-01 eCollection Date: 2018-01-01 DOI:10.1186/s13015-018-0134-3
Fabian Gärtner, Lydia Müller, Peter F Stadler
{"title":"超级气旋再现。","authors":"Fabian Gärtner,&nbsp;Lydia Müller,&nbsp;Peter F Stadler","doi":"10.1186/s13015-018-0134-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Superbubbles are distinctive subgraphs in direct graphs that play an important role in assembly algorithms for high-throughput sequencing (HTS) data. Their practical importance derives from the fact they are connected to their host graph by a single entrance and a single exit vertex, thus allowing them to be handled independently. Efficient algorithms for the enumeration of superbubbles are therefore of important for the processing of HTS data. Superbubbles can be identified within the strongly connected components of the input digraph after transforming them into directed acyclic graphs. The algorithm by Sung et al. (IEEE ACM Trans Comput Biol Bioinform 12:770-777, 2015) achieves this task in <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mspace></mspace> <mi>l</mi> <mi>o</mi> <mi>g</mi> <mo>(</mo> <mi>m</mi> <mo>)</mo> <mo>)</mo></mrow> </math> -time. The extraction of superbubbles from the transformed components was later improved to by Brankovic et al. (Theor Comput Sci 609:374-383, 2016) resulting in an overall <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mo>+</mo> <mi>n</mi> <mo>)</mo></mrow> </math> -time algorithm.</p><p><strong>Results: </strong>A re-analysis of the mathematical structure of superbubbles showed that the construction of auxiliary DAGs from the strongly connected components in the work of Sung et al. missed some details that can lead to the reporting of false positive superbubbles. We propose an alternative, even simpler auxiliary graph that solved the problem and retains the linear running time for general digraph. Furthermore, we describe a simpler, space-efficient <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mo>+</mo> <mi>n</mi> <mo>)</mo></mrow> </math> -time algorithm for detecting superbubbles in DAGs that uses only simple data structures.</p><p><strong>Implementation: </strong>We present a reference implementation of the algorithm that accepts many commonly used formats for the input graph and provides convenient access to the improved algorithm. https://github.com/Fabianexe/Superbubble.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"13 ","pages":"16"},"PeriodicalIF":1.5000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13015-018-0134-3","citationCount":"5","resultStr":"{\"title\":\"Superbubbles revisited.\",\"authors\":\"Fabian Gärtner,&nbsp;Lydia Müller,&nbsp;Peter F Stadler\",\"doi\":\"10.1186/s13015-018-0134-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Superbubbles are distinctive subgraphs in direct graphs that play an important role in assembly algorithms for high-throughput sequencing (HTS) data. Their practical importance derives from the fact they are connected to their host graph by a single entrance and a single exit vertex, thus allowing them to be handled independently. Efficient algorithms for the enumeration of superbubbles are therefore of important for the processing of HTS data. Superbubbles can be identified within the strongly connected components of the input digraph after transforming them into directed acyclic graphs. The algorithm by Sung et al. (IEEE ACM Trans Comput Biol Bioinform 12:770-777, 2015) achieves this task in <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mspace></mspace> <mi>l</mi> <mi>o</mi> <mi>g</mi> <mo>(</mo> <mi>m</mi> <mo>)</mo> <mo>)</mo></mrow> </math> -time. The extraction of superbubbles from the transformed components was later improved to by Brankovic et al. (Theor Comput Sci 609:374-383, 2016) resulting in an overall <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mo>+</mo> <mi>n</mi> <mo>)</mo></mrow> </math> -time algorithm.</p><p><strong>Results: </strong>A re-analysis of the mathematical structure of superbubbles showed that the construction of auxiliary DAGs from the strongly connected components in the work of Sung et al. missed some details that can lead to the reporting of false positive superbubbles. We propose an alternative, even simpler auxiliary graph that solved the problem and retains the linear running time for general digraph. Furthermore, we describe a simpler, space-efficient <math><mrow><mi>O</mi> <mo>(</mo> <mi>m</mi> <mo>+</mo> <mi>n</mi> <mo>)</mo></mrow> </math> -time algorithm for detecting superbubbles in DAGs that uses only simple data structures.</p><p><strong>Implementation: </strong>We present a reference implementation of the algorithm that accepts many commonly used formats for the input graph and provides convenient access to the improved algorithm. https://github.com/Fabianexe/Superbubble.</p>\",\"PeriodicalId\":50823,\"journal\":{\"name\":\"Algorithms for Molecular Biology\",\"volume\":\"13 \",\"pages\":\"16\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s13015-018-0134-3\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Algorithms for Molecular Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s13015-018-0134-3\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2018/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Algorithms for Molecular Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13015-018-0134-3","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2018/1/1 0:00:00","PubModel":"eCollection","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 5

摘要

背景:超泡是直接图中独特的子图,在高通量测序(HTS)数据的组装算法中起着重要作用。它们的实际重要性源于它们通过单个入口和单个出口顶点连接到它们的主图,从而允许它们独立处理。因此,高效的超级气泡枚举算法对于高温超导数据的处理是非常重要的。将输入有向图转换为有向无环图后,可以在输入有向图的强连通分量中识别出超泡。Sung等人(IEEE ACM Trans computer Biol Bioinform 12:770-777, 2015)的算法在O (m) l (m) O (m))时间内完成了该任务。Brankovic等人(theory computer Sci 609:374-383, 2016)改进了从变换后的分量中提取超泡的方法,得到了一个总体耗时为O (m + n)的算法。结果:对超级气泡数学结构的重新分析表明,Sung等人的工作中从强连接成分构建辅助dag遗漏了一些可能导致报告假阳性超级气泡的细节。我们提出了一种更简单的辅助图,它既解决了这个问题,又保留了一般有向图的线性运行时间。此外,我们描述了一种更简单,空间效率高的O (m + n)时间算法,用于仅使用简单数据结构的dag中检测超泡。实现:我们给出了该算法的参考实现,该算法接受许多常用的输入图格式,并提供了对改进算法的方便访问。https://github.com/Fabianexe/Superbubble。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Superbubbles revisited.

Superbubbles revisited.

Superbubbles revisited.

Superbubbles revisited.

Background: Superbubbles are distinctive subgraphs in direct graphs that play an important role in assembly algorithms for high-throughput sequencing (HTS) data. Their practical importance derives from the fact they are connected to their host graph by a single entrance and a single exit vertex, thus allowing them to be handled independently. Efficient algorithms for the enumeration of superbubbles are therefore of important for the processing of HTS data. Superbubbles can be identified within the strongly connected components of the input digraph after transforming them into directed acyclic graphs. The algorithm by Sung et al. (IEEE ACM Trans Comput Biol Bioinform 12:770-777, 2015) achieves this task in O ( m l o g ( m ) ) -time. The extraction of superbubbles from the transformed components was later improved to by Brankovic et al. (Theor Comput Sci 609:374-383, 2016) resulting in an overall O ( m + n ) -time algorithm.

Results: A re-analysis of the mathematical structure of superbubbles showed that the construction of auxiliary DAGs from the strongly connected components in the work of Sung et al. missed some details that can lead to the reporting of false positive superbubbles. We propose an alternative, even simpler auxiliary graph that solved the problem and retains the linear running time for general digraph. Furthermore, we describe a simpler, space-efficient O ( m + n ) -time algorithm for detecting superbubbles in DAGs that uses only simple data structures.

Implementation: We present a reference implementation of the algorithm that accepts many commonly used formats for the input graph and provides convenient access to the improved algorithm. https://github.com/Fabianexe/Superbubble.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Algorithms for Molecular Biology
Algorithms for Molecular Biology 生物-生化研究方法
CiteScore
2.40
自引率
10.00%
发文量
16
审稿时长
>12 weeks
期刊介绍: Algorithms for Molecular Biology publishes articles on novel algorithms for biological sequence and structure analysis, phylogeny reconstruction, and combinatorial algorithms and machine learning. Areas of interest include but are not limited to: algorithms for RNA and protein structure analysis, gene prediction and genome analysis, comparative sequence analysis and alignment, phylogeny, gene expression, machine learning, and combinatorial algorithms. Where appropriate, manuscripts should describe applications to real-world data. However, pure algorithm papers are also welcome if future applications to biological data are to be expected, or if they address complexity or approximation issues of novel computational problems in molecular biology. Articles about novel software tools will be considered for publication if they contain some algorithmically interesting aspects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信