Algorithms for Molecular Biology最新文献

筛选
英文 中文
Parsimonious Clone Tree Integration in cancer 癌症中的简约克隆树整合
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2022-03-14 DOI: 10.1186/s13015-022-00209-9
P. Sashittal, Simone Zaccaria, M. El-Kebir
{"title":"Parsimonious Clone Tree Integration in cancer","authors":"P. Sashittal, Simone Zaccaria, M. El-Kebir","doi":"10.1186/s13015-022-00209-9","DOIUrl":"https://doi.org/10.1186/s13015-022-00209-9","url":null,"abstract":"","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"18 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86681252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Efficiently sparse listing of classes of optimal cophylogeny reconciliations. 最优亲缘关系协调类的高效稀疏列表。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2022-02-15 DOI: 10.1186/s13015-022-00206-y
Yishu Wang, Arnaud Mary, Marie-France Sagot, Blerina Sinaimeri
{"title":"Efficiently sparse listing of classes of optimal cophylogeny reconciliations.","authors":"Yishu Wang,&nbsp;Arnaud Mary,&nbsp;Marie-France Sagot,&nbsp;Blerina Sinaimeri","doi":"10.1186/s13015-022-00206-y","DOIUrl":"https://doi.org/10.1186/s13015-022-00206-y","url":null,"abstract":"<p><strong>Background: </strong>Cophylogeny reconciliation is a powerful method for analyzing host-parasite (or host-symbiont) co-evolution. It models co-evolution as an optimization problem where the set of all optimal solutions may represent different biological scenarios which thus need to be analyzed separately. Despite the significant research done in the area, few approaches have addressed the problem of helping the biologist deal with the often huge space of optimal solutions.</p><p><strong>Results: </strong>In this paper, we propose a new approach to tackle this problem. We introduce three different criteria under which two solutions may be considered biologically equivalent, and then we propose polynomial-delay algorithms that enumerate only one representative per equivalence class (without listing all the solutions).</p><p><strong>Conclusions: </strong>Our results are of both theoretical and practical importance. Indeed, as shown by the experiments, we are able to significantly reduce the space of optimal solutions while still maintaining important biological information about the whole space.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":" ","pages":"2"},"PeriodicalIF":1.0,"publicationDate":"2022-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8845303/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39788408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new 1.375-approximation algorithm for sorting by transpositions. 一种新的1.375-近似算法用于换位排序。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2022-01-15 DOI: 10.1186/s13015-022-00205-z
Luiz Augusto G Silva, Luis Antonio B Kowada, Noraí Romeu Rocco, Maria Emília M T Walter
{"title":"A new 1.375-approximation algorithm for sorting by transpositions.","authors":"Luiz Augusto G Silva,&nbsp;Luis Antonio B Kowada,&nbsp;Noraí Romeu Rocco,&nbsp;Maria Emília M T Walter","doi":"10.1186/s13015-022-00205-z","DOIUrl":"https://doi.org/10.1186/s13015-022-00205-z","url":null,"abstract":"<p><strong>Background: </strong>SORTING BY TRANSPOSITIONS (SBT) is a classical problem in genome rearrangements. In 2012, SBT was proven to be [Formula: see text]-hard and the best approximation algorithm with a 1.375 ratio was proposed in 2006 by Elias and Hartman (EH algorithm). Their algorithm employs simplification, a technique used to transform an input permutation [Formula: see text] into a simple permutation [Formula: see text], presumably easier to handle with. The permutation [Formula: see text] is obtained by inserting new symbols into [Formula: see text] in a way that the lower bound of the transposition distance of [Formula: see text] is kept on [Formula: see text]. The simplification is guaranteed to keep the lower bound, not the transposition distance. A sequence of operations sorting [Formula: see text] can be mimicked to sort [Formula: see text].</p><p><strong>Results and conclusions: </strong>First, using an algebraic approach, we propose a new upper bound for the transposition distance, which holds for all [Formula: see text]. Next, motivated by a problem identified in the EH algorithm, which causes it, in scenarios involving how the input permutation is simplified, to require one extra transposition above the 1.375-approximation ratio, we propose a new approximation algorithm to solve SBT ensuring the 1.375-approximation ratio for all [Formula: see text]. We implemented our algorithm and EH's. Regarding the implementation of the EH algorithm, two other issues were identified and needed to be fixed. We tested both algorithms against all permutations of size n, [Formula: see text]. The results show that the EH algorithm exceeds the approximation ratio of 1.375 for permutations with a size greater than 7. The percentage of computed distances that are equal to transposition distance, computed by the implemented algorithms are also compared with others available in the literature. Finally, we investigate the performance of both implementations on longer permutations of maximum length 500. From the experiments, we conclude that maximum and the average distances computed by our algorithm are a little better than the ones computed by the EH algorithm and the running times of both algorithms are similar, despite the time complexity of our algorithm being higher.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":" ","pages":"1"},"PeriodicalIF":1.0,"publicationDate":"2022-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39913478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An improved approximation algorithm for the reversal and transposition distance considering gene order and intergenic sizes. 一种考虑基因顺序和基因间大小的反转和转位距离的改进近似算法。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-12-29 DOI: 10.1186/s13015-021-00203-7
Klairton L Brito, Andre R Oliveira, Alexsandro O Alexandrino, Ulisses Dias, Zanoni Dias
{"title":"An improved approximation algorithm for the reversal and transposition distance considering gene order and intergenic sizes.","authors":"Klairton L Brito,&nbsp;Andre R Oliveira,&nbsp;Alexsandro O Alexandrino,&nbsp;Ulisses Dias,&nbsp;Zanoni Dias","doi":"10.1186/s13015-021-00203-7","DOIUrl":"https://doi.org/10.1186/s13015-021-00203-7","url":null,"abstract":"<p><strong>Background: </strong>In the comparative genomics field, one of the goals is to estimate a sequence of genetic changes capable of transforming a genome into another. Genome rearrangement events are mutations that can alter the genetic content or the arrangement of elements from the genome. Reversal and transposition are two of the most studied genome rearrangement events. A reversal inverts a segment of a genome while a transposition swaps two consecutive segments. Initial studies in the area considered only the order of the genes. Recent works have incorporated other genetic information in the model. In particular, the information regarding the size of intergenic regions, which are structures between each pair of genes and in the extremities of a linear genome.</p><p><strong>Results and conclusions: </strong>In this work, we investigate the SORTING BY INTERGENIC REVERSALS AND TRANSPOSITIONS problem on genomes sharing the same set of genes, considering the cases where the orientation of genes is known and unknown. Besides, we explored a variant of the problem, which generalizes the transposition event. As a result, we present an approximation algorithm that guarantees an approximation factor of 4 for both cases considering the reversal and transposition (classic definition) events, an improvement from the 4.5-approximation previously known for the scenario where the orientation of the genes is unknown. We also present a 3-approximation algorithm by incorporating the generalized transposition event, and we propose a greedy strategy to improve the performance of the algorithms. We performed practical tests adopting simulated data which indicated that the algorithms, in both cases, tend to perform better when compared with the best-known algorithms for the problem. Lastly, we conducted experiments using real genomes to demonstrate the applicability of the algorithms.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"24"},"PeriodicalIF":1.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8717661/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39773174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Approximation algorithm for rearrangement distances considering repeated genes and intergenic regions. 考虑重复基因和基因间区域的重排距离逼近算法。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-10-13 DOI: 10.1186/s13015-021-00200-w
Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Andre Rodrigues Oliveira, Zanoni Dias
{"title":"Approximation algorithm for rearrangement distances considering repeated genes and intergenic regions.","authors":"Gabriel Siqueira,&nbsp;Alexsandro Oliveira Alexandrino,&nbsp;Andre Rodrigues Oliveira,&nbsp;Zanoni Dias","doi":"10.1186/s13015-021-00200-w","DOIUrl":"https://doi.org/10.1186/s13015-021-00200-w","url":null,"abstract":"<p><p>The rearrangement distance is a method to compare genomes of different species. Such distance is the number of rearrangement events necessary to transform one genome into another. Two commonly studied events are the transposition, which exchanges two consecutive blocks of the genome, and the reversal, which reverts a block of the genome. When dealing with such problems, seminal works represented genomes as sequences of genes without repetition. More realistic models started to consider gene repetition or the presence of intergenic regions, sequences of nucleotides between genes and in the extremities of the genome. This work explores the transposition and reversal events applied in a genome representation considering both gene repetition and intergenic regions. We define two problems called Minimum Common Intergenic String Partition and Reverse Minimum Common Intergenic String Partition. Using a relation with these two problems, we show a [Formula: see text]-approximation for the Intergenic Transposition Distance, the Intergenic Reversal Distance, and the Intergenic Reversal and Transposition Distance problems, where k is the maximum number of copies of a gene in the genomes. Our practical experiments on simulated genomes show that the use of partitions improves the estimates for the distances.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"21"},"PeriodicalIF":1.0,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8513232/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39539880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Heuristic algorithms for best match graph editing. 最佳匹配图编辑的启发式算法。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-08-17 DOI: 10.1186/s13015-021-00196-3
David Schaller, Manuela Geiß, Marc Hellmuth, Peter F Stadler
{"title":"Heuristic algorithms for best match graph editing.","authors":"David Schaller,&nbsp;Manuela Geiß,&nbsp;Marc Hellmuth,&nbsp;Peter F Stadler","doi":"10.1186/s13015-021-00196-3","DOIUrl":"https://doi.org/10.1186/s13015-021-00196-3","url":null,"abstract":"<p><strong>Background: </strong>Best match graphs (BMGs) are a class of colored digraphs that naturally appear in mathematical phylogenetics as a representation of the pairwise most closely related genes among multiple species. An arc connects a gene x with a gene y from another species (vertex color) Y whenever it is one of the phylogenetically closest relatives of x. BMGs can be approximated with the help of similarity measures between gene sequences, albeit not without errors. Empirical estimates thus will usually violate the theoretical properties of BMGs. The corresponding graph editing problem can be used to guide error correction for best match data. Since the arc set modification problems for BMGs are NP-complete, efficient heuristics are needed if BMGs are to be used for the practical analysis of biological sequence data.</p><p><strong>Results: </strong>Since BMGs have a characterization in terms of consistency of a certain set of rooted triples (binary trees on three vertices) defined on the set of genes, we consider heuristics that operate on triple sets. As an alternative, we show that there is a close connection to a set partitioning problem that leads to a class of top-down recursive algorithms that are similar to Aho's supertree algorithm and give rise to BMG editing algorithms that are consistent in the sense that they leave BMGs invariant. Extensive benchmarking shows that community detection algorithms for the partitioning steps perform best for BMG editing.</p><p><strong>Conclusion: </strong>Noisy BMG data can be corrected with sufficient accuracy and efficiency to make BMGs an attractive alternative to classical phylogenetic methods.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"19"},"PeriodicalIF":1.0,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8369769/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39320777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
INGOT-DR: an interpretable classifier for predicting drug resistance in M. tuberculosis. INGOT-DR:预测结核分枝杆菌耐药性的可解释分类器。
IF 1.5 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-08-10 DOI: 10.1186/s13015-021-00198-1
Hooman Zabeti, Nick Dexter, Amir Hosein Safari, Nafiseh Sedaghat, Maxwell Libbrecht, Leonid Chindelevitch
{"title":"INGOT-DR: an interpretable classifier for predicting drug resistance in M. tuberculosis.","authors":"Hooman Zabeti, Nick Dexter, Amir Hosein Safari, Nafiseh Sedaghat, Maxwell Libbrecht, Leonid Chindelevitch","doi":"10.1186/s13015-021-00198-1","DOIUrl":"10.1186/s13015-021-00198-1","url":null,"abstract":"<p><strong>Motivation: </strong>Prediction of drug resistance and identification of its mechanisms in bacteria such as Mycobacterium tuberculosis, the etiological agent of tuberculosis, is a challenging problem. Solving this problem requires a transparent, accurate, and flexible predictive model. The methods currently used for this purpose rarely satisfy all of these criteria. On the one hand, approaches based on testing strains against a catalogue of previously identified mutations often yield poor predictive performance; on the other hand, machine learning techniques typically have higher predictive accuracy, but often lack interpretability and may learn patterns that produce accurate predictions for the wrong reasons. Current interpretable methods may either exhibit a lower accuracy or lack the flexibility needed to generalize them to previously unseen data.</p><p><strong>Contribution: </strong>In this paper we propose a novel technique, inspired by group testing and Boolean compressed sensing, which yields highly accurate predictions, interpretable results, and is flexible enough to be optimized for various evaluation metrics at the same time.</p><p><strong>Results: </strong>We test the predictive accuracy of our approach on five first-line and seven second-line antibiotics used for treating tuberculosis. We find that it has a higher or comparable accuracy to that of commonly used machine learning models, and is able to identify variants in genes with previously reported association to drug resistance. Our method is intrinsically interpretable, and can be customized for different evaluation metrics. Our implementation is available at github.com/hoomanzabeti/INGOT_DR and can be installed via The Python Package Index (Pypi) under ingotdr. This package is also compatible with most of the tools in the Scikit-learn machine learning library.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"17"},"PeriodicalIF":1.5,"publicationDate":"2021-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8353837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39298492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximate search for known gene clusters in new genomes using PQ-trees. 使用pq树对新基因组中已知基因簇进行近似搜索。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-07-09 DOI: 10.1186/s13015-021-00190-9
Galia R Zimerman, Dina Svetlitsky, Meirav Zehavi, Michal Ziv-Ukelson
{"title":"Approximate search for known gene clusters in new genomes using PQ-trees.","authors":"Galia R Zimerman,&nbsp;Dina Svetlitsky,&nbsp;Meirav Zehavi,&nbsp;Michal Ziv-Ukelson","doi":"10.1186/s13015-021-00190-9","DOIUrl":"https://doi.org/10.1186/s13015-021-00190-9","url":null,"abstract":"<p><p>Gene clusters are groups of genes that are co-locally conserved across various genomes, not necessarily in the same order. Their discovery and analysis is valuable in tasks such as gene annotation and prediction of gene interactions, and in the study of genome organization and evolution. The discovery of conserved gene clusters in a given set of genomes is a well studied problem, but with the rapid sequencing of prokaryotic genomes a new problem is inspired. Namely, given an already known gene cluster that was discovered and studied in one genomic dataset, to identify all the instances of the gene cluster in a given new genomic sequence. Thus, we define a new problem in comparative genomics, denoted PQ-TREE SEARCH that takes as input a PQ-tree T representing the known gene orders of a gene cluster of interest, a gene-to-gene substitution scoring function h, integer arguments [Formula: see text] and [Formula: see text], and a new sequence of genes S. The objective is to identify in S approximate new instances of the gene cluster; These instances could vary from the known gene orders by genome rearrangements that are constrained by T, by gene substitutions that are governed by h, and by gene deletions and insertions that are bounded from above by [Formula: see text] and [Formula: see text], respectively. We prove that PQ-TREE SEARCH is NP-hard and propose a parameterized algorithm that solves the optimization variant of PQ-TREE SEARCH in [Formula: see text] time, where [Formula: see text] is the maximum degree of a node in T and [Formula: see text] is used to hide factors polynomial in the input size. The algorithm is implemented as a search tool, denoted PQFinder, and applied to search for instances of chromosomal gene clusters in plasmids, within a dataset of 1,487 prokaryotic genomes. We report on 29 chromosomal gene clusters that are rearranged in plasmids, where the rearrangements are guided by the corresponding PQ-trees. One of these results, coding for a heavy metal efflux pump, is further analysed to exemplify how PQFinder can be harnessed to reveal interesting new structural variants of known gene clusters.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"16"},"PeriodicalIF":1.0,"publicationDate":"2021-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13015-021-00190-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39170062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Shape decomposition algorithms for laser capture microdissection. 激光捕获显微解剖的形状分解算法。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-07-08 DOI: 10.1186/s13015-021-00193-6
Leonie Selbach, Tobias Kowalski, Klaus Gerwert, Maike Buchin, Axel Mosig
{"title":"Shape decomposition algorithms for laser capture microdissection.","authors":"Leonie Selbach,&nbsp;Tobias Kowalski,&nbsp;Klaus Gerwert,&nbsp;Maike Buchin,&nbsp;Axel Mosig","doi":"10.1186/s13015-021-00193-6","DOIUrl":"https://doi.org/10.1186/s13015-021-00193-6","url":null,"abstract":"<p><strong>Background: </strong>In the context of biomarker discovery and molecular characterization of diseases, laser capture microdissection is a highly effective approach to extract disease-specific regions from complex, heterogeneous tissue samples. For the extraction to be successful, these regions have to satisfy certain constraints in size and shape and thus have to be decomposed into feasible fragments.</p><p><strong>Results: </strong>We model this problem of constrained shape decomposition as the computation of optimal feasible decompositions of simple polygons. We use a skeleton-based approach and present an algorithmic framework that allows the implementation of various feasibility criteria as well as optimization goals. Motivated by our application, we consider different constraints and examine the resulting fragmentations. We evaluate our algorithm on lung tissue samples in comparison to a heuristic decomposition approach. Our method achieved a success rate of over 95% in the microdissection and tissue yield was increased by 10-30%.</p><p><strong>Conclusion: </strong>We present a novel approach for constrained shape decomposition by demonstrating its advantages for the application in the microdissection of tissue samples. In comparison to the previous decomposition approach, the proposed method considerably increases the amount of successfully dissected tissue.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"15"},"PeriodicalIF":1.0,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13015-021-00193-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39165163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors. 根据肿瘤单细胞DNA测序数据区分线性进化和分支进化。
IF 1 4区 生物学
Algorithms for Molecular Biology Pub Date : 2021-07-06 DOI: 10.1186/s13015-021-00194-5
Leah L Weber, Mohammed El-Kebir
{"title":"Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors.","authors":"Leah L Weber,&nbsp;Mohammed El-Kebir","doi":"10.1186/s13015-021-00194-5","DOIUrl":"https://doi.org/10.1186/s13015-021-00194-5","url":null,"abstract":"<p><strong>Background: </strong>Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor's evolutionary process as either linear or branched and understanding what cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations with current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor's evolutionary history as either linear or branched.</p><p><strong>Results: </strong>We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, which we prove to be NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, outperforming a competing machine learning approach.</p><p><strong>Conclusion: </strong>Phyolin is an accurate, easy to use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor's single-cell DNA sequencing data.</p>","PeriodicalId":50823,"journal":{"name":"Algorithms for Molecular Biology","volume":"16 1","pages":"14"},"PeriodicalIF":1.0,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13015-021-00194-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39158756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信