{"title":"On Pattern-Cladistic Analyses Based on Complete Plastid Genome Sequences","authors":"Evgeny V. Mavrodiev, Alexander Madorsky","doi":"10.1007/s10441-023-09475-5","DOIUrl":null,"url":null,"abstract":"<div><p>The fundamental Hennigian principle, grouping <i>solely</i> on synapomorphy, is seldom used in modern phylogenetics. In the submitted paper, we apply this principle in reanalyzing five datasets comprising 197 complete plastid genomes (plastomes). We focused on the latter because plastome-based DNA sequence data gained dramatic popularity in molecular systematics during the last decade. We show that pattern-cladistic analyses based on complete plastid genome sequences can successfully resolve affinities between plant taxa, simultaneously simplifying both the genomic and analytical frameworks of phylogenetic studies. We developed “Matrix to Newick” (M2N), a program to represent the standard molecular alignment of plastid genomes in the form of trees or <i>relationships</i> directly. Thus, massive plastome-based DNA sequence data can be successfully represented in a <i>relational</i> form rather than as a standard molecular alignment. Application of methods of median supertree construction (the Average Consensus method has been used as an example in this study) or Maximum Parsimony analysis to relational representations of plastome sequence data may help systematist to avoid the complicated assumption-based frameworks of Maximum Likelihood or Bayesian phylogenetics that are most used today in massive plastid sequence data analyses. We also found that significant amounts of pure genomic information that typically accommodate the majority of current plastid phylogenomic studies can be effectively dropped by systematists if they focus on the pattern-cladistics or relational analyses of plastome-based molecular data. The proposed pattern-cladistic approach is a powerful and straightforward heuristic alternative to modern plastome-based phylogenetics. </p></div>","PeriodicalId":7057,"journal":{"name":"Acta Biotheoretica","volume":"71 4","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Biotheoretica","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10441-023-09475-5","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The fundamental Hennigian principle, grouping solely on synapomorphy, is seldom used in modern phylogenetics. In the submitted paper, we apply this principle in reanalyzing five datasets comprising 197 complete plastid genomes (plastomes). We focused on the latter because plastome-based DNA sequence data gained dramatic popularity in molecular systematics during the last decade. We show that pattern-cladistic analyses based on complete plastid genome sequences can successfully resolve affinities between plant taxa, simultaneously simplifying both the genomic and analytical frameworks of phylogenetic studies. We developed “Matrix to Newick” (M2N), a program to represent the standard molecular alignment of plastid genomes in the form of trees or relationships directly. Thus, massive plastome-based DNA sequence data can be successfully represented in a relational form rather than as a standard molecular alignment. Application of methods of median supertree construction (the Average Consensus method has been used as an example in this study) or Maximum Parsimony analysis to relational representations of plastome sequence data may help systematist to avoid the complicated assumption-based frameworks of Maximum Likelihood or Bayesian phylogenetics that are most used today in massive plastid sequence data analyses. We also found that significant amounts of pure genomic information that typically accommodate the majority of current plastid phylogenomic studies can be effectively dropped by systematists if they focus on the pattern-cladistics or relational analyses of plastome-based molecular data. The proposed pattern-cladistic approach is a powerful and straightforward heuristic alternative to modern plastome-based phylogenetics.
Hennigiean的基本原理,仅根据突触形态进行分组,在现代系统发育学中很少使用。在提交的论文中,我们将这一原理应用于重新分析包括197个完整质体基因组(质体)的五个数据集。我们之所以关注后者,是因为在过去十年中,基于质体的DNA序列数据在分子系统学中得到了极大的普及。我们表明,基于完整质体基因组序列的模式分支分析可以成功地解决植物分类群之间的亲缘关系,同时简化系统发育研究的基因组和分析框架。我们开发了“Matrix to Newick”(M2N),这是一个以树或关系的形式直接表示质体基因组标准分子比对的程序。因此,基于质体的大量DNA序列数据可以成功地以关系形式表示,而不是作为标准分子比对。将中值超树构建方法(本研究以平均一致性方法为例)或最大分析法应用于质体序列数据的关系表示,可以帮助系统学家避免目前在大量质体序列数据中最常用的基于最大似然或贝叶斯系统发育学的复杂假设框架分析。我们还发现,如果系统学家专注于基于质体的分子数据的模式分支学或关系分析,那么他们可以有效地丢弃大量的纯基因组信息,这些信息通常可以容纳当前大多数质体系统发育学研究。所提出的模式分支方法是现代基于质体的系统发育学的一种强大而直接的启发式替代方法。
期刊介绍:
Acta Biotheoretica is devoted to the promotion of theoretical biology, encompassing mathematical biology and the philosophy of biology, paying special attention to the methodology of formation of biological theory.
Papers on all kind of biological theories are welcome. Interesting subjects include philosophy of biology, biomathematics, computational biology, genetics, ecology and morphology. The process of theory formation can be presented in verbal or mathematical form. Moreover, purely methodological papers can be devoted to the historical origins of the philosophy underlying biological theories and concepts.
Papers should contain clear statements of biological assumptions, and where applicable, a justification of their translation into mathematical form and a detailed discussion of the mathematical treatment. The connection to empirical data should be clarified.
Acta Biotheoretica also welcomes critical book reviews, short comments on previous papers and short notes directing attention to interesting new theoretical ideas.