{"title":"nT4X和nT4M:新的时间不可逆混合物氨基酸取代模型。","authors":"Nguyen Huy Tinh, Cuong Cao Dang, Le Sy Vinh","doi":"10.1007/s00239-024-10230-8","DOIUrl":null,"url":null,"abstract":"<p><p>One of the most important and difficult challenges in the research of molecular evolution is modeling the process of amino acid substitutions. Although single-matrix models, such as the LG model, are popular, their capability to properly capture the heterogeneity of the substitution process across sites is still questioned. Several mixture models with multiple matrices have been introduced and shown to offer advantages over single-matrix models. Current general mixture models assume the reversibility of the evolutionary process, implying that substitution rates between any two amino acids are equal in both forward and backward directions. This assumption is not based on biological properties but rather on computational simplicity. The well-known hypothesis is that more realistic models can yield more accurate evolutionary inferences; therefore, our aim is to estimate more biologically realistic models. To this end, we relax the assumption of reversibility and introduce two new general non-reversible 4-matrix mixture models, called nT4M and nT4X. Using alignments from HSSP and TreeBASE databases as data, our newly estimated models outperformed all single-matrix models and almost all reversible mixture models. Moreover, the new non-reversible mixture models enable us to infer rooted trees.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"nT4X and nT4M: Novel Time Non-reversible Mixture Amino Acid Substitution Models.\",\"authors\":\"Nguyen Huy Tinh, Cuong Cao Dang, Le Sy Vinh\",\"doi\":\"10.1007/s00239-024-10230-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>One of the most important and difficult challenges in the research of molecular evolution is modeling the process of amino acid substitutions. Although single-matrix models, such as the LG model, are popular, their capability to properly capture the heterogeneity of the substitution process across sites is still questioned. Several mixture models with multiple matrices have been introduced and shown to offer advantages over single-matrix models. Current general mixture models assume the reversibility of the evolutionary process, implying that substitution rates between any two amino acids are equal in both forward and backward directions. This assumption is not based on biological properties but rather on computational simplicity. The well-known hypothesis is that more realistic models can yield more accurate evolutionary inferences; therefore, our aim is to estimate more biologically realistic models. To this end, we relax the assumption of reversibility and introduce two new general non-reversible 4-matrix mixture models, called nT4M and nT4X. Using alignments from HSSP and TreeBASE databases as data, our newly estimated models outperformed all single-matrix models and almost all reversible mixture models. Moreover, the new non-reversible mixture models enable us to infer rooted trees.</p>\",\"PeriodicalId\":16366,\"journal\":{\"name\":\"Journal of Molecular Evolution\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Evolution\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s00239-024-10230-8\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00239-024-10230-8","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
nT4X and nT4M: Novel Time Non-reversible Mixture Amino Acid Substitution Models.
One of the most important and difficult challenges in the research of molecular evolution is modeling the process of amino acid substitutions. Although single-matrix models, such as the LG model, are popular, their capability to properly capture the heterogeneity of the substitution process across sites is still questioned. Several mixture models with multiple matrices have been introduced and shown to offer advantages over single-matrix models. Current general mixture models assume the reversibility of the evolutionary process, implying that substitution rates between any two amino acids are equal in both forward and backward directions. This assumption is not based on biological properties but rather on computational simplicity. The well-known hypothesis is that more realistic models can yield more accurate evolutionary inferences; therefore, our aim is to estimate more biologically realistic models. To this end, we relax the assumption of reversibility and introduce two new general non-reversible 4-matrix mixture models, called nT4M and nT4X. Using alignments from HSSP and TreeBASE databases as data, our newly estimated models outperformed all single-matrix models and almost all reversible mixture models. Moreover, the new non-reversible mixture models enable us to infer rooted trees.
期刊介绍:
Journal of Molecular Evolution covers experimental, computational, and theoretical work aimed at deciphering features of molecular evolution and the processes bearing on these features, from the initial formation of macromolecular systems through their evolution at the molecular level, the co-evolution of their functions in cellular and organismal systems, and their influence on organismal adaptation, speciation, and ecology. Topics addressed include the evolution of informational macromolecules and their relation to more complex levels of biological organization, including populations and taxa, as well as the molecular basis for the evolution of ecological interactions of species and the use of molecular data to infer fundamental processes in evolutionary ecology. This coverage accommodates such subfields as new genome sequences, comparative structural and functional genomics, population genetics, the molecular evolution of development, the evolution of gene regulation and gene interaction networks, and in vitro evolution of DNA and RNA, molecular evolutionary ecology, and the development of methods and theory that enable molecular evolutionary inference, including but not limited to, phylogenetic methods.