{"title":"稳定地编码具有叶位对开的系统发育树。","authors":"Mark M. Tanaka , Ruiting Lan , Andrew R. Francis","doi":"10.1016/j.jtbi.2025.112265","DOIUrl":null,"url":null,"abstract":"<div><div>As genome sequencing data continue to expand, a persistent research challenge is to accommodate the growth of a phylogeny. This situation arises in molecular epidemiology, for example, where new taxonomic groups can appear in real time as pathogen isolates are sequenced. Efficient computational methods have been developed to place new leaves in existing trees, which removes the need to reconstruct trees from scratch. But for these tree extensions to be fully integrated with classification schemes requires a stable encoding of trees that keeps existing tree structures intact as new branches appear. Here, we propose a tree encoding, which we call a <em>folio</em>, that records the path from a reference vertex to each leaf, giving each leaf an <em>address</em>. We present a simple set of rules to assign new addresses to added leaves. The encoding is stable in the sense that it does not change as further leaf addresses are added to the folio. The tree can be uniquely recovered from a folio of addresses. We illustrate the methods using <em>Salmonella</em> genome data. Due to the properties of our encoding framework, we anticipate that it can be used for a range of different phylogenetic analyses.</div></div>","PeriodicalId":54763,"journal":{"name":"Journal of Theoretical Biology","volume":"616 ","pages":"Article 112265"},"PeriodicalIF":2.0000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Stably encoding phylogenetic trees with folios of leaf addresses\",\"authors\":\"Mark M. Tanaka , Ruiting Lan , Andrew R. Francis\",\"doi\":\"10.1016/j.jtbi.2025.112265\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As genome sequencing data continue to expand, a persistent research challenge is to accommodate the growth of a phylogeny. This situation arises in molecular epidemiology, for example, where new taxonomic groups can appear in real time as pathogen isolates are sequenced. Efficient computational methods have been developed to place new leaves in existing trees, which removes the need to reconstruct trees from scratch. But for these tree extensions to be fully integrated with classification schemes requires a stable encoding of trees that keeps existing tree structures intact as new branches appear. Here, we propose a tree encoding, which we call a <em>folio</em>, that records the path from a reference vertex to each leaf, giving each leaf an <em>address</em>. We present a simple set of rules to assign new addresses to added leaves. The encoding is stable in the sense that it does not change as further leaf addresses are added to the folio. The tree can be uniquely recovered from a folio of addresses. We illustrate the methods using <em>Salmonella</em> genome data. Due to the properties of our encoding framework, we anticipate that it can be used for a range of different phylogenetic analyses.</div></div>\",\"PeriodicalId\":54763,\"journal\":{\"name\":\"Journal of Theoretical Biology\",\"volume\":\"616 \",\"pages\":\"Article 112265\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Theoretical Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022519325002310\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Theoretical Biology","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022519325002310","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
Stably encoding phylogenetic trees with folios of leaf addresses
As genome sequencing data continue to expand, a persistent research challenge is to accommodate the growth of a phylogeny. This situation arises in molecular epidemiology, for example, where new taxonomic groups can appear in real time as pathogen isolates are sequenced. Efficient computational methods have been developed to place new leaves in existing trees, which removes the need to reconstruct trees from scratch. But for these tree extensions to be fully integrated with classification schemes requires a stable encoding of trees that keeps existing tree structures intact as new branches appear. Here, we propose a tree encoding, which we call a folio, that records the path from a reference vertex to each leaf, giving each leaf an address. We present a simple set of rules to assign new addresses to added leaves. The encoding is stable in the sense that it does not change as further leaf addresses are added to the folio. The tree can be uniquely recovered from a folio of addresses. We illustrate the methods using Salmonella genome data. Due to the properties of our encoding framework, we anticipate that it can be used for a range of different phylogenetic analyses.
期刊介绍:
The Journal of Theoretical Biology is the leading forum for theoretical perspectives that give insight into biological processes. It covers a very wide range of topics and is of interest to biologists in many areas of research, including:
• Brain and Neuroscience
• Cancer Growth and Treatment
• Cell Biology
• Developmental Biology
• Ecology
• Evolution
• Immunology,
• Infectious and non-infectious Diseases,
• Mathematical, Computational, Biophysical and Statistical Modeling
• Microbiology, Molecular Biology, and Biochemistry
• Networks and Complex Systems
• Physiology
• Pharmacodynamics
• Animal Behavior and Game Theory
Acceptable papers are those that bear significant importance on the biology per se being presented, and not on the mathematical analysis. Papers that include some data or experimental material bearing on theory will be considered, including those that contain comparative study, statistical data analysis, mathematical proof, computer simulations, experiments, field observations, or even philosophical arguments, which are all methods to support or reject theoretical ideas. However, there should be a concerted effort to make papers intelligible to biologists in the chosen field.