化石化的出生-死亡模型是可识别的。

IF 5.7 1区生物学 Q1 EVOLUTIONARY BIOLOGY

Systematic Biology Pub Date : 2025-02-10 DOI:10.1093/sysbio/syae058

Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina

{"title":"化石化的出生-死亡模型是可识别的。","authors":"Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina","doi":"10.1093/sysbio/syae058","DOIUrl":null,"url":null,"abstract":"Time-dependent birth-death sampling models have been used in numerous studies to infer past evolutionary dynamics in different biological contexts, for example, speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore, the rates become unidentifiable from the trees regardless of their size. Here, we show that widely used time-dependent fossilized birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilization process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. The identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent FBD model with an extra parameter, the removal after sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages, we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"112-123"},"PeriodicalIF":5.7000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11997801/pdf/","citationCount":"0","resultStr":"{\"title\":\"The Fossilized Birth-Death Model Is Identifiable.\",\"authors\":\"Kate Truman, Timothy G Vaughan, Alex Gavryushkin, Alexandra Sasha Gavryushkina\",\"doi\":\"10.1093/sysbio/syae058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time-dependent birth-death sampling models have been used in numerous studies to infer past evolutionary dynamics in different biological contexts, for example, speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore, the rates become unidentifiable from the trees regardless of their size. Here, we show that widely used time-dependent fossilized birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilization process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. The identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent FBD model with an extra parameter, the removal after sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages, we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.\",\"PeriodicalId\":22120,\"journal\":{\"name\":\"Systematic Biology\",\"volume\":\" \",\"pages\":\"112-123\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11997801/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Systematic Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/sysbio/syae058\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EVOLUTIONARY BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systematic Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/sysbio/syae058","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

依赖时间的出生-死亡采样模型已被大量研究用于推断不同生物背景下过去的进化动态，例如宏观进化研究中的物种分化和灭绝率，或流行病学研究中的有效繁殖数量。这些模型是一个分支过程，在这个过程中，世系可以分叉、死亡，或以随时间变化的出生率、死亡率和采样率进行采样，从而生成系统进化树。有研究表明，在这类模型的某些子类中，不同的速率集会导致重建的系统发生树分布相同，因此，无论系统发生树的大小如何，速率都无法从系统发生树中识别出来。在这里，我们证明了广泛使用的时间依赖化石出生-死亡（FBD）模型是可识别的。与无法识别的出生-死亡抽样模型相比，这一子类模型对化石化过程和某些传染病的传播过程做出了更现实的假设。也就是说，FBD 模型假定采样的世系会留在化石过程中，而不是在采样后立即消失。时间依赖性 FBD 模型的可识别性使我们有理由使用实现该模型的统计方法，从系统发生树或直接从分子或其他比较数据中推断潜在的时间多样化或流行病学动态。我们进一步证明，具有额外参数（取样后移除概率）的时间依赖性化石出生-死亡模型是不可识别的。这意味着，在我们不知道采样如何影响世系的情况下，我们无法仅从树中推断出这个额外参数以及出生率、死亡率和采样率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

The Fossilized Birth-Death Model Is Identifiable.

查看原文本刊更多论文

The Fossilized Birth-Death Model Is Identifiable.

Time-dependent birth-death sampling models have been used in numerous studies to infer past evolutionary dynamics in different biological contexts, for example, speciation and extinction rates in macroevolutionary studies, or effective reproductive number in epidemiological studies. These models are branching processes where lineages can bifurcate, die, or be sampled with time-dependent birth, death, and sampling rates, generating phylogenetic trees. It has been shown that in some subclasses of such models, different sets of rates can result in the same distributions of reconstructed phylogenetic trees, and therefore, the rates become unidentifiable from the trees regardless of their size. Here, we show that widely used time-dependent fossilized birth-death (FBD) models are identifiable. This subclass of models makes more realistic assumptions about the fossilization process and certain infectious disease transmission processes than the unidentifiable birth-death sampling models. Namely, FBD models assume that sampled lineages stay in the process rather than being immediately removed upon sampling. The identifiability of the time-dependent FBD model justifies using statistical methods that implement this model to infer the underlying temporal diversification or epidemiological dynamics from phylogenetic trees or directly from molecular or other comparative data. We further show that the time-dependent FBD model with an extra parameter, the removal after sampling probability, is unidentifiable. This implies that in scenarios where we do not know how sampling affects lineages, we are unable to infer this extra parameter together with birth, death, and sampling rates solely from trees.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Systematic Biology 生物-进化生物学

CiteScore

13.00

自引率

7.70%

发文量

审稿时长

6-12 weeks

期刊介绍： Systematic Biology is the bimonthly journal of the Society of Systematic Biologists. Papers for the journal are original contributions to the theory, principles, and methods of systematics as well as phylogeny, evolution, morphology, biogeography, paleontology, genetics, and the classification of all living things. A Points of View section offers a forum for discussion, while book reviews and announcements of general interest are also featured.