SAMPLE SIZE ESTIMATION FOR CANCER PROGRESSION MODELS

C. Netzer, J. Rahnenführer
{"title":"SAMPLE SIZE ESTIMATION FOR CANCER PROGRESSION MODELS","authors":"C. Netzer, J. Rahnenführer","doi":"10.2316/J.2012.210-1029","DOIUrl":null,"url":null,"abstract":"Human tumours are often associated with the accumulation of chromosomal alterations in the cancer cells. The identification of characteristic pathogenic routes improves prediction of survival times and optimal therapy choice. The simplest model assumes independent alterations. Then progression is measured by the count statistic, the total number of alterations. An advanced model is the oncogenetic trees mixture model. An oncogenetic tree allows both independent and sequential relationships between alterations, and the mixture model divides the patients into groups with different progression paths. Progression along such a model can be quantified univariately by the GPS (genetic progression score). On real cancer data, the GPS was shown to discriminate better than the count statistic between patient subgroups with different survival prognosis. Here, in a simulation study, we evaluate the necessary numbers of patients for detecting true relationships between genetic progression and survival time. We generate survival times correlated with count statistic and GPS, respectively. If the simple model is the correct one, misspecification with the advanced model requires about 20% larger sample size, independent from the number of events. In contrast, misspecification with the simple model leads with increasing numbers of events from 20% to 70% larger sample size. Additionally, if the true data-generating model is the mixture model, the absolute numbers are more than twice as large, thus favouring the advanced modelling approach especially in situations with limited model knowledge.","PeriodicalId":330541,"journal":{"name":"International Journal of Computational Bioscience","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Bioscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2316/J.2012.210-1029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Human tumours are often associated with the accumulation of chromosomal alterations in the cancer cells. The identification of characteristic pathogenic routes improves prediction of survival times and optimal therapy choice. The simplest model assumes independent alterations. Then progression is measured by the count statistic, the total number of alterations. An advanced model is the oncogenetic trees mixture model. An oncogenetic tree allows both independent and sequential relationships between alterations, and the mixture model divides the patients into groups with different progression paths. Progression along such a model can be quantified univariately by the GPS (genetic progression score). On real cancer data, the GPS was shown to discriminate better than the count statistic between patient subgroups with different survival prognosis. Here, in a simulation study, we evaluate the necessary numbers of patients for detecting true relationships between genetic progression and survival time. We generate survival times correlated with count statistic and GPS, respectively. If the simple model is the correct one, misspecification with the advanced model requires about 20% larger sample size, independent from the number of events. In contrast, misspecification with the simple model leads with increasing numbers of events from 20% to 70% larger sample size. Additionally, if the true data-generating model is the mixture model, the absolute numbers are more than twice as large, thus favouring the advanced modelling approach especially in situations with limited model knowledge.
癌症进展模型的样本量估计
人类肿瘤通常与癌细胞中染色体改变的积累有关。特异性致病途径的识别提高了生存时间的预测和最佳治疗选择。最简单的模型假定独立的变化。然后通过计数统计来衡量进展,即改变的总数。一种先进的模型是肿瘤发生树混合模型。癌基因树允许改变之间的独立和顺序关系,混合模型将患者分为具有不同进展路径的组。沿着这样一个模型的进展可以通过GPS(遗传进展评分)单一地量化。在真实的癌症数据中,GPS在具有不同生存预后的患者亚组之间的区别优于计数统计。在这里,在一项模拟研究中,我们评估了检测遗传进展和生存时间之间真正关系所需的患者数量。我们分别生成与计数统计和GPS相关的生存时间。如果简单模型是正确的,则与高级模型的错配需要的样本量增加20%左右,与事件的数量无关。相反,简单模型的错误规范导致事件数量增加,样本量从20%增加到70%。此外,如果真正的数据生成模型是混合模型,则绝对数量会增加一倍以上,从而有利于高级建模方法,特别是在模型知识有限的情况下。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信