{"title":"使用 Cox 模型的稳健变量选择方法--一项选择性实用基准研究。","authors":"Yunwei Zhang, Samuel Muller","doi":"10.1093/bib/bbae508","DOIUrl":null,"url":null,"abstract":"<p><p>With the advancement of biological and medical techniques, we can now obtain large amounts of high-dimensional omics data with censored survival information. This presents challenges in method development across various domains, particularly in variable selection. Given the inherently skewed distribution of the survival time outcome variable, robust variable selection methods offer potential solutions. Recently, there has been a focus on extending robust variable selection methods from linear regression models to survival models. However, despite these developments, robust methods are currently rarely used in practical applications, possibly due to a limited appreciation of their overall good performance. To address this gap, we conduct a selective review comparing the variable selection performance of twelve robust and non-robust penalised Cox models. Our study reveals the intricate relationship among covariates, survival outcomes, and modeling approaches, demonstrating how subtle variations can significantly impact the performance of methods considered. Based on our empirical research, we recommend the use of robust Cox models for variable selection in practice based on their superior performance in presence of outliers while maintaining good efficiency and accuracy when there are no outliers. This study provides valuable insights for method development and application, contributing to a better understanding of the relationship between correlated covariates and censored outcomes.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472364/pdf/","citationCount":"0","resultStr":"{\"title\":\"Robust variable selection methods with Cox model-a selective practical benchmark study.\",\"authors\":\"Yunwei Zhang, Samuel Muller\",\"doi\":\"10.1093/bib/bbae508\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>With the advancement of biological and medical techniques, we can now obtain large amounts of high-dimensional omics data with censored survival information. This presents challenges in method development across various domains, particularly in variable selection. Given the inherently skewed distribution of the survival time outcome variable, robust variable selection methods offer potential solutions. Recently, there has been a focus on extending robust variable selection methods from linear regression models to survival models. However, despite these developments, robust methods are currently rarely used in practical applications, possibly due to a limited appreciation of their overall good performance. To address this gap, we conduct a selective review comparing the variable selection performance of twelve robust and non-robust penalised Cox models. Our study reveals the intricate relationship among covariates, survival outcomes, and modeling approaches, demonstrating how subtle variations can significantly impact the performance of methods considered. Based on our empirical research, we recommend the use of robust Cox models for variable selection in practice based on their superior performance in presence of outliers while maintaining good efficiency and accuracy when there are no outliers. This study provides valuable insights for method development and application, contributing to a better understanding of the relationship between correlated covariates and censored outcomes.</p>\",\"PeriodicalId\":9209,\"journal\":{\"name\":\"Briefings in bioinformatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472364/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Briefings in bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/bib/bbae508\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbae508","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
With the advancement of biological and medical techniques, we can now obtain large amounts of high-dimensional omics data with censored survival information. This presents challenges in method development across various domains, particularly in variable selection. Given the inherently skewed distribution of the survival time outcome variable, robust variable selection methods offer potential solutions. Recently, there has been a focus on extending robust variable selection methods from linear regression models to survival models. However, despite these developments, robust methods are currently rarely used in practical applications, possibly due to a limited appreciation of their overall good performance. To address this gap, we conduct a selective review comparing the variable selection performance of twelve robust and non-robust penalised Cox models. Our study reveals the intricate relationship among covariates, survival outcomes, and modeling approaches, demonstrating how subtle variations can significantly impact the performance of methods considered. Based on our empirical research, we recommend the use of robust Cox models for variable selection in practice based on their superior performance in presence of outliers while maintaining good efficiency and accuracy when there are no outliers. This study provides valuable insights for method development and application, contributing to a better understanding of the relationship between correlated covariates and censored outcomes.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.