{"title":"Machine Learning for Precision Medicine: Model Selection, Estimation, and Inference","authors":"Yi Li","doi":"10.11159/icsta22.003","DOIUrl":null,"url":null,"abstract":"- In the era of precision medicine, high-throughput data are routinely collected. These high dimensional data defy classical regression models, which are either infeasible to fit or likely to incur low predictability because of overfitting. In this talk we will introduce several cutting-edge machine learning methods, developed by my group in the last few years, for modeling (censored) outcome data with high dimensional predictors. Specifically, we will introduce a Dantzig selector for fitting survival models with high dimensional predictors, followed by various semiparametric and nonparametric feature screening methods for handling ultra-high dimensional predictors. We will also discuss statistical inference for regression models with high dimensional predictors. With high dimensional outcome data, we will introduce a new class of high dimensional Gaussian graphical regression models with predictors. The talk focuses on statistical principles and concepts behind these methods, which are motivated and illustrated by various biomedical examples, which have precision medicine contexts.","PeriodicalId":325859,"journal":{"name":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11159/icsta22.003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
- In the era of precision medicine, high-throughput data are routinely collected. These high dimensional data defy classical regression models, which are either infeasible to fit or likely to incur low predictability because of overfitting. In this talk we will introduce several cutting-edge machine learning methods, developed by my group in the last few years, for modeling (censored) outcome data with high dimensional predictors. Specifically, we will introduce a Dantzig selector for fitting survival models with high dimensional predictors, followed by various semiparametric and nonparametric feature screening methods for handling ultra-high dimensional predictors. We will also discuss statistical inference for regression models with high dimensional predictors. With high dimensional outcome data, we will introduce a new class of high dimensional Gaussian graphical regression models with predictors. The talk focuses on statistical principles and concepts behind these methods, which are motivated and illustrated by various biomedical examples, which have precision medicine contexts.