{"title":"Regression with right-censored high-dimensional data: An application with different imputation techniques","authors":"E. Yılmaz, D. Aydın, S. Ahmed","doi":"10.48129/kjs.splml.18961","DOIUrl":null,"url":null,"abstract":"This study aims to introduce four modified linear estimators for the right-censored high-dimensional data. Obviously, data of interest involves two important problems to be solved that are censorship and high dimensionality. This paper can be distinguished from other studies in the literature with that it achieves to handle these two problems simultaneously. The main contribution of the paper is merging weightedridge method with the imputation techniques to obtain more efficient estimators than its alternatives. To solve the censorship problem, four imputation techniques are considered based on machine learning algorithms kNN, sliding-windows, regression and support vector machines. The high-dimensionality problem is handled by the weighted ridge approach which provides estimator with less risk than its alternatives because it detects the covariates with a weak contribution via the post-selection procedure. To show the empirical performance of the introduced estimators, a simulation study is made and comparative results are presented. Results show that kNN and regression imputation basis WR esitmators show satisfying performances on estimation of the high-dimensional right-censored model.","PeriodicalId":49933,"journal":{"name":"Kuwait Journal of Science & Engineering","volume":"90 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kuwait Journal of Science & Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48129/kjs.splml.18961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to introduce four modified linear estimators for the right-censored high-dimensional data. Obviously, data of interest involves two important problems to be solved that are censorship and high dimensionality. This paper can be distinguished from other studies in the literature with that it achieves to handle these two problems simultaneously. The main contribution of the paper is merging weightedridge method with the imputation techniques to obtain more efficient estimators than its alternatives. To solve the censorship problem, four imputation techniques are considered based on machine learning algorithms kNN, sliding-windows, regression and support vector machines. The high-dimensionality problem is handled by the weighted ridge approach which provides estimator with less risk than its alternatives because it detects the covariates with a weak contribution via the post-selection procedure. To show the empirical performance of the introduced estimators, a simulation study is made and comparative results are presented. Results show that kNN and regression imputation basis WR esitmators show satisfying performances on estimation of the high-dimensional right-censored model.