{"title":"Modified Regularization for High-dimensional Data Decomposition","authors":"Sheng Chai, W. Feng, Hossam S. Hassanein","doi":"10.1109/WI-IAT55865.2022.00113","DOIUrl":null,"url":null,"abstract":"With the increased dimensionality of datasets, high-dimensional data decomposition models have become essential data analysis tools. However, the decomposition method usually suffers from the overfitting problem and, consequently, cannot achieve state-of-the-art performance. This motivates the introduction of various regularization terms. The commonly applied Ridge regression has limited applicability for the asperity dataset and reduces performance for sparse data, while the Lasso regression has higher efficiency in the sparse dataset. To address this challenge, we propose a modified regularization term designed by integrating both the Lasso and Ridge regressions. The different roles of these two regressions are analyzed. By adjusting the weights of the regression in the regularization term, the existing decomposition method can be applied to the dataset with different degrees of sparsity. The experiments show that the modified regularization term yields consistent improvement in the performance of existing benchmarks.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT55865.2022.00113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the increased dimensionality of datasets, high-dimensional data decomposition models have become essential data analysis tools. However, the decomposition method usually suffers from the overfitting problem and, consequently, cannot achieve state-of-the-art performance. This motivates the introduction of various regularization terms. The commonly applied Ridge regression has limited applicability for the asperity dataset and reduces performance for sparse data, while the Lasso regression has higher efficiency in the sparse dataset. To address this challenge, we propose a modified regularization term designed by integrating both the Lasso and Ridge regressions. The different roles of these two regressions are analyzed. By adjusting the weights of the regression in the regularization term, the existing decomposition method can be applied to the dataset with different degrees of sparsity. The experiments show that the modified regularization term yields consistent improvement in the performance of existing benchmarks.