{"title":"高维数据分解的改进正则化","authors":"Sheng Chai, W. Feng, Hossam S. Hassanein","doi":"10.1109/WI-IAT55865.2022.00113","DOIUrl":null,"url":null,"abstract":"With the increased dimensionality of datasets, high-dimensional data decomposition models have become essential data analysis tools. However, the decomposition method usually suffers from the overfitting problem and, consequently, cannot achieve state-of-the-art performance. This motivates the introduction of various regularization terms. The commonly applied Ridge regression has limited applicability for the asperity dataset and reduces performance for sparse data, while the Lasso regression has higher efficiency in the sparse dataset. To address this challenge, we propose a modified regularization term designed by integrating both the Lasso and Ridge regressions. The different roles of these two regressions are analyzed. By adjusting the weights of the regression in the regularization term, the existing decomposition method can be applied to the dataset with different degrees of sparsity. The experiments show that the modified regularization term yields consistent improvement in the performance of existing benchmarks.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modified Regularization for High-dimensional Data Decomposition\",\"authors\":\"Sheng Chai, W. Feng, Hossam S. Hassanein\",\"doi\":\"10.1109/WI-IAT55865.2022.00113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the increased dimensionality of datasets, high-dimensional data decomposition models have become essential data analysis tools. However, the decomposition method usually suffers from the overfitting problem and, consequently, cannot achieve state-of-the-art performance. This motivates the introduction of various regularization terms. The commonly applied Ridge regression has limited applicability for the asperity dataset and reduces performance for sparse data, while the Lasso regression has higher efficiency in the sparse dataset. To address this challenge, we propose a modified regularization term designed by integrating both the Lasso and Ridge regressions. The different roles of these two regressions are analyzed. By adjusting the weights of the regression in the regularization term, the existing decomposition method can be applied to the dataset with different degrees of sparsity. The experiments show that the modified regularization term yields consistent improvement in the performance of existing benchmarks.\",\"PeriodicalId\":345445,\"journal\":{\"name\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI-IAT55865.2022.00113\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT55865.2022.00113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Modified Regularization for High-dimensional Data Decomposition
With the increased dimensionality of datasets, high-dimensional data decomposition models have become essential data analysis tools. However, the decomposition method usually suffers from the overfitting problem and, consequently, cannot achieve state-of-the-art performance. This motivates the introduction of various regularization terms. The commonly applied Ridge regression has limited applicability for the asperity dataset and reduces performance for sparse data, while the Lasso regression has higher efficiency in the sparse dataset. To address this challenge, we propose a modified regularization term designed by integrating both the Lasso and Ridge regressions. The different roles of these two regressions are analyzed. By adjusting the weights of the regression in the regularization term, the existing decomposition method can be applied to the dataset with different degrees of sparsity. The experiments show that the modified regularization term yields consistent improvement in the performance of existing benchmarks.