{"title":"基于改进主成分分析的特征选择","authors":"Zhang Li, Yihui Qiu","doi":"10.1145/3590003.3590036","DOIUrl":null,"url":null,"abstract":"Abstract: The filtered feature selection method has low computational complexity and less time, and is widely used in feature selection, but the filtered method only considers the importance of features for label classification and ignores the correlation between features. For this reason, a feature selection method with improved principal component analysis is proposed. The main idea of the method is that on the basis of principal components, the loadings of each indicator on different principal components and their variance contribution ratios with that principal component are considered. A number of indicators with the largest cumulative contribution rates were selected, so that the final extracted indicators retained more information. Subsequently, comparative experiments are conducted using the UCI dataset, and the results show that the approach proposed in this paper has some superiority over other methods. Finally, the features of China's green innovation efficiency are selected using the approach proposed in this paper to demonstrate the feasibility of the method.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"232 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature selection based on improved principal component analysis\",\"authors\":\"Zhang Li, Yihui Qiu\",\"doi\":\"10.1145/3590003.3590036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract: The filtered feature selection method has low computational complexity and less time, and is widely used in feature selection, but the filtered method only considers the importance of features for label classification and ignores the correlation between features. For this reason, a feature selection method with improved principal component analysis is proposed. The main idea of the method is that on the basis of principal components, the loadings of each indicator on different principal components and their variance contribution ratios with that principal component are considered. A number of indicators with the largest cumulative contribution rates were selected, so that the final extracted indicators retained more information. Subsequently, comparative experiments are conducted using the UCI dataset, and the results show that the approach proposed in this paper has some superiority over other methods. Finally, the features of China's green innovation efficiency are selected using the approach proposed in this paper to demonstrate the feasibility of the method.\",\"PeriodicalId\":340225,\"journal\":{\"name\":\"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning\",\"volume\":\"232 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3590003.3590036\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3590003.3590036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature selection based on improved principal component analysis
Abstract: The filtered feature selection method has low computational complexity and less time, and is widely used in feature selection, but the filtered method only considers the importance of features for label classification and ignores the correlation between features. For this reason, a feature selection method with improved principal component analysis is proposed. The main idea of the method is that on the basis of principal components, the loadings of each indicator on different principal components and their variance contribution ratios with that principal component are considered. A number of indicators with the largest cumulative contribution rates were selected, so that the final extracted indicators retained more information. Subsequently, comparative experiments are conducted using the UCI dataset, and the results show that the approach proposed in this paper has some superiority over other methods. Finally, the features of China's green innovation efficiency are selected using the approach proposed in this paper to demonstrate the feasibility of the method.