SVD在机器学习中的研究与实现

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) Pub Date : 2017-05-01 DOI:10.1109/ICIS.2017.7960038

Yongchang Wang, Ligu Zhu

{"title":"SVD在机器学习中的研究与实现","authors":"Yongchang Wang, Ligu Zhu","doi":"10.1109/ICIS.2017.7960038","DOIUrl":null,"url":null,"abstract":"With the arrival of the era of big data, people's ability to collect and obtain data is becoming more powerful. These data have shown the characteristics of high dimension, large scale and complex structure. High dimensional data has seriously hindered the efficiency of data mining algorithm, we call it \"the Dimension disaster \". Therefore, dimension reduction technology has become the primary task of big data mining and machine learning. In this paper, we focus on the method of data reduction, described the category of data dimension reduction. The research status and main algorithms of dimension reduction method are described in detail. This paper briefly introduces the latest research progress of data dimension reduction algorithm, including some popular algorithm such as PCA, KPCA, SVD, etc. The principle of principal component analysis (PCA) is discussed in this article, and the singular value decomposition (SVD) theorem is introduced to solve the problem that the PCA method has a large amount of computation, we also give a comparison of PCA and SVD. Finally, we design and implement some experiments to verify the application of SVD in data analysis and latent semantic indexing.","PeriodicalId":301467,"journal":{"name":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Research and implementation of SVD in machine learning\",\"authors\":\"Yongchang Wang, Ligu Zhu\",\"doi\":\"10.1109/ICIS.2017.7960038\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the arrival of the era of big data, people's ability to collect and obtain data is becoming more powerful. These data have shown the characteristics of high dimension, large scale and complex structure. High dimensional data has seriously hindered the efficiency of data mining algorithm, we call it \\\"the Dimension disaster \\\". Therefore, dimension reduction technology has become the primary task of big data mining and machine learning. In this paper, we focus on the method of data reduction, described the category of data dimension reduction. The research status and main algorithms of dimension reduction method are described in detail. This paper briefly introduces the latest research progress of data dimension reduction algorithm, including some popular algorithm such as PCA, KPCA, SVD, etc. The principle of principal component analysis (PCA) is discussed in this article, and the singular value decomposition (SVD) theorem is introduced to solve the problem that the PCA method has a large amount of computation, we also give a comparison of PCA and SVD. Finally, we design and implement some experiments to verify the application of SVD in data analysis and latent semantic indexing.\",\"PeriodicalId\":301467,\"journal\":{\"name\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIS.2017.7960038\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2017.7960038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

摘要

随着大数据时代的到来，人们收集和获取数据的能力越来越强大。这些数据具有高维数、规模大、结构复杂的特点。高维数据严重阻碍了数据挖掘算法的效率，我们称之为“维数灾难”。因此，降维技术已成为大数据挖掘和机器学习的首要任务。本文重点介绍了数据降维的方法，描述了数据降维的范畴。详细介绍了降维方法的研究现状和主要算法。本文简要介绍了数据降维算法的最新研究进展，包括常用的PCA、KPCA、SVD等算法。本文讨论了主成分分析(PCA)的原理，引入奇异值分解(SVD)定理，解决了主成分分析方法计算量大的问题，并对主成分分析和奇异值分解进行了比较。最后，我们设计并实现了一些实验来验证奇异向量分解在数据分析和潜在语义索引中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Research and implementation of SVD in machine learning

With the arrival of the era of big data, people's ability to collect and obtain data is becoming more powerful. These data have shown the characteristics of high dimension, large scale and complex structure. High dimensional data has seriously hindered the efficiency of data mining algorithm, we call it "the Dimension disaster ". Therefore, dimension reduction technology has become the primary task of big data mining and machine learning. In this paper, we focus on the method of data reduction, described the category of data dimension reduction. The research status and main algorithms of dimension reduction method are described in detail. This paper briefly introduces the latest research progress of data dimension reduction algorithm, including some popular algorithm such as PCA, KPCA, SVD, etc. The principle of principal component analysis (PCA) is discussed in this article, and the singular value decomposition (SVD) theorem is introduced to solve the problem that the PCA method has a large amount of computation, we also give a comparison of PCA and SVD. Finally, we design and implement some experiments to verify the application of SVD in data analysis and latent semantic indexing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)

自引率

0.00%

发文量