Exploring data by PCA and k-means for IEEE Xplore digital library

J. Anzola, Luz Andrea Rodríguez Rojas, G. T. Bermúdez
{"title":"Exploring data by PCA and k-means for IEEE Xplore digital library","authors":"J. Anzola, Luz Andrea Rodríguez Rojas, G. T. Bermúdez","doi":"10.1145/2925995.2926007","DOIUrl":null,"url":null,"abstract":"An important feature in data analysis is the exploration and data representation. This article describes the Principal Components Analysis techniques (PCA) and clusters analysis with k-means, in order to represent a set of two-dimensional spatial data and group similar data to find relationships between the two techniques. Data is extracted from IEEE Xplore digital library, which lacks processing tools and information display since it doesn't permit analysis and identification of trends and patterns in a query. At the end of the article, is discussed as a technique of data analysis unsupervised allows grouping and organizing of data by proximity based on the variance, finding similar keywords between groups and major components, allowing temporary and evolutionary view of a set of keywords, which can later be interpreted as topics and areas of exploration and research.","PeriodicalId":159180,"journal":{"name":"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the The 11th International Knowledge Management in Organizations Conference on The changing face of Knowledge Management Impacting Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2925995.2926007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

An important feature in data analysis is the exploration and data representation. This article describes the Principal Components Analysis techniques (PCA) and clusters analysis with k-means, in order to represent a set of two-dimensional spatial data and group similar data to find relationships between the two techniques. Data is extracted from IEEE Xplore digital library, which lacks processing tools and information display since it doesn't permit analysis and identification of trends and patterns in a query. At the end of the article, is discussed as a technique of data analysis unsupervised allows grouping and organizing of data by proximity based on the variance, finding similar keywords between groups and major components, allowing temporary and evolutionary view of a set of keywords, which can later be interpreted as topics and areas of exploration and research.
基于PCA和k-means的IEEE explore数字图书馆数据挖掘
数据分析的一个重要特征是数据挖掘和数据表示。本文介绍了主成分分析技术(PCA)和k-means聚类分析,以表示一组二维空间数据,并对相似的数据进行分组,以发现两种技术之间的关系。数据是从IEEE Xplore数字图书馆中提取的,它缺乏处理工具和信息显示,因为它不允许分析和识别查询中的趋势和模式。在文章的最后,讨论了作为一种数据分析技术的无监督允许分组和组织基于方差的接近数据,在组和主要组件之间找到相似的关键字,允许一组关键字的临时和进化视图,这些关键字可以稍后被解释为主题和探索和研究领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信