{"title":"大数据聚类计算方法研究","authors":"Lijun Chen, Zhengjun Pan, Lina Yuan","doi":"10.1145/3386415.3386964","DOIUrl":null,"url":null,"abstract":"In the past few years, the rapidly developing technology in the field of information technology is \"big data\". Clustering is one of the key tasks in a wide range of areas dealing with large amounts of data. This survey introduces various clustering methods used for effective big data clustering. Therefore, this review paper reviewed 15 research papers, which proposed various methods for effective big data clustering, such as k-means clustering, k-means variant clustering, fuzzy c-means clustering, possibility c-means clustering, collaborative filtering and optimization based clustering. In addition, detailed analysis is carried out by referring to the implementation tools used, the data sets used and the big data clustering framework adopted. Then, an effective solution must be developed to go beyond the existing technology to the special management of big data. Finally, the research problems and gaps of various big data clustering technologies are proposed to enable researchers to start with better big data clustering.","PeriodicalId":250211,"journal":{"name":"Proceedings of the 2nd International Conference on Information Technologies and Electrical Engineering","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Study on Clustering Computing Methods of Big Data\",\"authors\":\"Lijun Chen, Zhengjun Pan, Lina Yuan\",\"doi\":\"10.1145/3386415.3386964\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the past few years, the rapidly developing technology in the field of information technology is \\\"big data\\\". Clustering is one of the key tasks in a wide range of areas dealing with large amounts of data. This survey introduces various clustering methods used for effective big data clustering. Therefore, this review paper reviewed 15 research papers, which proposed various methods for effective big data clustering, such as k-means clustering, k-means variant clustering, fuzzy c-means clustering, possibility c-means clustering, collaborative filtering and optimization based clustering. In addition, detailed analysis is carried out by referring to the implementation tools used, the data sets used and the big data clustering framework adopted. Then, an effective solution must be developed to go beyond the existing technology to the special management of big data. Finally, the research problems and gaps of various big data clustering technologies are proposed to enable researchers to start with better big data clustering.\",\"PeriodicalId\":250211,\"journal\":{\"name\":\"Proceedings of the 2nd International Conference on Information Technologies and Electrical Engineering\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd International Conference on Information Technologies and Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3386415.3386964\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Information Technologies and Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3386415.3386964","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In the past few years, the rapidly developing technology in the field of information technology is "big data". Clustering is one of the key tasks in a wide range of areas dealing with large amounts of data. This survey introduces various clustering methods used for effective big data clustering. Therefore, this review paper reviewed 15 research papers, which proposed various methods for effective big data clustering, such as k-means clustering, k-means variant clustering, fuzzy c-means clustering, possibility c-means clustering, collaborative filtering and optimization based clustering. In addition, detailed analysis is carried out by referring to the implementation tools used, the data sets used and the big data clustering framework adopted. Then, an effective solution must be developed to go beyond the existing technology to the special management of big data. Finally, the research problems and gaps of various big data clustering technologies are proposed to enable researchers to start with better big data clustering.