Yongyan Zhang, Guo Xie, Wenqing Wang, Xiaofan Wang, F. Qian, Xulong Du, Jinhua Du
{"title":"Distributed dimensionality reduction of industrial data based on clustering","authors":"Yongyan Zhang, Guo Xie, Wenqing Wang, Xiaofan Wang, F. Qian, Xulong Du, Jinhua Du","doi":"10.1109/ICIEA.2018.8397744","DOIUrl":null,"url":null,"abstract":"Large amounts of data are produced in system operation, and how to extract effective information from these data has become an important research topic in the industrial application. Dimensionality reduction is a way to refine the data. Because of the low efficiency of the existing methods, these methods can't discover the internal structure of the data. Regarding these problems, a distributed method of dimensionality reduction based on clustering is proposed, which includes the following steps:(1) Clustering the data into some small classes according to the similarity between the data variables; (2) reducing the dimension of data in a small class after being clustered respectively; (3) merging the data after being reduced dimension; (4) classifying the data after being merged by support vector machine (SVM). The data in the simulation is the test data, and the results show that the methods proposed in this paper are better than the existing dimensionality reduction methods.","PeriodicalId":140420,"journal":{"name":"2018 13th IEEE Conference on Industrial Electronics and Applications (ICIEA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 13th IEEE Conference on Industrial Electronics and Applications (ICIEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEA.2018.8397744","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Large amounts of data are produced in system operation, and how to extract effective information from these data has become an important research topic in the industrial application. Dimensionality reduction is a way to refine the data. Because of the low efficiency of the existing methods, these methods can't discover the internal structure of the data. Regarding these problems, a distributed method of dimensionality reduction based on clustering is proposed, which includes the following steps:(1) Clustering the data into some small classes according to the similarity between the data variables; (2) reducing the dimension of data in a small class after being clustered respectively; (3) merging the data after being reduced dimension; (4) classifying the data after being merged by support vector machine (SVM). The data in the simulation is the test data, and the results show that the methods proposed in this paper are better than the existing dimensionality reduction methods.