{"title":"Cluster‐scaled principal component analysis","authors":"M. Sato-Ilic","doi":"10.1002/wics.1572","DOIUrl":null,"url":null,"abstract":"Cluster‐scaled analysis means exploiting the cluster‐based scaling to conventional data analysis to obtain more accurate results or results that we cannot obtain by using ordinary analysis. Our target data is complex and large amounts of data. For this type of data, it is well known that ordinary statistical methods do not always work well, or theoretically, we know that we cannot obtain a correct result. As a tool of this implementation, we utilize fuzzy clustering, which is well known as a robust clustering to a complex and large amount of data. That is, we use the fuzzy clustering result as a scale of data and apply the rescaled data by the cluster‐scale to another target analysis. Our target analysis in this article is principal component analysis, which is a well‐known dimensional reduction method. A numerical example shows a better performance of the cluster‐scaled principal component analysis.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2021-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews-Computational Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/wics.1572","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 3
Abstract
Cluster‐scaled analysis means exploiting the cluster‐based scaling to conventional data analysis to obtain more accurate results or results that we cannot obtain by using ordinary analysis. Our target data is complex and large amounts of data. For this type of data, it is well known that ordinary statistical methods do not always work well, or theoretically, we know that we cannot obtain a correct result. As a tool of this implementation, we utilize fuzzy clustering, which is well known as a robust clustering to a complex and large amount of data. That is, we use the fuzzy clustering result as a scale of data and apply the rescaled data by the cluster‐scale to another target analysis. Our target analysis in this article is principal component analysis, which is a well‐known dimensional reduction method. A numerical example shows a better performance of the cluster‐scaled principal component analysis.