{"title":"Credible visualizations for planar projections","authors":"A. Ultsch, Michael C. Thrun","doi":"10.1109/WSOM.2017.8020010","DOIUrl":null,"url":null,"abstract":"Planar projections, i.e. projections from a high dimensional data space onto a two dimensional plane, are still in use to detect structures, such as clusters, in multivariate data. It can be shown that only the subclass of focusing projections such as CCA, NeRV and the ESOM are able to disentangle linear non separable data. However, even these projections are sometimes erroneous. U-matrix methods are able to visualize these errors for SOM based projections. This paper extends the U-matrix methods to other projections in form of a so called generalized U-matrix. Based on previous work, an algorithm for the construction of generalized U-matrix is introduced, that is more efficient and free of parameters which may be hard to determine. Results are presented on a difficult artificial data set and a real word multivariate data set from cancer research.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSOM.2017.8020010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
Planar projections, i.e. projections from a high dimensional data space onto a two dimensional plane, are still in use to detect structures, such as clusters, in multivariate data. It can be shown that only the subclass of focusing projections such as CCA, NeRV and the ESOM are able to disentangle linear non separable data. However, even these projections are sometimes erroneous. U-matrix methods are able to visualize these errors for SOM based projections. This paper extends the U-matrix methods to other projections in form of a so called generalized U-matrix. Based on previous work, an algorithm for the construction of generalized U-matrix is introduced, that is more efficient and free of parameters which may be hard to determine. Results are presented on a difficult artificial data set and a real word multivariate data set from cancer research.