{"title":"Targeted Projection Pursuit for Interactive Exploration of High- Dimensional Data Sets","authors":"Joe Faith","doi":"10.1109/IV.2007.107","DOIUrl":null,"url":null,"abstract":"High-dimensional data is, by its nature, difficult to visualise. Many current techniques involve reducing the dimensionality of the data, which results in a loss of information. Targeted Projection Pursuit is a novel method for visualising high-dimensional datasets which allows the user to interactively explore the space of possible views to find those that meet their requirements. A prototype tool that utilises this method is introduced, and is shown to allow users to explore data through an interface that is transparent and efficient. The tool and underlying technique are general purpose - applicable to any high-dimensional numeric data, and supporting a wide range of exploratory data analysis activities - but are evaluated on three particular tasks using gene expression data: identifying discriminatory genes, visualising diagnostic classes, and detecting misdiagnosed samples. It is found to perform well in comparison with standard techniques.","PeriodicalId":177429,"journal":{"name":"2007 11th International Conference Information Visualization (IV '07)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 11th International Conference Information Visualization (IV '07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV.2007.107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24
Abstract
High-dimensional data is, by its nature, difficult to visualise. Many current techniques involve reducing the dimensionality of the data, which results in a loss of information. Targeted Projection Pursuit is a novel method for visualising high-dimensional datasets which allows the user to interactively explore the space of possible views to find those that meet their requirements. A prototype tool that utilises this method is introduced, and is shown to allow users to explore data through an interface that is transparent and efficient. The tool and underlying technique are general purpose - applicable to any high-dimensional numeric data, and supporting a wide range of exploratory data analysis activities - but are evaluated on three particular tasks using gene expression data: identifying discriminatory genes, visualising diagnostic classes, and detecting misdiagnosed samples. It is found to perform well in comparison with standard techniques.