{"title":"A new method for dimensionality reduction of multi-dimensional data using Copulas","authors":"Rima Houari, A. Bounceur, Mohand Tahar Kechadi","doi":"10.1109/ISPS.2013.6581491","DOIUrl":null,"url":null,"abstract":"A new technique for the Dimensionality Reduction of Multi-Dimensional Data is presented in this paper. This technique employs the theory of Copulas to estimate the multivariate joint probability distribution without constraints to specific types of marginal distributions of random variables that represent the dimensions of our Data. A Copulas-based model, provides a complete and scale-free description of dependence that is more suitable to be modeled using well-known multivariate parametric laws. The model can be readily used for comparing of dependence of random variables by estimating the parameters of the Copula and to better see the relationship between data. This dependence is thereafter used for detecting the Redundant Values and noise in order to clean the original data, reduce them (eliminate Redundant attributes) and obtain representative Samples of good quality. We compared the proposed approach with singular values decomposition (SVD) technique, one of the most efficient method of Data mining.","PeriodicalId":222438,"journal":{"name":"2013 11th International Symposium on Programming and Systems (ISPS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 11th International Symposium on Programming and Systems (ISPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPS.2013.6581491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A new technique for the Dimensionality Reduction of Multi-Dimensional Data is presented in this paper. This technique employs the theory of Copulas to estimate the multivariate joint probability distribution without constraints to specific types of marginal distributions of random variables that represent the dimensions of our Data. A Copulas-based model, provides a complete and scale-free description of dependence that is more suitable to be modeled using well-known multivariate parametric laws. The model can be readily used for comparing of dependence of random variables by estimating the parameters of the Copula and to better see the relationship between data. This dependence is thereafter used for detecting the Redundant Values and noise in order to clean the original data, reduce them (eliminate Redundant attributes) and obtain representative Samples of good quality. We compared the proposed approach with singular values decomposition (SVD) technique, one of the most efficient method of Data mining.