{"title":"Stochastic Principal Component Analysis Via Mean Absolute Projection Maximization","authors":"M. Dhanaraj, Panos P. Markopoulos","doi":"10.1109/GlobalSIP45357.2019.8969411","DOIUrl":null,"url":null,"abstract":"Principal-Component Analysis (PCA) is a data processing method with numerous applications in signal processing and machine learning. At the same time, standard PCA has been shown to be very sensitive against faulty/outlying data. On the other hand, L1-norm-based PCA (L1-PCA), seeking to maximize the aggregate absolute projections of the processed data, has demonstrated sturdy corruption resistance. At the same time, in our big data era, there is a need for online (stochastic) algorithms for data analysis with limited storage and computation requirements. To this end, in this paper we extend batch L1-PCA and propose a novel algorithm for stochastic PC calculation based on mean absolute projection maximization, with formal convergence guarantees. Our numerical studies demonstrate the convergence and corroborate the corruption resistance of the proposed method.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP45357.2019.8969411","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Principal-Component Analysis (PCA) is a data processing method with numerous applications in signal processing and machine learning. At the same time, standard PCA has been shown to be very sensitive against faulty/outlying data. On the other hand, L1-norm-based PCA (L1-PCA), seeking to maximize the aggregate absolute projections of the processed data, has demonstrated sturdy corruption resistance. At the same time, in our big data era, there is a need for online (stochastic) algorithms for data analysis with limited storage and computation requirements. To this end, in this paper we extend batch L1-PCA and propose a novel algorithm for stochastic PC calculation based on mean absolute projection maximization, with formal convergence guarantees. Our numerical studies demonstrate the convergence and corroborate the corruption resistance of the proposed method.