{"title":"Efficient sequential karhunen-loeve basis extraction","authors":"A. Levy, M. Lindenbaum","doi":"10.1109/ICCV.2001.937701","DOIUrl":null,"url":null,"abstract":"The Karhunen-Loeve (KL) transform is an optimal method for approximating a set of vectors or images by a low dimensional subspace. The method provides the optimal partial KL basis, which minimizes the MSE between the given set of vectors and their projections on this basis. In computer vision it is used for a variety of tasks such as object recognition, motion estimation, visual learning and object tracking. Calculating the IU basis for N images of size M , where M >> N , requires roughly O ( M N 2 ) operations and O ( M N ) units of memory. In many applications, this large computational demands may be prohibitive. Here, we suggest an approach to reduce the computational effort, relying on the relatively small dimension (denoted K ) of the partial KL basis, that is usually needed. We propose an algorithm that does not require to store the entire set of input images before proceeding to the calculation of the KL basis. Rather, it takes the images in small blocks and updates the required KL basis sequentially.","PeriodicalId":429441,"journal":{"name":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2001.937701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
The Karhunen-Loeve (KL) transform is an optimal method for approximating a set of vectors or images by a low dimensional subspace. The method provides the optimal partial KL basis, which minimizes the MSE between the given set of vectors and their projections on this basis. In computer vision it is used for a variety of tasks such as object recognition, motion estimation, visual learning and object tracking. Calculating the IU basis for N images of size M , where M >> N , requires roughly O ( M N 2 ) operations and O ( M N ) units of memory. In many applications, this large computational demands may be prohibitive. Here, we suggest an approach to reduce the computational effort, relying on the relatively small dimension (denoted K ) of the partial KL basis, that is usually needed. We propose an algorithm that does not require to store the entire set of input images before proceeding to the calculation of the KL basis. Rather, it takes the images in small blocks and updates the required KL basis sequentially.