Huxiang Gu, Leibo Joel, Anselmi Fabio, Chunhong Pan, T. Poggio
{"title":"模糊和下采样变换的不变表示","authors":"Huxiang Gu, Leibo Joel, Anselmi Fabio, Chunhong Pan, T. Poggio","doi":"10.1109/ICIP.2016.7533029","DOIUrl":null,"url":null,"abstract":"Invariant representations of images can significantly reduce the sample complexity of a classifier performing object identification or categorization as shown in a recent analysis of invariant representations for object recognition. In the case of geometric transformations of images the theory [1] shows how invariant signatures can be learned in a biologically plausible way from unsupervised observations of the transformations of a set of randomly chosen template images. Here we extend the theory to non-geometric transformations such as blur and down-sampling. The proposed algorithm achieve an invariant representation via two simple biologically-plausible steps: 1. compute normalized dot products of the input with the stored transformations of each template, and 2. for each template compute the statistics of the resulting set of values such as the histogram or moments. The performance of our system on challenging blurred and low resolution face matching tasks exceeds the previous state-of-the-art by a large margin which grows with increasing image corruption.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Invariant representation for blur and down-sampling transformations\",\"authors\":\"Huxiang Gu, Leibo Joel, Anselmi Fabio, Chunhong Pan, T. Poggio\",\"doi\":\"10.1109/ICIP.2016.7533029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Invariant representations of images can significantly reduce the sample complexity of a classifier performing object identification or categorization as shown in a recent analysis of invariant representations for object recognition. In the case of geometric transformations of images the theory [1] shows how invariant signatures can be learned in a biologically plausible way from unsupervised observations of the transformations of a set of randomly chosen template images. Here we extend the theory to non-geometric transformations such as blur and down-sampling. The proposed algorithm achieve an invariant representation via two simple biologically-plausible steps: 1. compute normalized dot products of the input with the stored transformations of each template, and 2. for each template compute the statistics of the resulting set of values such as the histogram or moments. The performance of our system on challenging blurred and low resolution face matching tasks exceeds the previous state-of-the-art by a large margin which grows with increasing image corruption.\",\"PeriodicalId\":147245,\"journal\":{\"name\":\"International Conference on Information Photonics\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Information Photonics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIP.2016.7533029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Information Photonics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIP.2016.7533029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Invariant representation for blur and down-sampling transformations
Invariant representations of images can significantly reduce the sample complexity of a classifier performing object identification or categorization as shown in a recent analysis of invariant representations for object recognition. In the case of geometric transformations of images the theory [1] shows how invariant signatures can be learned in a biologically plausible way from unsupervised observations of the transformations of a set of randomly chosen template images. Here we extend the theory to non-geometric transformations such as blur and down-sampling. The proposed algorithm achieve an invariant representation via two simple biologically-plausible steps: 1. compute normalized dot products of the input with the stored transformations of each template, and 2. for each template compute the statistics of the resulting set of values such as the histogram or moments. The performance of our system on challenging blurred and low resolution face matching tasks exceeds the previous state-of-the-art by a large margin which grows with increasing image corruption.