{"title":"可记忆基础:向以人为中心的稀疏表示方向发展","authors":"Xiaoshuai Sun, H. Yao","doi":"10.1145/2393347.2396306","DOIUrl":null,"url":null,"abstract":"Previous studies of sparse representation in multimedia research focus on developing reliable and efficient dictionary learning algorithms. Despite the sparse prior, how to integrate other related perceptual factors of human being into dictionary learning process was seldom studied. In this paper, we investigate the influence of image memorability for human-centralized sparse representation. Based on the results of a photo memory game, we are able to quantitatively characterize an image's memorability which allows us to train sparse bases from the most memorable images instead of randomly selected natural images. We believed that such kind of basis is more consistent with neural networks in human brain and hence can better predict where human looks. To test our hypothesis, we choose human eye-fixation prediction problem for quantitative evaluation. The experimental results demonstrate the superior performance of our Memorable Basis compared to traditional sparse basis trained from unselected images.","PeriodicalId":212654,"journal":{"name":"Proceedings of the 20th ACM international conference on Multimedia","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Memorable basis: towards human-centralized sparse representation\",\"authors\":\"Xiaoshuai Sun, H. Yao\",\"doi\":\"10.1145/2393347.2396306\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Previous studies of sparse representation in multimedia research focus on developing reliable and efficient dictionary learning algorithms. Despite the sparse prior, how to integrate other related perceptual factors of human being into dictionary learning process was seldom studied. In this paper, we investigate the influence of image memorability for human-centralized sparse representation. Based on the results of a photo memory game, we are able to quantitatively characterize an image's memorability which allows us to train sparse bases from the most memorable images instead of randomly selected natural images. We believed that such kind of basis is more consistent with neural networks in human brain and hence can better predict where human looks. To test our hypothesis, we choose human eye-fixation prediction problem for quantitative evaluation. The experimental results demonstrate the superior performance of our Memorable Basis compared to traditional sparse basis trained from unselected images.\",\"PeriodicalId\":212654,\"journal\":{\"name\":\"Proceedings of the 20th ACM international conference on Multimedia\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 20th ACM international conference on Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2393347.2396306\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 20th ACM international conference on Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2393347.2396306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Memorable basis: towards human-centralized sparse representation
Previous studies of sparse representation in multimedia research focus on developing reliable and efficient dictionary learning algorithms. Despite the sparse prior, how to integrate other related perceptual factors of human being into dictionary learning process was seldom studied. In this paper, we investigate the influence of image memorability for human-centralized sparse representation. Based on the results of a photo memory game, we are able to quantitatively characterize an image's memorability which allows us to train sparse bases from the most memorable images instead of randomly selected natural images. We believed that such kind of basis is more consistent with neural networks in human brain and hence can better predict where human looks. To test our hypothesis, we choose human eye-fixation prediction problem for quantitative evaluation. The experimental results demonstrate the superior performance of our Memorable Basis compared to traditional sparse basis trained from unselected images.