{"title":"以对象为中心的场景理解图像记忆预测","authors":"Sejong Yoon, Jongpil Kim","doi":"10.1109/MIPR.2018.00070","DOIUrl":null,"url":null,"abstract":"Computational image memorability prediction has made significant progress in recent years. It is reported that we can robustly estimate the memorability of images with many different object and scene classes. However, the large scale data-based method including deep Convolutional Neural Networks (CNNs) showed a room for improvement when it was applied to smaller benchmark dataset. In this work, we investigate the missing link that causes such performance gap via in-depth qualitative analysis, and then provide suggestions to bridge the gap. Specifically, we study the relationship between the image memorability and the object spatial composition within the scene depicted by an image. Our hypothesis is that the image memorability is closely related to the composition of the scene, that is beyond mere location and existence. Experimental results show that the recent advances in scene parsing methods, which extracts contextual information of the image, may not only help better understanding of the image memorability and the object composition, but also show promising potential in improving computational memorability prediction.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Object-Centric Scene Understanding for Image Memorability Prediction\",\"authors\":\"Sejong Yoon, Jongpil Kim\",\"doi\":\"10.1109/MIPR.2018.00070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computational image memorability prediction has made significant progress in recent years. It is reported that we can robustly estimate the memorability of images with many different object and scene classes. However, the large scale data-based method including deep Convolutional Neural Networks (CNNs) showed a room for improvement when it was applied to smaller benchmark dataset. In this work, we investigate the missing link that causes such performance gap via in-depth qualitative analysis, and then provide suggestions to bridge the gap. Specifically, we study the relationship between the image memorability and the object spatial composition within the scene depicted by an image. Our hypothesis is that the image memorability is closely related to the composition of the scene, that is beyond mere location and existence. Experimental results show that the recent advances in scene parsing methods, which extracts contextual information of the image, may not only help better understanding of the image memorability and the object composition, but also show promising potential in improving computational memorability prediction.\",\"PeriodicalId\":320000,\"journal\":{\"name\":\"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MIPR.2018.00070\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR.2018.00070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object-Centric Scene Understanding for Image Memorability Prediction
Computational image memorability prediction has made significant progress in recent years. It is reported that we can robustly estimate the memorability of images with many different object and scene classes. However, the large scale data-based method including deep Convolutional Neural Networks (CNNs) showed a room for improvement when it was applied to smaller benchmark dataset. In this work, we investigate the missing link that causes such performance gap via in-depth qualitative analysis, and then provide suggestions to bridge the gap. Specifically, we study the relationship between the image memorability and the object spatial composition within the scene depicted by an image. Our hypothesis is that the image memorability is closely related to the composition of the scene, that is beyond mere location and existence. Experimental results show that the recent advances in scene parsing methods, which extracts contextual information of the image, may not only help better understanding of the image memorability and the object composition, but also show promising potential in improving computational memorability prediction.