{"title":"结合手工和CNN特征的视觉情感预测","authors":"Wang Fengjiao, Masaki Aono","doi":"10.1109/ICAICTA.2018.8541312","DOIUrl":null,"url":null,"abstract":"Nowadays, more and more people are getting used to social media such as Instagram, Facebook, Twitter, and Flickr to post images and texts to express their sentiment and emotions on almost all events and subjects. In consequence, analyzing sentiment of the huge number of images and texts on social networks has become more indispensable. Most of current research has focused on analyzing sentiment of textual data, while only few research has focused on sentiment analysis of image data. Some of these research has considered handcraft image features, the others has utilized Convolutional Neural Network (CNN) features. However, no research to our knowledge has considered mixing both hand-craft and CNN features. In this paper, we attempt to merge CNN which has shown remarkable achievements in Computer Vision recently, with handcraft features such as Color Histogram (CH) and Bag-of-Visual Words (BoVW) with some local features such as SURF and SIFT to predict sentiment of images. Furthermore, because it is often the case that the large amount of training data may not be easily obtained in the area of visual sentiment, we employ both data augmentation and transfer learning from a pre-trained CNN such as VGG16 trained with ImageNet dataset. With the handshake of hand-craft and End-to-End features from CNN, we attempt to attain the improvement of the performance of the proposed visual sentiment prediction framework. We conducted experiments on an image dataset from Twitter with polarity labels (\"positive\" and \"negative\"). The results of experiments demonstrate that our proposed visual sentimental prediction framework outperforms the current state-of-the-art methods.","PeriodicalId":184882,"journal":{"name":"2018 5th International Conference on Advanced Informatics: Concept Theory and Applications (ICAICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Visual Sentiment Prediction by Merging Hand-Craft and CNN Features\",\"authors\":\"Wang Fengjiao, Masaki Aono\",\"doi\":\"10.1109/ICAICTA.2018.8541312\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, more and more people are getting used to social media such as Instagram, Facebook, Twitter, and Flickr to post images and texts to express their sentiment and emotions on almost all events and subjects. In consequence, analyzing sentiment of the huge number of images and texts on social networks has become more indispensable. Most of current research has focused on analyzing sentiment of textual data, while only few research has focused on sentiment analysis of image data. Some of these research has considered handcraft image features, the others has utilized Convolutional Neural Network (CNN) features. However, no research to our knowledge has considered mixing both hand-craft and CNN features. In this paper, we attempt to merge CNN which has shown remarkable achievements in Computer Vision recently, with handcraft features such as Color Histogram (CH) and Bag-of-Visual Words (BoVW) with some local features such as SURF and SIFT to predict sentiment of images. Furthermore, because it is often the case that the large amount of training data may not be easily obtained in the area of visual sentiment, we employ both data augmentation and transfer learning from a pre-trained CNN such as VGG16 trained with ImageNet dataset. With the handshake of hand-craft and End-to-End features from CNN, we attempt to attain the improvement of the performance of the proposed visual sentiment prediction framework. We conducted experiments on an image dataset from Twitter with polarity labels (\\\"positive\\\" and \\\"negative\\\"). The results of experiments demonstrate that our proposed visual sentimental prediction framework outperforms the current state-of-the-art methods.\",\"PeriodicalId\":184882,\"journal\":{\"name\":\"2018 5th International Conference on Advanced Informatics: Concept Theory and Applications (ICAICTA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Advanced Informatics: Concept Theory and Applications (ICAICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICTA.2018.8541312\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Advanced Informatics: Concept Theory and Applications (ICAICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICTA.2018.8541312","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Visual Sentiment Prediction by Merging Hand-Craft and CNN Features
Nowadays, more and more people are getting used to social media such as Instagram, Facebook, Twitter, and Flickr to post images and texts to express their sentiment and emotions on almost all events and subjects. In consequence, analyzing sentiment of the huge number of images and texts on social networks has become more indispensable. Most of current research has focused on analyzing sentiment of textual data, while only few research has focused on sentiment analysis of image data. Some of these research has considered handcraft image features, the others has utilized Convolutional Neural Network (CNN) features. However, no research to our knowledge has considered mixing both hand-craft and CNN features. In this paper, we attempt to merge CNN which has shown remarkable achievements in Computer Vision recently, with handcraft features such as Color Histogram (CH) and Bag-of-Visual Words (BoVW) with some local features such as SURF and SIFT to predict sentiment of images. Furthermore, because it is often the case that the large amount of training data may not be easily obtained in the area of visual sentiment, we employ both data augmentation and transfer learning from a pre-trained CNN such as VGG16 trained with ImageNet dataset. With the handshake of hand-craft and End-to-End features from CNN, we attempt to attain the improvement of the performance of the proposed visual sentiment prediction framework. We conducted experiments on an image dataset from Twitter with polarity labels ("positive" and "negative"). The results of experiments demonstrate that our proposed visual sentimental prediction framework outperforms the current state-of-the-art methods.