Learning Perceptual Embeddings with Two Related Tasks for Joint Predictions of Media Interestingness and Emotions

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval Pub Date : 2018-06-05 DOI:10.1145/3206025.3206071

Yang Liu, Zhonglei Gu, Tobey H. Ko, K. Hua

{"title":"Learning Perceptual Embeddings with Two Related Tasks for Joint Predictions of Media Interestingness and Emotions","authors":"Yang Liu, Zhonglei Gu, Tobey H. Ko, K. Hua","doi":"10.1145/3206025.3206071","DOIUrl":null,"url":null,"abstract":"Integrating media elements of various medium, multimedia is capable of expressing complex information in a neat and compact way. Early studies have linked different sensory presentation in multimedia with the perception of human-like concepts. Yet, the richness of information in multimedia makes understanding and predicting user perceptions in multimedia content a challenging task both to the machine and the human mind. This paper presents a novel multi-task feature extraction method for accurate prediction of user perceptions in multimedia content. Differentiating from the conventional feature extraction algorithms which focus on perfecting a single task, the proposed model recognizes the commonality between different perceptions (e.g., interestingness and emotional impact), and attempts to jointly optimize the performance of all the tasks through uncovered commonality features. Using both a media interestingness dataset and a media emotion dataset for user perception prediction tasks, the proposed model attempts to simultaneously characterize the individualities of each task and capture the commonalities shared by both tasks, and achieves better accuracy in predictions than other competing algorithms on real-world datasets of two related tasks: MediaEval 2017 Predicting Media Interestingness Task and MediaEval 2017 Emotional Impact of Movies Task.","PeriodicalId":224132,"journal":{"name":"Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3206025.3206071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Integrating media elements of various medium, multimedia is capable of expressing complex information in a neat and compact way. Early studies have linked different sensory presentation in multimedia with the perception of human-like concepts. Yet, the richness of information in multimedia makes understanding and predicting user perceptions in multimedia content a challenging task both to the machine and the human mind. This paper presents a novel multi-task feature extraction method for accurate prediction of user perceptions in multimedia content. Differentiating from the conventional feature extraction algorithms which focus on perfecting a single task, the proposed model recognizes the commonality between different perceptions (e.g., interestingness and emotional impact), and attempts to jointly optimize the performance of all the tasks through uncovered commonality features. Using both a media interestingness dataset and a media emotion dataset for user perception prediction tasks, the proposed model attempts to simultaneously characterize the individualities of each task and capture the commonalities shared by both tasks, and achieves better accuracy in predictions than other competing algorithms on real-world datasets of two related tasks: MediaEval 2017 Predicting Media Interestingness Task and MediaEval 2017 Emotional Impact of Movies Task.

查看原文本刊更多论文

用两个相关任务学习感知嵌入对媒体兴趣和情绪的联合预测

多媒体是一种集各种媒体元素于一体的技术，能够以简洁、紧凑的方式表达复杂的信息。早期的研究将多媒体中的不同感官呈现与类人概念的感知联系起来。然而，多媒体中信息的丰富性使得理解和预测用户对多媒体内容的感知对机器和人类大脑来说都是一项具有挑战性的任务。本文提出了一种新的多任务特征提取方法，用于准确预测多媒体内容中的用户感知。与传统的专注于完善单个任务的特征提取算法不同，该模型识别了不同感知之间的共性(例如，兴趣和情感影响)，并试图通过发现共性特征来共同优化所有任务的性能。使用媒体兴趣度数据集和媒体情感数据集进行用户感知预测任务，该模型试图同时表征每个任务的个性并捕获两个任务的共性，并在两个相关任务(MediaEval 2017预测媒体兴趣度任务和MediaEval 2017电影情感影响任务)的真实数据集上取得比其他竞争算法更好的预测准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval

自引率

0.00%

发文量