Convolutions through time for multi-label movie genre classification

Proceedings of the Symposium on Applied Computing Pub Date : 2017-04-03 DOI:10.1145/3019612.3019641

Jonatas Wehrmann, Rodrigo C. Barros

引用次数: 23

Abstract

In this paper, we explore the suitability of employing Convolutional Neural Networks (ConvNets) for multi-label movie trailer genre classification. Assigning genres to movies is a particularly challenging task because genre is an immaterial feature that is not physically present in a movie frame, so off-the-shelf image detection models cannot be easily adapted to this context. Moreover, multi-label classification is more challenging than single-label classification considering that one instance can be assigned to multiple classes at once. We propose a novel classification method that encapsulates an ultra-deep ConvNet with residual connections. Our approach extracts temporal information from image-based features prior to performing the mapping of trailers to genres. We compare our novel approach with the current state-of-the-art techniques for movie classification, which make use of well-known image descriptors and low-level handcrafted features. Results show that our method significantly outperforms the state-of-the-art in this task, improving the classification accuracy for all genres.

查看原文本刊更多论文

多标签电影类型分类的时间卷积

在本文中，我们探讨了使用卷积神经网络(ConvNets)进行多标签电影预告片类型分类的适用性。为电影分配类型是一项特别具有挑战性的任务，因为类型是一种非物质特征，不存在于电影帧中，所以现成的图像检测模型不容易适应这种情况。此外，考虑到一个实例可以一次分配给多个类，多标签分类比单标签分类更具挑战性。我们提出了一种新的分类方法，该方法封装了带有残差连接的超深度卷积神经网络。我们的方法是从基于图像的特征中提取时间信息，然后再执行预告片到类型的映射。我们将我们的新方法与当前最先进的电影分类技术进行了比较，后者利用了众所周知的图像描述符和低级别的手工特征。结果表明，我们的方法在此任务中显著优于最先进的方法，提高了所有类型的分类精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Symposium on Applied Computing

自引率

0.00%

发文量