Selfie Detection by Synergy-Constraint Based Convolutional Neural Network

2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS) Pub Date : 2016-11-01 DOI:10.1109/SITIS.2016.61

Yashas Annadani, Vijayakrishna Naganoor, A. Jagadish, K. Chemmangat

{"title":"Selfie Detection by Synergy-Constraint Based Convolutional Neural Network","authors":"Yashas Annadani, Vijayakrishna Naganoor, A. Jagadish, K. Chemmangat","doi":"10.1109/SITIS.2016.61","DOIUrl":null,"url":null,"abstract":"Categorisation of huge amount of data on the multimedia platform is a crucial task. In this work, we propose a novel approach to address the subtle problem of selfie detection for image database segregation on the web, given rapid rise in the number of selfies being clicked. A Convolutional Neural Network (CNN) is modeled to learn a synergy feature in the common subspace of head and shoulder orientation, derived from Local Binary Pattern (LBP) and Histogram of Oriented Gradients (HOG) features respectively. This synergy was captured by projecting the aforementioned features using Canonical Correlation Analysis (CCA). We show that the resulting network's convolutional activations in the neighbourhood of spatial keypoints captured by SIFT are discriminative for selfie-detection. In general, proposed approach aids in capturing intricacies present in the image data and has the potential for usage in other subtle image analysis scenarios apart from just selfie detection. We investigate and analyse the performance of the popular CNN architectures (GoogleNet, Alexnet), used for other image classification tasks, when subjected to the task of detecting the selfies on the multimedia platform. The results of the proposed approach are compared with these popular architectures on a dataset of ninety thousand images comprising of roughly equal number of selfies and non-selfies. Experimental results on this dataset shows the effectiveness of the proposed approach.","PeriodicalId":403704,"journal":{"name":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2016.61","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Categorisation of huge amount of data on the multimedia platform is a crucial task. In this work, we propose a novel approach to address the subtle problem of selfie detection for image database segregation on the web, given rapid rise in the number of selfies being clicked. A Convolutional Neural Network (CNN) is modeled to learn a synergy feature in the common subspace of head and shoulder orientation, derived from Local Binary Pattern (LBP) and Histogram of Oriented Gradients (HOG) features respectively. This synergy was captured by projecting the aforementioned features using Canonical Correlation Analysis (CCA). We show that the resulting network's convolutional activations in the neighbourhood of spatial keypoints captured by SIFT are discriminative for selfie-detection. In general, proposed approach aids in capturing intricacies present in the image data and has the potential for usage in other subtle image analysis scenarios apart from just selfie detection. We investigate and analyse the performance of the popular CNN architectures (GoogleNet, Alexnet), used for other image classification tasks, when subjected to the task of detecting the selfies on the multimedia platform. The results of the proposed approach are compared with these popular architectures on a dataset of ninety thousand images comprising of roughly equal number of selfies and non-selfies. Experimental results on this dataset shows the effectiveness of the proposed approach.

查看原文本刊更多论文

基于协同约束的卷积神经网络自拍照检测

对多媒体平台上的海量数据进行分类是一项至关重要的任务。在这项工作中，我们提出了一种新的方法来解决网络上图像数据库隔离的自拍检测的微妙问题，因为自拍被点击的数量迅速增加。对卷积神经网络(CNN)进行建模，分别从局部二值模式(Local Binary Pattern, LBP)和方向梯度直方图(Histogram of Oriented Gradients, HOG)特征中提取头部和肩部方向共同子空间中的协同特征。这种协同作用是通过使用典型相关分析(CCA)预测上述特征来实现的。我们证明了所得网络在SIFT捕获的空间关键点附近的卷积激活对于自检测具有区别性。总的来说，所提出的方法有助于捕获图像数据中存在的复杂性，并且除了自拍检测之外，还具有在其他微妙图像分析场景中使用的潜力。我们调查和分析了用于其他图像分类任务的流行CNN架构(GoogleNet, Alexnet)在检测多媒体平台上的自拍照任务时的性能。将所提出的方法的结果与这些流行的架构在9万张图像的数据集上进行比较，这些图像包括大致相同数量的自拍照和非自拍照。在该数据集上的实验结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)

自引率

0.00%

发文量