{"title":"A Deep Convolutional Neural Network for Interrelationship Identification between Humans from Images","authors":"Amit Verma, T. Meenpal, B. Acharya","doi":"10.1109/INFOCOMTECH.2018.8722391","DOIUrl":null,"url":null,"abstract":"The paper proposes a deep convolutional neural network for visual categorization of different interrelationships between humans from digital images. To achieve this goal, we first generated a dataset of interrelationships containing two interrelationship classes i.e. handshaking and hugging. Our proposed network having around 2 lakh neurons is trained with 8 lakh parameters. The network contains a total of seven layers i.e. two convolution layers each followed by max pooling layers and fully connected layers. Output layer contains a sigmoid function providing binary outputs i.e. 0 for one class and 1 for another class. To maintain the nonlinearity of images, Rectified Linear Units (ReLUs) have been used in each convolution and fully connected layers. The model generates an average accuracy of approximately 81%. Data augmentation technique has also been applied to reduce over-fitting.","PeriodicalId":175757,"journal":{"name":"2018 Conference on Information and Communication Technology (CICT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Conference on Information and Communication Technology (CICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOMTECH.2018.8722391","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The paper proposes a deep convolutional neural network for visual categorization of different interrelationships between humans from digital images. To achieve this goal, we first generated a dataset of interrelationships containing two interrelationship classes i.e. handshaking and hugging. Our proposed network having around 2 lakh neurons is trained with 8 lakh parameters. The network contains a total of seven layers i.e. two convolution layers each followed by max pooling layers and fully connected layers. Output layer contains a sigmoid function providing binary outputs i.e. 0 for one class and 1 for another class. To maintain the nonlinearity of images, Rectified Linear Units (ReLUs) have been used in each convolution and fully connected layers. The model generates an average accuracy of approximately 81%. Data augmentation technique has also been applied to reduce over-fitting.