Sarthak Gupta, S. Bagga, Sanjay Kumar Dharandher, D. Sharma
{"title":"GPOL: Gradient and Probabilistic approach for Object Localization to understand the working of CNNs","authors":"Sarthak Gupta, S. Bagga, Sanjay Kumar Dharandher, D. Sharma","doi":"10.1109/IBSSC47189.2019.8972980","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks have been a revolution in the field of Computer Vision and are being extensively used for the purpose of image classification, object detection, generation of captions etc. CNNs are mostly considered black boxes where the internal functioning is not known. The objective of this work is to provide an explanation of the functioning of the the predictions made by the CNN. We propose a new technique for comprehending the functioning of the middle layers of the neural network and the classifier operations. The proposed approach is capable of analyzing multifarious models which are trained for applications such as object detection and recognition. In this work, probabilistic approach and gradient based approach have been used for the purpose of object localization. Geometric mean of heatmaps of both the approaches has been done. In the former approach, the true object’s gradient’s are made to flow into the last convolutional layer for the purpose of determining the most significant points which would help to predict that particular object. In the probabilistic approach, CNN’s top down attention has been used which serves the purpose of generation of attention maps which are task specific. A probabilistic scheme (to select a significant neuron in the network) has been used during backpropagation of signals from top to down in the hierarchy of network. The proposed work has been executed on CLS-LOC dataset which is a part of Imagenet dataset. The proposed work is then compared with the previously developed techniques such as saliency maps, SmoothGrad, GradCam, Top Down Neural approach to exhibit the better accuracy of the proposed work.","PeriodicalId":148941,"journal":{"name":"2019 IEEE Bombay Section Signature Conference (IBSSC)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC47189.2019.8972980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Convolutional neural networks have been a revolution in the field of Computer Vision and are being extensively used for the purpose of image classification, object detection, generation of captions etc. CNNs are mostly considered black boxes where the internal functioning is not known. The objective of this work is to provide an explanation of the functioning of the the predictions made by the CNN. We propose a new technique for comprehending the functioning of the middle layers of the neural network and the classifier operations. The proposed approach is capable of analyzing multifarious models which are trained for applications such as object detection and recognition. In this work, probabilistic approach and gradient based approach have been used for the purpose of object localization. Geometric mean of heatmaps of both the approaches has been done. In the former approach, the true object’s gradient’s are made to flow into the last convolutional layer for the purpose of determining the most significant points which would help to predict that particular object. In the probabilistic approach, CNN’s top down attention has been used which serves the purpose of generation of attention maps which are task specific. A probabilistic scheme (to select a significant neuron in the network) has been used during backpropagation of signals from top to down in the hierarchy of network. The proposed work has been executed on CLS-LOC dataset which is a part of Imagenet dataset. The proposed work is then compared with the previously developed techniques such as saliency maps, SmoothGrad, GradCam, Top Down Neural approach to exhibit the better accuracy of the proposed work.