GPOL:基于梯度和概率的目标定位方法来理解cnn的工作

2019 IEEE Bombay Section Signature Conference (IBSSC) Pub Date : 2019-07-01 DOI:10.1109/IBSSC47189.2019.8972980

Sarthak Gupta, S. Bagga, Sanjay Kumar Dharandher, D. Sharma

{"title":"GPOL:基于梯度和概率的目标定位方法来理解cnn的工作","authors":"Sarthak Gupta, S. Bagga, Sanjay Kumar Dharandher, D. Sharma","doi":"10.1109/IBSSC47189.2019.8972980","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks have been a revolution in the field of Computer Vision and are being extensively used for the purpose of image classification, object detection, generation of captions etc. CNNs are mostly considered black boxes where the internal functioning is not known. The objective of this work is to provide an explanation of the functioning of the the predictions made by the CNN. We propose a new technique for comprehending the functioning of the middle layers of the neural network and the classifier operations. The proposed approach is capable of analyzing multifarious models which are trained for applications such as object detection and recognition. In this work, probabilistic approach and gradient based approach have been used for the purpose of object localization. Geometric mean of heatmaps of both the approaches has been done. In the former approach, the true object’s gradient’s are made to flow into the last convolutional layer for the purpose of determining the most significant points which would help to predict that particular object. In the probabilistic approach, CNN’s top down attention has been used which serves the purpose of generation of attention maps which are task specific. A probabilistic scheme (to select a significant neuron in the network) has been used during backpropagation of signals from top to down in the hierarchy of network. The proposed work has been executed on CLS-LOC dataset which is a part of Imagenet dataset. The proposed work is then compared with the previously developed techniques such as saliency maps, SmoothGrad, GradCam, Top Down Neural approach to exhibit the better accuracy of the proposed work.","PeriodicalId":148941,"journal":{"name":"2019 IEEE Bombay Section Signature Conference (IBSSC)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"GPOL: Gradient and Probabilistic approach for Object Localization to understand the working of CNNs\",\"authors\":\"Sarthak Gupta, S. Bagga, Sanjay Kumar Dharandher, D. Sharma\",\"doi\":\"10.1109/IBSSC47189.2019.8972980\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural networks have been a revolution in the field of Computer Vision and are being extensively used for the purpose of image classification, object detection, generation of captions etc. CNNs are mostly considered black boxes where the internal functioning is not known. The objective of this work is to provide an explanation of the functioning of the the predictions made by the CNN. We propose a new technique for comprehending the functioning of the middle layers of the neural network and the classifier operations. The proposed approach is capable of analyzing multifarious models which are trained for applications such as object detection and recognition. In this work, probabilistic approach and gradient based approach have been used for the purpose of object localization. Geometric mean of heatmaps of both the approaches has been done. In the former approach, the true object’s gradient’s are made to flow into the last convolutional layer for the purpose of determining the most significant points which would help to predict that particular object. In the probabilistic approach, CNN’s top down attention has been used which serves the purpose of generation of attention maps which are task specific. A probabilistic scheme (to select a significant neuron in the network) has been used during backpropagation of signals from top to down in the hierarchy of network. The proposed work has been executed on CLS-LOC dataset which is a part of Imagenet dataset. The proposed work is then compared with the previously developed techniques such as saliency maps, SmoothGrad, GradCam, Top Down Neural approach to exhibit the better accuracy of the proposed work.\",\"PeriodicalId\":148941,\"journal\":{\"name\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"volume\":\"104 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBSSC47189.2019.8972980\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC47189.2019.8972980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

卷积神经网络是计算机视觉领域的一场革命，被广泛应用于图像分类、目标检测、标题生成等领域。cnn大多被认为是内部功能未知的黑匣子。这项工作的目的是为CNN所做的预测的功能提供解释。我们提出了一种理解神经网络中间层功能和分类器操作的新技术。所提出的方法能够分析各种模型，这些模型被训练用于目标检测和识别等应用。在本研究中，采用了概率方法和基于梯度的方法进行目标定位。对两种方法的热图进行了几何平均。在前一种方法中，真实物体的梯度被流到最后一个卷积层，目的是确定最重要的点，这将有助于预测特定物体。在概率方法中，我们使用了CNN的自顶向下的注意力，从而生成了针对特定任务的注意力图。在神经网络的层次结构中，从上到下的反向传播过程中，采用了一种概率方案(在网络中选择一个重要的神经元)。所提出的工作已经在CLS-LOC数据集上执行，CLS-LOC数据集是Imagenet数据集的一部分。然后将所提出的工作与先前开发的技术(如显著性图、SmoothGrad、GradCam、自顶向下神经方法)进行比较，以展示所提出工作的更好准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

GPOL: Gradient and Probabilistic approach for Object Localization to understand the working of CNNs

Convolutional neural networks have been a revolution in the field of Computer Vision and are being extensively used for the purpose of image classification, object detection, generation of captions etc. CNNs are mostly considered black boxes where the internal functioning is not known. The objective of this work is to provide an explanation of the functioning of the the predictions made by the CNN. We propose a new technique for comprehending the functioning of the middle layers of the neural network and the classifier operations. The proposed approach is capable of analyzing multifarious models which are trained for applications such as object detection and recognition. In this work, probabilistic approach and gradient based approach have been used for the purpose of object localization. Geometric mean of heatmaps of both the approaches has been done. In the former approach, the true object’s gradient’s are made to flow into the last convolutional layer for the purpose of determining the most significant points which would help to predict that particular object. In the probabilistic approach, CNN’s top down attention has been used which serves the purpose of generation of attention maps which are task specific. A probabilistic scheme (to select a significant neuron in the network) has been used during backpropagation of signals from top to down in the hierarchy of network. The proposed work has been executed on CLS-LOC dataset which is a part of Imagenet dataset. The proposed work is then compared with the previously developed techniques such as saliency maps, SmoothGrad, GradCam, Top Down Neural approach to exhibit the better accuracy of the proposed work.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE Bombay Section Signature Conference (IBSSC)

自引率

0.00%

发文量