Visualization of Layers Within a Convolutional Neural Network Using Gradient Activation Maps

IF 0.2 Q4 BIOLOGY

University of Toronto Journal of Undergraduate Life Sciences Pub Date : 2020-12-31 DOI:10.33137/juls.v14i1.35833

D. McAllister, Mauro Mendez, Ariana Bermúdez, P. Tyrrell

{"title":"Visualization of Layers Within a Convolutional Neural Network Using Gradient Activation Maps","authors":"D. McAllister, Mauro Mendez, Ariana Bermúdez, P. Tyrrell","doi":"10.33137/juls.v14i1.35833","DOIUrl":null,"url":null,"abstract":"Introduction: Convolutional neural networks (CNNs) are machine learning tools that have great potential in the field of medical imaging. However, it is often regarded as a “black box” as the process that is used by the machine to acquire a result is not transparent. It would be valuable to find a method to be able to understand how the machine comes to its decision. Therefore, the purpose of this study is to examine how effective gradient-weighted class activation mapping (grad-CAM) visualizations are for certain layers in a CNN-based dental x-ray artifact prediction model. \nMethods: To tackle this project, Python code using PyTorch trained a CNN to classify dental plates as unusable or usable depending on the presence of artifacts. Furthermore, Python using PyTorch was also used to overlay grad-CAM visualizations on the given input images for various layers within the model. One image with seventeen different overlays of artifacts was used in this study. \nResults: In earlier layers, the model appeared to focus on general features such as lines and edges of the teeth, while in later layers, the model was more interested in detailed aspects of the image. All images that contained artifacts resulted in the model focusing on more detailed areas of the image rather than the artifacts themselves. Whereas the images without artifacts resulted in the model focusing on the visualization of areas that surrounded the teeth. \nDiscussion and Conclusion: As subsequent layers examined more detailed aspects of the image as shown by the grad-CAM visualizations, they provided better insight into how the model processes information when it is making its classifications. Since all the images with artifacts showed similar trends in the visualizations of the various layers, it provides evidence to suggest that the location and size of the artifact does not affect the model’s pattern recognition and image classification.","PeriodicalId":40102,"journal":{"name":"University of Toronto Journal of Undergraduate Life Sciences","volume":" ","pages":""},"PeriodicalIF":0.2000,"publicationDate":"2020-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"University of Toronto Journal of Undergraduate Life Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33137/juls.v14i1.35833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 2

Abstract

Introduction: Convolutional neural networks (CNNs) are machine learning tools that have great potential in the field of medical imaging. However, it is often regarded as a “black box” as the process that is used by the machine to acquire a result is not transparent. It would be valuable to find a method to be able to understand how the machine comes to its decision. Therefore, the purpose of this study is to examine how effective gradient-weighted class activation mapping (grad-CAM) visualizations are for certain layers in a CNN-based dental x-ray artifact prediction model. Methods: To tackle this project, Python code using PyTorch trained a CNN to classify dental plates as unusable or usable depending on the presence of artifacts. Furthermore, Python using PyTorch was also used to overlay grad-CAM visualizations on the given input images for various layers within the model. One image with seventeen different overlays of artifacts was used in this study. Results: In earlier layers, the model appeared to focus on general features such as lines and edges of the teeth, while in later layers, the model was more interested in detailed aspects of the image. All images that contained artifacts resulted in the model focusing on more detailed areas of the image rather than the artifacts themselves. Whereas the images without artifacts resulted in the model focusing on the visualization of areas that surrounded the teeth. Discussion and Conclusion: As subsequent layers examined more detailed aspects of the image as shown by the grad-CAM visualizations, they provided better insight into how the model processes information when it is making its classifications. Since all the images with artifacts showed similar trends in the visualizations of the various layers, it provides evidence to suggest that the location and size of the artifact does not affect the model’s pattern recognition and image classification.

查看原文本刊更多论文

使用梯度激活图的卷积神经网络层可视化

简介:卷积神经网络(cnn)是在医学成像领域具有巨大潜力的机器学习工具。然而，它通常被认为是一个“黑匣子”，因为机器用来获取结果的过程并不透明。找到一种能够理解机器如何做出决定的方法是很有价值的。因此，本研究的目的是检验梯度加权类激活映射(梯度- cam)可视化在基于cnn的牙科x射线伪影预测模型中对某些层的有效性。方法:为了解决这个项目，使用PyTorch的Python代码训练了一个CNN，根据人工制品的存在将牙板分类为不可用或可用。此外，使用PyTorch的Python还用于在模型内的各个层的给定输入图像上覆盖gradcam可视化。本研究使用了一幅具有17种不同叠加伪影的图像。结果:在较早的图层中，模型似乎更关注牙齿的线条和边缘等一般特征，而在较晚的图层中，模型对图像的细节方面更感兴趣。所有包含工件的图像都会导致模型关注图像的更详细的区域，而不是工件本身。然而，没有伪影的图像导致模型专注于牙齿周围区域的可视化。讨论和结论:由于后续的图层检查了由grad-CAM可视化显示的图像的更详细的方面，它们提供了更好的洞察模型在进行分类时如何处理信息。由于所有带有伪影的图像在各层的可视化中都表现出相似的趋势，这提供了证据表明伪影的位置和大小不影响模型的模式识别和图像分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

University of Toronto Journal of Undergraduate Life Sciences BIOLOGY-

自引率

0.00%

发文量