动态卷积神经网络门控模式中的语义信息

2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA) Pub Date : 2021-07-12 DOI:10.1109/IISA52424.2021.9555567

Ilias Theodorakopoulos, G. Economou

{"title":"动态卷积神经网络门控模式中的语义信息","authors":"Ilias Theodorakopoulos, G. Economou","doi":"10.1109/IISA52424.2021.9555567","DOIUrl":null,"url":null,"abstract":"Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)","PeriodicalId":437496,"journal":{"name":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic Information in Gating Patterns of Dynamic Convolutional Neural Networks\",\"authors\":\"Ilias Theodorakopoulos, G. Economou\",\"doi\":\"10.1109/IISA52424.2021.9555567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)\",\"PeriodicalId\":437496,\"journal\":{\"name\":\"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA52424.2021.9555567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA52424.2021.9555567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

动态卷积神经网络是一类新兴的模型，其特点是能够在运行时动态调整推理复杂性，通过识别对结果贡献最小的模型部分并跳过相应的计算。其中一个突出的类别包括生成二进制门控信号的模型，该信号指示是否需要计算特定的卷积核，或者可以根据每个处理过的数据的特征忽略该卷积核。这些信号通常由同一模型的分支生成，这些分支通常同时学习到主要任务，其主要目标是通过简化计算实现良好的性能。我们认为，这样的目标激励模型隐式优化和利用特定类别/概念组中的核，从而将语义信息归因于门控信号。我们通过研究ImageNet数据库中流行的CNN架构的这些信号的特征来证明这种行为。通过比较ImageNet层次结构中不同视觉类别的门控信号之间的关系，表明门控模式的差异性与底层类别的语义跨度有很好的相关性。研究还表明，通过适当的距离度量，门控制模式可以用于对类的相似性进行排序，其性能与非标准cnn生成的图像描述符相当，但由于其二值性，门控制模式的表示更加紧凑。(抽象)

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Semantic Information in Gating Patterns of Dynamic Convolutional Neural Networks

Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)

自引率

0.00%

发文量