{"title":"动态卷积神经网络门控模式中的语义信息","authors":"Ilias Theodorakopoulos, G. Economou","doi":"10.1109/IISA52424.2021.9555567","DOIUrl":null,"url":null,"abstract":"Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)","PeriodicalId":437496,"journal":{"name":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic Information in Gating Patterns of Dynamic Convolutional Neural Networks\",\"authors\":\"Ilias Theodorakopoulos, G. Economou\",\"doi\":\"10.1109/IISA52424.2021.9555567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)\",\"PeriodicalId\":437496,\"journal\":{\"name\":\"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA52424.2021.9555567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA52424.2021.9555567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semantic Information in Gating Patterns of Dynamic Convolutional Neural Networks
Dynamic Convolutional Neural Networks are an emerging class of models characterized by their ability to dynamically adjust inference complexity at run-time, by identifying parts of the model with minimal contribution to the result and skipping the corresponding computations. A prominent such category includes models that generate binary gating signals indicating whether specific convolutional kernels need to be computed or can be omitted based on the characteristics of each processed datum. These signals are usually generated by branches of the same model which are typically learned simultaneously to the main task, with their main objective being to enable good performance with parsimony of computations. We argue that such objective incentivizes the model to implicitly optimize and utilize kernels in class/concept –specific groups, hence ascribing semantic information to the gating signals. We demonstrate this behavior by studying the characteristics of such signals for popular CNN architectures in the ImageNet database. By comparing the relationship between gating signals from different visual categories in the ImageNet hierarchy, it is shown that the gating patterns’ dissimilarity correlates well with semantic span of the underlying classes. It is also demonstrated that through appropriate distance measures, gating patterns can be used for ranking classes’ similarity with comparable performance to that off standard CNN-generated image descriptors, but in a significantly more compact representation due to their binary nature. (Abstract)