PatternNet: Visual Pattern Mining with Deep Neural Network

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval Pub Date : 2017-03-18 DOI:10.1145/3206025.3206039

Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang

{"title":"PatternNet: Visual Pattern Mining with Deep Neural Network","authors":"Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang","doi":"10.1145/3206025.3206039","DOIUrl":null,"url":null,"abstract":"Visual patterns represent the discernible regularity in the visual world. They capture the essential nature of visual objects or scenes. Understanding and modeling visual patterns is a fundamental problem in visual recognition that has wide ranging applications. In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering these patterns that are both discriminative and representative. The proposed PatternNet leverages the filters in the last convolution layer of a convolutional neural network to find locally consistent visual patches, and by combining these filters we can effectively discover unique visual patterns. In addition, PatternNet can discover visual patterns efficiently without performing expensive image patch sampling, and this advantage provides an order of magnitude speedup compared to most other approaches. We evaluate the proposed PatternNet subjectively by showing randomly selected visual patterns which are discovered by our method and quantitatively by performing image classification with the identified visual patterns and comparing our performance with the current state-of-the-art. We also directly evaluate the quality of the discovered visual patterns by leveraging the identified patterns as proposed objects in an image and compare with other relevant methods. Our proposed network and procedure, PatterNet, is able to outperform competing methods for the tasks described.","PeriodicalId":224132,"journal":{"name":"Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3206025.3206039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 33

Abstract

Visual patterns represent the discernible regularity in the visual world. They capture the essential nature of visual objects or scenes. Understanding and modeling visual patterns is a fundamental problem in visual recognition that has wide ranging applications. In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering these patterns that are both discriminative and representative. The proposed PatternNet leverages the filters in the last convolution layer of a convolutional neural network to find locally consistent visual patches, and by combining these filters we can effectively discover unique visual patterns. In addition, PatternNet can discover visual patterns efficiently without performing expensive image patch sampling, and this advantage provides an order of magnitude speedup compared to most other approaches. We evaluate the proposed PatternNet subjectively by showing randomly selected visual patterns which are discovered by our method and quantitatively by performing image classification with the identified visual patterns and comparing our performance with the current state-of-the-art. We also directly evaluate the quality of the discovered visual patterns by leveraging the identified patterns as proposed objects in an image and compare with other relevant methods. Our proposed network and procedure, PatterNet, is able to outperform competing methods for the tasks described.

查看原文本刊更多论文

PatternNet:基于深度神经网络的视觉模式挖掘

视觉模式代表了视觉世界中可识别的规律性。它们捕捉了视觉对象或场景的本质。视觉模式的理解和建模是视觉识别中的一个基本问题，具有广泛的应用。在本文中，我们研究了视觉模式挖掘问题，并提出了一种新的深度神经网络架构，称为PatternNet，用于发现这些既具有判别性又具有代表性的模式。提出的PatternNet利用卷积神经网络最后一层卷积中的滤波器来寻找局部一致的视觉斑块，通过组合这些滤波器，我们可以有效地发现独特的视觉模式。此外，PatternNet可以有效地发现视觉模式，而无需执行昂贵的图像补丁采样，与大多数其他方法相比，这一优势提供了一个数量级的加速。我们主观上通过展示随机选择的由我们的方法发现的视觉模式来评估我们提出的PatternNet，定量地通过使用识别的视觉模式进行图像分类，并将我们的性能与当前最先进的技术进行比较。我们还通过将识别的模式作为图像中的建议对象直接评估发现的视觉模式的质量，并与其他相关方法进行比较。我们提出的网络和过程，PatterNet，能够在描述的任务中优于竞争方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval

自引率

0.00%

发文量