Scalable Logo Detection and Recognition with Minimal Labeling

D. M. Montserrat, Qian Lin, J. Allebach, E. Delp
Published in: 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)
Publication date: 2018-04-10
DOI: 10.1109/MIPR.2018.00034 (https://doi.org/10.1109/MIPR.2018.00034)
Citations: 3

Abstract

In this paper we describe a new approach to detecting and locating brand logos in an image using machine learning methods and synthetic training data. Deep learning methods, particularly Convolutional Neural Networks (CNNs), have been very popular for extracting visual information, such as shapes and objects, from images. A CNN has parameters and configuration information that are learned from training images. To obtain good accuracy, a large number of labeled (ground-truthed) images is usually required for training. Collecting and labeling the training images can be expensive and time-consuming. Methods that include data augmentation, image synthesis, and bootstrapping provide useful alternatives to creating training images by hand. In this paper, we present a logo detection method that requires a minimal number of labeled images. First, we use synthetic images to train a CNN to detect logos. Then, this CNN is used to automatically detect and localize logos in images extracted from the web. Finally, these images are used to train a logo classifier. The combination of the logo detector and the classifier allows us to locate and classify multiple logos in a scene. While existing methods rely on manually labeled images, our method is trained entirely with images obtained in an automated manner with minimal human supervision.
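
The abstract does not give implementation details, so the following is only a minimal sketch of two steps it names: synthesizing a labeled training image by pasting a logo onto a background, and cropping confident detections from web images for classifier training. The function names (`composite_logo`, `crop_confident_detections`), the scale range, the score threshold, and the `(box, score)` detection format are all assumptions made for illustration, not the authors' method. Only NumPy is used; the resize is a simple nearest-neighbour sampling.

```python
import numpy as np


def composite_logo(background, logo, alpha_mask, rng,
                   min_scale=0.1, max_scale=0.4):
    """Paste `logo` onto a copy of `background` at a random scale and position.

    background: (H, W, 3) uint8 image.
    logo:       (h, w, 3) uint8 image.
    alpha_mask: (h, w) float array in [0, 1]; 1 means fully opaque logo pixel.
    Returns the synthetic image and its box (x_min, y_min, x_max, y_max).
    The scale range is a hypothetical choice, not taken from the paper.
    """
    bg_h, bg_w = background.shape[:2]
    lg_h, lg_w = logo.shape[:2]

    # Choose the logo width as a random fraction of the background width,
    # keeping the logo's aspect ratio and clamping to the background height.
    scale = rng.uniform(min_scale, max_scale)
    new_w = max(1, int(bg_w * scale))
    new_h = min(bg_h, max(1, int(lg_h * new_w / lg_w)))

    # Nearest-neighbour resize by index sampling.
    rows = np.arange(new_h) * lg_h // new_h
    cols = np.arange(new_w) * lg_w // new_w
    logo_r = logo[rows][:, cols].astype(np.float32)
    mask_r = alpha_mask[rows][:, cols][..., None].astype(np.float32)

    # Random top-left corner such that the logo fits inside the background.
    x0 = int(rng.integers(0, bg_w - new_w + 1))
    y0 = int(rng.integers(0, bg_h - new_h + 1))

    # Alpha-blend the logo into the selected region.
    out = background.astype(np.float32).copy()
    region = out[y0:y0 + new_h, x0:x0 + new_w]
    out[y0:y0 + new_h, x0:x0 + new_w] = mask_r * logo_r + (1.0 - mask_r) * region
    return out.astype(np.uint8), (x0, y0, x0 + new_w, y0 + new_h)


def crop_confident_detections(image, detections, min_score=0.8):
    """Crop regions whose detection score is at least `min_score`.

    detections: iterable of ((x_min, y_min, x_max, y_max), score) pairs,
    a hypothetical output format for a logo detector.
    """
    h, w = image.shape[:2]
    crops = []
    for (x0, y0, x1, y1), score in detections:
        if score < min_score:
            continue
        # Clip the box to the image bounds and skip empty boxes.
        x0, y0 = max(0, x0), max(0, y0)
        x1, y1 = min(w, x1), min(h, y1)
        if x1 > x0 and y1 > y0:
            crops.append(image[y0:y1, x0:x1].copy())
    return crops


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    bg = np.zeros((100, 200, 3), dtype=np.uint8)
    logo = np.full((10, 20, 3), 255, dtype=np.uint8)
    mask = np.ones((10, 20), dtype=np.float32)
    image, box = composite_logo(bg, logo, mask, rng)
    crops = crop_confident_detections(image, [(box, 0.9), ((0, 0, 5, 5), 0.1)])
    print(box, len(crops))
```

Because the bounding box is recorded at the moment the logo is pasted, each synthetic image comes with its label for free, which is the property that lets a detector be trained without manual annotation. A real pipeline would likely add further augmentation (rotation, color changes, blur), none of which is specified in the abstract.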