Large-Scale Aerial Image Categorization by Multi-Task Discriminative Topologies Discovery

WISMM '14 Pub Date : 2014-11-07 DOI:10.1145/2661714.2661718

Yingjie Xia, Luming Zhang, Suhua Tang

{"title":"Large-Scale Aerial Image Categorization by Multi-Task Discriminative Topologies Discovery","authors":"Yingjie Xia, Luming Zhang, Suhua Tang","doi":"10.1145/2661714.2661718","DOIUrl":null,"url":null,"abstract":"Fast and accurately categorizing the millions of aerial images on Google Maps is a useful technique in multimedia applications. Existing methods cannot handle this task effectively due to two reasons. 1) It is challenging to build a realtime image categorization system, as some geo-aware Apps update over 20 aerial images per second. 2) The aerial images' topologies are the key to distinguish their categories, but they cannot be encoded by the generic visual descriptors. To solve these two problems, we propose an efficient aerial image categorization system, aiming at mining discriminative topologies of aerial images under a multi-task learning framework. Particularly, we first construct a region adjacency graph (RAG) that describes the topology of each aerial image. Thereby, aerial image categorization can be formulated as RAG-to-RAG matching. Based on graph theory, RAG-to-RAG matching is conducted by comparing all their respective graphlets (i.e., small subgraphs). Because the number of graphlets is huge, a multi-task feature selection algorithm is derived to discover topologies jointly discriminative to multiple categories. The discovered topologies are used to extract the discriminative graphlets. Finally, these graphlets are integrated into an AdaBoost model for predicting aerial image categories. Experiments show that our approach is competitive several existing recognition models. Further, over 24 aerial images are categorized per second, reflecting that our system is ready for real-world applications.","PeriodicalId":365687,"journal":{"name":"WISMM '14","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"WISMM '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2661714.2661718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Fast and accurately categorizing the millions of aerial images on Google Maps is a useful technique in multimedia applications. Existing methods cannot handle this task effectively due to two reasons. 1) It is challenging to build a realtime image categorization system, as some geo-aware Apps update over 20 aerial images per second. 2) The aerial images' topologies are the key to distinguish their categories, but they cannot be encoded by the generic visual descriptors. To solve these two problems, we propose an efficient aerial image categorization system, aiming at mining discriminative topologies of aerial images under a multi-task learning framework. Particularly, we first construct a region adjacency graph (RAG) that describes the topology of each aerial image. Thereby, aerial image categorization can be formulated as RAG-to-RAG matching. Based on graph theory, RAG-to-RAG matching is conducted by comparing all their respective graphlets (i.e., small subgraphs). Because the number of graphlets is huge, a multi-task feature selection algorithm is derived to discover topologies jointly discriminative to multiple categories. The discovered topologies are used to extract the discriminative graphlets. Finally, these graphlets are integrated into an AdaBoost model for predicting aerial image categories. Experiments show that our approach is competitive several existing recognition models. Further, over 24 aerial images are categorized per second, reflecting that our system is ready for real-world applications.

查看原文本刊更多论文

基于多任务判别拓扑发现的大规模航空图像分类

快速准确地对谷歌地图上数以百万计的航拍图像进行分类是多媒体应用中的一项有用技术。由于两个原因，现有的方法不能有效地处理这个任务。1)构建实时图像分类系统具有挑战性，因为一些地理感知应用程序每秒更新超过20张航空图像。2)航拍图像的拓扑结构是区分航拍图像类别的关键，但航拍图像的拓扑结构不能被通用的视觉描述符编码。为了解决这两个问题，我们提出了一种高效的航空图像分类系统，旨在在多任务学习框架下挖掘航空图像的判别拓扑。特别地，我们首先构造一个区域邻接图(RAG)来描述每个航空图像的拓扑结构。因此，航空图像分类可以表述为RAG-to-RAG匹配。基于图论，通过比较它们各自的所有图元(即小子图)来进行RAG-to-RAG匹配。针对石墨烯数量庞大的特点，提出了一种多任务特征选择算法，用于发现对多个类别联合判别的拓扑。将发现的拓扑用于提取鉴别石墨烯。最后，将这些小块集成到AdaBoost模型中，用于预测航空图像类别。实验表明，我们的方法与现有的几种识别模型相比具有一定的竞争力。此外，每秒可以对超过24张航拍图像进行分类，这表明我们的系统已经为现实世界的应用做好了准备。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WISMM '14

自引率

0.00%

发文量