Attribute-based vehicle search in crowded surveillance videos

Proceedings of the 1st ACM International Conference on Multimedia Retrieval Pub Date : 2011-04-18 DOI:10.1145/1991996.1992014

R. Feris, Behjat Siddiquie, Y. Zhai, James Petterson, L. Brown, Sharath Pankanti

{"title":"Attribute-based vehicle search in crowded surveillance videos","authors":"R. Feris, Behjat Siddiquie, Y. Zhai, James Petterson, L. Brown, Sharath Pankanti","doi":"10.1145/1991996.1992014","DOIUrl":null,"url":null,"abstract":"We present a novel application for searching for vehicles in surveillance videos based on semantic attributes. At the interface, the user specifies a set of vehicle characteristics (such as color, direction of travel, speed, length, height, etc.) and the system automatically retrieves video events that match the provided description. A key differentiating aspect of our system is the ability to handle challenging urban conditions such as high volumes of activity and environmental factors. This is achieved through a novel multi-view vehicle detection approach which relies on what we call motionlet classifiers, i.e. classifiers that are learned with vehicle samples clustered in the motion configuration space. We employ massively parallel feature selection to learn compact and accurate motionlet detectors. Moreover, in order to deal with different vehicle types (buses, trucks, SUVs, cars), we learn the motionlet detectors in a shape-free appearance space, where all training samples are resized to the same aspect ratio, and then during test time the aspect ratio of the sliding window is changed to allow the detection of different vehicle types. Once a vehicle is detected and tracked over the video, fine-grained attributes are extracted and ingested into a database to allow future search queries such as \"Show me all blue trucks larger than 7ft length traveling at high speed northbound last Saturday, from 2pm to 5pm\".","PeriodicalId":390933,"journal":{"name":"Proceedings of the 1st ACM International Conference on Multimedia Retrieval","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st ACM International Conference on Multimedia Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1991996.1992014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 45

Abstract

We present a novel application for searching for vehicles in surveillance videos based on semantic attributes. At the interface, the user specifies a set of vehicle characteristics (such as color, direction of travel, speed, length, height, etc.) and the system automatically retrieves video events that match the provided description. A key differentiating aspect of our system is the ability to handle challenging urban conditions such as high volumes of activity and environmental factors. This is achieved through a novel multi-view vehicle detection approach which relies on what we call motionlet classifiers, i.e. classifiers that are learned with vehicle samples clustered in the motion configuration space. We employ massively parallel feature selection to learn compact and accurate motionlet detectors. Moreover, in order to deal with different vehicle types (buses, trucks, SUVs, cars), we learn the motionlet detectors in a shape-free appearance space, where all training samples are resized to the same aspect ratio, and then during test time the aspect ratio of the sliding window is changed to allow the detection of different vehicle types. Once a vehicle is detected and tracked over the video, fine-grained attributes are extracted and ingested into a database to allow future search queries such as "Show me all blue trucks larger than 7ft length traveling at high speed northbound last Saturday, from 2pm to 5pm".

查看原文本刊更多论文

拥挤监控视频中基于属性的车辆搜索

提出了一种基于语义属性的监控视频车辆搜索新方法。在界面上，用户指定一组车辆特征(如颜色、行驶方向、速度、长度、高度等)，系统自动检索与提供的描述相匹配的视频事件。我们的系统与众不同的一个关键方面是能够处理具有挑战性的城市条件，如大量的活动和环境因素。这是通过一种新的多视图车辆检测方法实现的，该方法依赖于我们称之为运动let分类器的方法，即通过在运动配置空间中聚集的车辆样本来学习的分类器。我们采用大规模并行特征选择来学习紧凑和精确的运动检测器。此外，为了处理不同的车辆类型(公共汽车，卡车，suv，轿车)，我们在无形状的外观空间中学习运动检测器，其中所有训练样本都调整为相同的宽高比，然后在测试期间改变滑动窗口的宽高比以允许检测不同的车辆类型。一旦通过视频检测到并跟踪车辆，系统就会提取出细粒度的属性，并将其输入数据库，以便将来进行搜索查询，例如“显示上周六下午2点至5点高速向北行驶的所有长度大于7英尺的蓝色卡车”。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 1st ACM International Conference on Multimedia Retrieval

自引率

0.00%

发文量