Simulating vision through time: Hierarchical, sparse models of visual cortex for motion imagery

A. Galbraith, S. Brumby, R. Chartrand
DOI: 10.1109/AIPR.2012.6528200
Published in: 2012 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), 2012-10-09
Citations: 1

Abstract

Efficient pattern recognition in motion imagery has become a growing challenge as the number of video sources proliferates worldwide. Historically, automated analysis of motion imagery, such as object detection, classification and tracking, has been accomplished using hand-designed feature detectors. Though useful, these feature detectors are not easily extended to new data sets or new target categories since they are often task specific, and typically require substantial effort to design. Rather than hand-designing filters, recent advances in the field of image processing have resulted in a theoretical framework of sparse, hierarchical, learned representations that can describe video data of natural scenes at many spatial and temporal scales and many levels of object complexity. These sparse, hierarchical models learn the information content of imagery and video from the data itself and lead to state-of-the-art performance and more efficient processing. Processing efficiency is important as it allows scaling up of research to work with dataset sizes and numbers of categories approaching real-world conditions. We now describe recent work at Los Alamos National Laboratory developing hierarchical sparse learning computer vision models that can process high definition color video in real time. We present preliminary results extending our prior work on object classification in still imagery [1] to discovery of useful features at different time scales in motion imagery for detection, classification and tracking of objects.
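The "sparse, learned representations" the abstract refers to are typically obtained by encoding each image patch as a sparse combination of learned dictionary features. The paper does not give its algorithm here, so the following is only a minimal illustrative sketch of sparse coding via ISTA (iterative shrinkage-thresholding) on a toy random dictionary; the dictionary `D`, patch size, and penalty `lam` are all hypothetical choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup: a dictionary D of 32 "features" (atoms)
# for 16-dimensional inputs, e.g. flattened 4x4 pixel patches.
n_dims, n_atoms = 16, 32
D = rng.standard_normal((n_dims, n_atoms))
D /= np.linalg.norm(D, axis=0)  # unit-norm atoms

def sparse_code(x, D, lam=0.1, n_iter=200):
    """Solve min_a 0.5*||x - D a||^2 + lam*||a||_1 with ISTA."""
    L = np.linalg.norm(D, 2) ** 2              # Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)               # gradient of the smooth term
        z = a - grad / L                       # gradient step
        a = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft-threshold
    return a

# Encode a patch that is an exact mix of two dictionary atoms;
# the recovered code should use only a few of the 32 atoms.
x = 0.8 * D[:, 3] + 0.5 * D[:, 17]
a = sparse_code(x, D)
print(np.count_nonzero(np.abs(a) > 1e-3), "active features out of", n_atoms)
```

In a hierarchical model of the kind the abstract describes, the sparse codes from one layer become the inputs to the next, and for motion imagery the dictionary atoms would extend over time (spatiotemporal patches) rather than single frames.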