Perception Principles Guided Video Segmentation

2005 IEEE 7th Workshop on Multimedia Signal Processing Pub Date : 2005-10-01 DOI:10.1109/MMSP.2005.248664

Cheng Chen, Guoliang Fan

{"title":"Perception Principles Guided Video Segmentation","authors":"Cheng Chen, Guoliang Fan","doi":"10.1109/MMSP.2005.248664","DOIUrl":null,"url":null,"abstract":"In this paper, we present a perception principles-guided video segmentation method, where statistical modeling and graph-theoretic approaches are combined in a multi-layer classification architecture. Various visual cues are effectively incorporated in a sequential segmentation process. Specifically, low-level pixel-wise features are used in the first layer where a joint spatio-temporal statistical modeling approach is used to construct entry-level visual units in space-time. In the second layer, all units are first classified into dynamic or static units based their motion magnitudes. Then dynamic units are further parsed into over-segmented moving regions that are connected in space and time, and a mid-level feature, motion trajectory, is extracted for each moving region. In the third layer, still and moving regions are merged into background and moving objects by a graph-based approach with different similarity metrics. The proposed algorithm employs both long-range motion information, i.e., trajectory, and short-range motion information, i.e., change detection, to retain temporal continuity and spatial homogeneity of moving objects. The proposed multi-layer structure ensembles the joint spatio-temporal and cascade process of perception principles and support efficient and accurate object segmentation","PeriodicalId":191719,"journal":{"name":"2005 IEEE 7th Workshop on Multimedia Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE 7th Workshop on Multimedia Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2005.248664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In this paper, we present a perception principles-guided video segmentation method, where statistical modeling and graph-theoretic approaches are combined in a multi-layer classification architecture. Various visual cues are effectively incorporated in a sequential segmentation process. Specifically, low-level pixel-wise features are used in the first layer where a joint spatio-temporal statistical modeling approach is used to construct entry-level visual units in space-time. In the second layer, all units are first classified into dynamic or static units based their motion magnitudes. Then dynamic units are further parsed into over-segmented moving regions that are connected in space and time, and a mid-level feature, motion trajectory, is extracted for each moving region. In the third layer, still and moving regions are merged into background and moving objects by a graph-based approach with different similarity metrics. The proposed algorithm employs both long-range motion information, i.e., trajectory, and short-range motion information, i.e., change detection, to retain temporal continuity and spatial homogeneity of moving objects. The proposed multi-layer structure ensembles the joint spatio-temporal and cascade process of perception principles and support efficient and accurate object segmentation

查看原文本刊更多论文

感知原则指导视频分割

在本文中，我们提出了一种以感知原则为指导的视频分割方法，该方法将统计建模和图论方法结合在多层分类体系结构中。各种视觉线索有效地结合在一个顺序分割过程。具体来说，在第一层中使用低级像素特征，其中使用联合时空统计建模方法来构建时空中的入门级视觉单元。在第二层中，所有单位首先根据其运动幅度分为动态或静态单位。然后将动态单元进一步解析为在空间和时间上相互连接的过度分割的运动区域，并为每个运动区域提取一个中级特征——运动轨迹。在第三层，采用基于图的方法，使用不同的相似度度量将静止区域和运动区域合并为背景和运动对象。该算法同时利用远程运动信息(即轨迹)和近距离运动信息(即变化检测)来保持运动目标的时间连续性和空间同质性。所提出的多层结构集成了感知原理的时空联合和级联过程，支持高效、准确的目标分割

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2005 IEEE 7th Workshop on Multimedia Signal Processing

自引率

0.00%

发文量