嵌入格式塔原理的分层距离相关非参数贝叶斯模型视频分割

2019 3rd International Symposium on Autonomous Systems (ISAS) Pub Date : 2019-05-01 DOI:10.1109/ISASS.2019.8757719

Yue Gao

{"title":"嵌入格式塔原理的分层距离相关非参数贝叶斯模型视频分割","authors":"Yue Gao","doi":"10.1109/ISASS.2019.8757719","DOIUrl":null,"url":null,"abstract":"Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.","PeriodicalId":359959,"journal":{"name":"2019 3rd International Symposium on Autonomous Systems (ISAS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Embedding Gestalt Principles to Hierarchical Distance Dependent Nonparametric Bayesian Model for Video Segmentation\",\"authors\":\"Yue Gao\",\"doi\":\"10.1109/ISASS.2019.8757719\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.\",\"PeriodicalId\":359959,\"journal\":{\"name\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISASS.2019.8757719\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Symposium on Autonomous Systems (ISAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISASS.2019.8757719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

格式塔是一个心理学术语，意思是“统一的整体”，指的是20世纪20年代发展起来的视觉感知理论。这些理论试图描述当应用某些原则时，人们如何倾向于将视觉元素组织成组或统一的整体。这些原则如相似性、共同命运、延续性等都是直观易懂的。然而，如何将格式塔理论编码为构建视觉过程计算模型的原理是一个挑战。在计算机视觉领域，分割等视觉处理任务很大程度上得益于视频的时空信息。因此，我们提出研究对象数量未知的视频分割。我们通过制定一个分层非参数贝叶斯模型来实现这一点。我们的模型包含三个关键特征:1)它嵌入格式塔原则作为模型的先验;2)它是一个距离依赖的非参数贝叶斯模型，其中数据点的时空顺序很重要。3)这是一个层次模型，我们同时考虑了本地和全球方面。我们表明，我们的无监督生成模型在人类视觉分割任务和一些心理学实验中具有相似的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Embedding Gestalt Principles to Hierarchical Distance Dependent Nonparametric Bayesian Model for Video Segmentation

Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 3rd International Symposium on Autonomous Systems (ISAS)

自引率

0.00%

发文量