嵌入格式塔原理的分层距离相关非参数贝叶斯模型视频分割

Yue Gao
{"title":"嵌入格式塔原理的分层距离相关非参数贝叶斯模型视频分割","authors":"Yue Gao","doi":"10.1109/ISASS.2019.8757719","DOIUrl":null,"url":null,"abstract":"Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.","PeriodicalId":359959,"journal":{"name":"2019 3rd International Symposium on Autonomous Systems (ISAS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Embedding Gestalt Principles to Hierarchical Distance Dependent Nonparametric Bayesian Model for Video Segmentation\",\"authors\":\"Yue Gao\",\"doi\":\"10.1109/ISASS.2019.8757719\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.\",\"PeriodicalId\":359959,\"journal\":{\"name\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISASS.2019.8757719\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Symposium on Autonomous Systems (ISAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISASS.2019.8757719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

格式塔是一个心理学术语,意思是“统一的整体”,指的是20世纪20年代发展起来的视觉感知理论。这些理论试图描述当应用某些原则时,人们如何倾向于将视觉元素组织成组或统一的整体。这些原则如相似性、共同命运、延续性等都是直观易懂的。然而,如何将格式塔理论编码为构建视觉过程计算模型的原理是一个挑战。在计算机视觉领域,分割等视觉处理任务很大程度上得益于视频的时空信息。因此,我们提出研究对象数量未知的视频分割。我们通过制定一个分层非参数贝叶斯模型来实现这一点。我们的模型包含三个关键特征:1)它嵌入格式塔原则作为模型的先验;2)它是一个距离依赖的非参数贝叶斯模型,其中数据点的时空顺序很重要。3)这是一个层次模型,我们同时考虑了本地和全球方面。我们表明,我们的无监督生成模型在人类视觉分割任务和一些心理学实验中具有相似的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Embedding Gestalt Principles to Hierarchical Distance Dependent Nonparametric Bayesian Model for Video Segmentation
Gestalt is a psychology term meaning “unified whole” which refers to the theories of visual perception developed in the 1920s. These theories attempt to describe how people tend to organize visual elements into groups or unified wholes when certain principles are applied. These principles such as similarity, common fate, continuation and etc. are intuitive to understand. However the challenge is to encode Gestalt theory as the principle to construct a computational model for visual process. In the domain of computer vision, visual processing tasks such as segmentation benefits greatly from spatio-temporal information from videos. Hence, we propose to study video segmentation where the number of objects is unknown. We achieve this by formulating a hierarchical nonparametric Bayesian model. Our model contains three key features 1) it embeds Gestalt principles as the prior of the model 2) it is a distance dependent nonparametric Bayesian model where the spatial temporal order of the data points matters. 3) it is a hierarchical model where we considered both the local and global aspects. We show that that our unsupervised generative model share similar results in human visual segmentation tasks as well as some psychology experiments.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信