视觉和视频的分层表示

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI:10.1109/WVRS.1995.476846

E. Adelson

{"title":"视觉和视频的分层表示","authors":"E. Adelson","doi":"10.1109/WVRS.1995.476846","DOIUrl":null,"url":null,"abstract":"Human vision, machine vision, and image coding, each demand representations that are useful and efficient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces, lighting, transparency, etc. These representations fall in the domain of \"mid-level\" vision, and there is accumulating evidence of their importance in human vision. By representing images with these more sophisticated vocabularies we can increase the flexibility and efficiency of our vision and image coding systems. We are developing systems that decompose image sequences into overlapping layers, rather like the \"cels\" used by a traditional animator. These layers are ordered in depth, sliding over one another and being combined according to the rules of transparency and occlusion. Using the layered representation we can achieve greatly improved motion analysis and image segmentation. By applying layers to image coding we can achieve data compression far better than MPEG, and achieve frame-rate independence as a side benefit. Moreover, the image sequence is decomposed in a meaningful way, which allows flexible image editing and access.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Layered representations for vision and video\",\"authors\":\"E. Adelson\",\"doi\":\"10.1109/WVRS.1995.476846\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human vision, machine vision, and image coding, each demand representations that are useful and efficient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces, lighting, transparency, etc. These representations fall in the domain of \\\"mid-level\\\" vision, and there is accumulating evidence of their importance in human vision. By representing images with these more sophisticated vocabularies we can increase the flexibility and efficiency of our vision and image coding systems. We are developing systems that decompose image sequences into overlapping layers, rather like the \\\"cels\\\" used by a traditional animator. These layers are ordered in depth, sliding over one another and being combined according to the rules of transparency and occlusion. Using the layered representation we can achieve greatly improved motion analysis and image segmentation. By applying layers to image coding we can achieve data compression far better than MPEG, and achieve frame-rate independence as a side benefit. Moreover, the image sequence is decomposed in a meaningful way, which allows flexible image editing and access.\",\"PeriodicalId\":447791,\"journal\":{\"name\":\"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WVRS.1995.476846\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WVRS.1995.476846","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

人类视觉、机器视觉和图像编码都需要有用和高效的表示。目前最成熟的技术是基于低级处理的。未来的图像分析和图像编码系统将越来越多地使用涉及诸如表面、照明、透明度等概念的图像表示。这些表征属于“中级”视觉领域，越来越多的证据表明它们在人类视觉中的重要性。通过使用这些更复杂的词汇表来表示图像，我们可以提高视觉和图像编码系统的灵活性和效率。我们正在开发将图像序列分解成重叠层的系统，就像传统动画师使用的“细胞”一样。这些图层按深度排序，彼此滑动，并根据透明度和遮挡规则组合。使用分层表示可以大大提高运动分析和图像分割的效果。通过将层应用于图像编码，我们可以实现比MPEG更好的数据压缩，并实现帧率独立性作为附带好处。此外，对图像序列进行了有意义的分解，使图像编辑和访问更加灵活。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Layered representations for vision and video

Human vision, machine vision, and image coding, each demand representations that are useful and efficient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces, lighting, transparency, etc. These representations fall in the domain of "mid-level" vision, and there is accumulating evidence of their importance in human vision. By representing images with these more sophisticated vocabularies we can increase the flexibility and efficiency of our vision and image coding systems. We are developing systems that decompose image sequences into overlapping layers, rather like the "cels" used by a traditional animator. These layers are ordered in depth, sliding over one another and being combined according to the rules of transparency and occlusion. Using the layered representation we can achieve greatly improved motion analysis and image segmentation. By applying layers to image coding we can achieve data compression far better than MPEG, and achieve frame-rate independence as a side benefit. Moreover, the image sequence is decomposed in a meaningful way, which allows flexible image editing and access.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)

自引率

0.00%

发文量