2005 IEEE International Conference on Multimedia and Expo最新文献

筛选
英文 中文
Separable bilateral filtering for fast video preprocessing 用于快速视频预处理的可分离双边滤波
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/icme.2005.1521458
Tuan Q. Pham, L. Vliet
{"title":"Separable bilateral filtering for fast video preprocessing","authors":"Tuan Q. Pham, L. Vliet","doi":"10.1109/icme.2005.1521458","DOIUrl":"https://doi.org/10.1109/icme.2005.1521458","url":null,"abstract":"Bilateral filtering is an edge-preserving filtering technique that employs both geometric closeness and photometric similarity of neighboring pixels to construct its filter kernel. Multi-dimensional bilateral filtering is computationally expensive because the adaptive kernel has to be recomputed at every pixel. In this paper, we present a separable implementation of the bilateral filter. The separable implementation offers equivalent adaptive filtering capability at a fraction of execution time compared to the traditional filter. Because of this efficiency, the separable bilateral filter can be used for fast preprocessing of images and videos. Experiments show that better image quality and higher compression efficiency is achievable if the original video is preprocessed with the separable bilateral filter.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114654778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 271
Segmentation of 3D Objects Using Pulse-Coupled Oscillator Networks 基于脉冲耦合振荡器网络的三维物体分割
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521383
Eva Ceccarelli, A. Bimbo, P. Pala
{"title":"Segmentation of 3D Objects Using Pulse-Coupled Oscillator Networks","authors":"Eva Ceccarelli, A. Bimbo, P. Pala","doi":"10.1109/ICME.2005.1521383","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521383","url":null,"abstract":"Along with image and video libraries, archives of 3D models have recently gained increasing attention. Accordingly, there is an increasing demand for solutions enabling retrieval of 3D models based on global properties as well as properties of object parts. In particular, retrieval based on object parts relies on segmentation of 3D objects into their constituent parts. This is a challenging task, as the identification of object parts should conform to human perceptual judgement. Therefore, definition of models and solutions that enable decomposition of 3D objects into perceptually relevant parts is a fundamental step to enable effective retrieval based on object parts. However, a few approaches have been proposed to support segmentation of 3D meshes into perceptually relevant parts. In this paper, we propose a model based on pulse-coupled oscillator networks. Preliminary experiments are reported to demonstrate the validity and potential of the proposed solution","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116034783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An integrated approach for generic object detection using kernel PCA and boosting 一种基于核主成分分析和增强的通用目标检测方法
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521600
Saad Ali, M. Shah
{"title":"An integrated approach for generic object detection using kernel PCA and boosting","authors":"Saad Ali, M. Shah","doi":"10.1109/ICME.2005.1521600","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521600","url":null,"abstract":"In this paper, we present a novel framework for generic object class detection by integrating Kernel PCA with AdaBoost. The classifier obtained in this way is invariant to changes in appearance, illumination conditions and surrounding clutter. A nonlinear shape subspace is learned for positive and negative object classes using kernel PCA. Features are derived by projecting example images onto the learned sub-spaces. Base learners are modeled using Bayes classifier. AdaBoost is then employed to discover the features that are most relevant for the object detection task at hand. Proposed method has been successfully tested on wide range of object classes (cars, airplanes, pedestrians, motorcycles etc) using standard data sets and has shown good performance. Using a small training set, the classifier learned in this way was able to generalize the intra-class variation while still maintaining high detection rate. In most object categories, we achieved detection rates of above 95% with minimal false alarm rates. We demonstrate the comparative performance of our method against current state of the art approaches.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123577461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Context-Aware Dynamic Presentation Synthesis for Exploratory Multimodal Environments 探索性多模态环境的上下文感知动态表示合成
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521596
H. Sridharan, Ankur Mani, H. Sundaram, J. Brungart, David Birchfield
{"title":"Context-Aware Dynamic Presentation Synthesis for Exploratory Multimodal Environments","authors":"H. Sridharan, Ankur Mani, H. Sundaram, J. Brungart, David Birchfield","doi":"10.1109/ICME.2005.1521596","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521596","url":null,"abstract":"In this paper, we develop a novel real-time, interactive, automatic multimodal exploratory environment that dynamically adapts the media presented, to user context. There are two key contributions of this paper-(a) development of multimodal user-context model and (b) modeling the dynamics of the presentation to maximize coherence. We develop a novel user-context model comprising interests, media history, interaction behavior and tasks, that evolves based on the specific interaction. We also develop novel metrics between media elements and the user context. The presentation environment dynamically adapts to the current user context. We develop an optimal media selection and display framework that maximizes coherence, while constrained by the user-context, user goals and the structure of the knowledge in the exploratory environment. The experimental results indicate that the system performs well. The results also show that user-context models significantly improve presentation coherence","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121896553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Constraint-Based Approach for the Authoring of Multi-Topic Multimedia Presentations 一种基于约束的多主题多媒体演示文稿创作方法
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521489
E. Bertino, E. Ferrari, A. Perego, Diego Santi
{"title":"A Constraint-Based Approach for the Authoring of Multi-Topic Multimedia Presentations","authors":"E. Bertino, E. Ferrari, A. Perego, Diego Santi","doi":"10.1109/ICME.2005.1521489","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521489","url":null,"abstract":"Synchronized multimedia applications play an important role in a digital library environment, since they allow one to efficiently disseminate knowledge among differently skilled users through an approach, which is more direct than the classic 'static' documents. In this paper, we propose a new authoring approach based on an innovative presentation structure and a new class of content-based constraints. Thanks to a flexible heuristic process, such features allow the author to easily combine several multimedia objects into a multi-topic presentation, whose different contents can be freely chosen by end users according to their preferences or skills","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121957669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Fast Search Method for Image Vector Quantization Based on Equal-Average Equal-Variance and Partial Sum Concept 基于等平均等方差和部分和概念的图像矢量量化快速搜索方法
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521702
Z. Pan, K. Kotani, T. Ohmi
{"title":"Fast Search Method for Image Vector Quantization Based on Equal-Average Equal-Variance and Partial Sum Concept","authors":"Z. Pan, K. Kotani, T. Ohmi","doi":"10.1109/ICME.2005.1521702","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521702","url":null,"abstract":"The encoding process of image vector quantization (VQ) is very heavy due to it performing a lot of k-dimensional Euclidean distance computations. In order to speed up VQ encoding, it is most important to avoid unnecessary exact Euclidean distance computations as many as possible by using features of a vector to estimate how large it is first so as to reject most of unlikely codewords. The mean, the variance, L 2 norm and partial sum of a vector have been proposed as effective features in previous works for fast VQ encoding. Recently, in the previous work (Z. Lu et al., 2003), three features of the mean, the variance and L2 norm are used together to derive an EEENNS search method, which is very search efficient but still has obvious computational redundancy. This paper aims at modifying the results of EEENNS method further by introducing another feature of partial sum to replace L2 norm feature so as to reduce more search space. Mathematical analysis and experimental results confirmed that the proposed method is more search efficient compared to (Z. Lu et al., 2003)","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117089064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatiotemporal saliency for human action recognition 人类动作识别的时空显著性
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521452
A. Oikonomopoulos, I. Patras, M. Pantic
{"title":"Spatiotemporal saliency for human action recognition","authors":"A. Oikonomopoulos, I. Patras, M. Pantic","doi":"10.1109/ICME.2005.1521452","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521452","url":null,"abstract":"This paper addresses the problem of human action recognition by introducing a sparse representation of image sequences as a collection of spatiotemporal events that are localized at points that are salient both in space and time. We detect the spatiotemporal salient points by measuring changes in the information content of pixel neighborhoods not only in space but also in time. We introduce an appropriate distance metric between two collections of spatiotemporal salient points that is based on the Chamfer distance and an iterative linear time warping technique that deals with time expansion or time compression issues. We propose a classification scheme that is based on relevance vector machines and on the proposed distance measure. We present results on real image sequences from a small database depicting people performing 19 aerobic exercises.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124025872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Aggregating signatures of MPEG-4 elementary streams 聚合MPEG-4基本流的签名
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521390
Yongdong Wu
{"title":"Aggregating signatures of MPEG-4 elementary streams","authors":"Yongdong Wu","doi":"10.1109/ICME.2005.1521390","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521390","url":null,"abstract":"A complete MPEG-4 stream consists of many elementary streams, which may be generated by different authors. In the scenario of this paper, each author signs his own authentic elementary stream independently, and then an untrusted distributor aggregates these signatures into only one. Based on the unique signature, a client is able to verify the received MPEG-4 stream with the certificates of all the authors other than the certificate of the distributor. In addition, each author cannot deny what he has signed even if he is willing to admit a signature on another ES. This aggregated signature scheme is efficient in terms of transmission overhead and verification time since only one signature is processed in the client side.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124760486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Watermarking based Image Authentication using Feature Amplification 基于水印的图像特征放大认证
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521497
Shuiming Ye, E. Chang, Qibin Sun
{"title":"Watermarking based Image Authentication using Feature Amplification","authors":"Shuiming Ye, E. Chang, Qibin Sun","doi":"10.1109/ICME.2005.1521497","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521497","url":null,"abstract":"In a typical content and watermarking based image authentication approach, a feature is extracted from the given image, and then embedded back into the image using a watermarking method. Since the entropy of the feature might be higher than the capacity of the watermarking scheme, or the feature is represented in a continuous domain, it has to be further quantized before embedding. The lost of information during quantization potentially degrades the overall performance of the authentication scheme. This paper propose a simple but effective approach that avoids the feature quantization by additive feature: the feature is firstly added into the image before watermark embedding, and latterly subtracted from the watermarked image. In our experiments, the proposed approach obtains larger achievable robustness/sensitivity region and has a smaller fuzzy region of authenticity than the typical approach","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128702665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Infolink: Analysis of Dutch Broadcast News and Cross-Media Browsing 荷兰广播新闻与跨媒体浏览分析
2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521738
Jeroen Morang, R. Ordelman, F. D. Jong, A. V. Hessen
{"title":"Infolink: Analysis of Dutch Broadcast News and Cross-Media Browsing","authors":"Jeroen Morang, R. Ordelman, F. D. Jong, A. V. Hessen","doi":"10.1109/ICME.2005.1521738","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521738","url":null,"abstract":"In this paper, a cross-media browsing demonstrator named InfoLink is described. InfoLink automatically links the content of Dutch broadcast news videos to related information sources in parallel collections containing text and/or video. Automatic segmentation, speech recognition and available meta-data are used to index and link items. The concept is visualized using SMIL-scripts for presenting the streaming broadcast news video and the information links","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127034801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信