Proceedings. IEEE International Conference on Multimedia and Expo最新文献

筛选
英文 中文
Watermark detection: benchmarking perspectives 水印检测:基准视角
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035654
N. Nikolaidis, V. Solachidis, A. Tefas, I. Pitas
{"title":"Watermark detection: benchmarking perspectives","authors":"N. Nikolaidis, V. Solachidis, A. Tefas, I. Pitas","doi":"10.1109/ICME.2002.1035654","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035654","url":null,"abstract":"Benchmarking of watermarking algorithms is a complicated task that requires examination of a set of mutually dependent performance factors (algorithm complexity, decoding/detection performance, and perceptual quality). This paper will focus on detection/decoding performance evaluation and try to summarize its basic principles. A methodology for deriving the corresponding performance metrics will also be provided.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"33 1","pages":"493-496 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76717678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Classifying emotions in human-machine spoken dialogs 对人机对话中的情绪进行分类
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035887
C. Lee, Shrikanth S. Narayanan, R. Pieraccini
{"title":"Classifying emotions in human-machine spoken dialogs","authors":"C. Lee, Shrikanth S. Narayanan, R. Pieraccini","doi":"10.1109/ICME.2002.1035887","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035887","url":null,"abstract":"This paper reports on the comparison between various acoustic feature sets and classification algorithms for classifying spoken utterances based on the emotional state of the speaker. The data set used for the analysis comes from a corpus of human-machine dialogs obtained from a commercial application. Emotion recognition is posed as a pattern recognition problem. We used three different techniques - linear discriminant classifier (LDC), k-nearest neighborhood (k-NN) classifier, and support vector machine classifier (SVC) -for classifying utterances into 2 emotion classes: negative and non-negative. In this study, two feature sets were used; the base feature set obtained from the utterance-level statistics of the pitch and energy of the speech, and the feature set analyzed by principal component analysis (PCA). PCA showed a performance comparable to the base feature sets. Overall, the LDC achieved the best performance with error rates of 27.54% on female data and 25.46% on males with the base feature set. The SVC, however, showed a better performance in the problem of data sparsity.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"3 1","pages":"737-740 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77236247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 75
New scalable three-stage motion estimation technique for mobile MPEG encoding 移动MPEG编码中新的可扩展三级运动估计技术
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035874
S. Mietens, P. D. With, C. Hentschel
{"title":"New scalable three-stage motion estimation technique for mobile MPEG encoding","authors":"S. Mietens, P. D. With, C. Hentschel","doi":"10.1109/ICME.2002.1035874","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035874","url":null,"abstract":"The paper presents a new scalable three-stage motion estimation technique, which includes processing of frames in display order and approximating motion vector fields using multiple references. Quality refinement is added as an optional stage. The complete system provides a flexible framework with a large scalability range in computational effort, resulting in different picture-quality levels or bitrates. Experiments show a scalable computational effort with a factor of 14, resulting in a global variation of 7 dB SNR in picture quality (with the \"Stefan\" sequence). In high-quality operation, the new method is comparable to a full-search motion estimation with a search window of 32/spl times/32 pixels (or even outperforms it). The innovation provides an excellent starting point for scalability in resource-constrained mobile system design (see Mietens, S. et al., IEEE Int. Conf. on Image Proc., ICIP 2001, vol.3, p.462-5, 2001).","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"67 1","pages":"685-688 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81119408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design and implementation of a dynamic VRML-browsable, movie on-demand system distributed over Internet 一个动态的、可浏览的、分布在Internet上的电影点播系统的设计与实现
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035765
G. Fortino, G. Confessore, A. Mantuano
{"title":"Design and implementation of a dynamic VRML-browsable, movie on-demand system distributed over Internet","authors":"G. Fortino, G. Confessore, A. Mantuano","doi":"10.1109/ICME.2002.1035765","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035765","url":null,"abstract":"This paper presents the design and the implementation of the Virtual Video Gallery (VVG), an advanced distributed video on-demand system accessible through a dynamic virtual world, which mimes an art gallery where movie posters are exhibited The design phase is driven by object-oriented modeling techniques which depend on UML and its extensions purposely suited to model multimedia information systems. The implementation centers on an approach blending (i) Java, which provides powerful computing, multimedia and networking capabilities, (ii) VRML, which supports an easy construction of complex virtual worlds, and (iii) WWW facilities.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"31 1","pages":"249-252 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84607609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Video realistic avatar for virtual face-to-face conferencing 视频逼真的虚拟面对面会议的化身
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035359
Yao-Jen Chang, Chien-Chia Chien, Yung-Chang Chen
{"title":"Video realistic avatar for virtual face-to-face conferencing","authors":"Yao-Jen Chang, Chien-Chia Chien, Yung-Chang Chen","doi":"10.1109/ICME.2002.1035359","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035359","url":null,"abstract":"Facial animation standardized by MPEG-4 provides a common form of description and transmission for talking head related applications. With animated talking heads, a virtual conferencing system can be created by providing a 3-D virtual environment for face-to-face communication and casual navigation. Not only bit-rate consumption is reduced, 3-D visualization is also provided which greatly improves the sense of presence when compared to conventional video conferencing system. An integrated architecture is presented by taking advantage of 2-D video coding and 3-D model-based coding to create video realistic avatars for virtual face-to-face conferencing system. Preliminary experimental results indicate more than 4 dB improvement in PSNR can be achieved at the same bit rate when compared to conventional video coding. The incorporation of 3-D facial models also enables rendering from arbitrary viewpoints for use in a virtual conferencing system.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"9 1","pages":"1-4 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79983118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Image indexing and retrieval using visual keyword histograms 使用视觉关键字直方图的图像索引和检索
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035756
Joo-Hwee Lim, Jesse S. Jin
{"title":"Image indexing and retrieval using visual keyword histograms","authors":"Joo-Hwee Lim, Jesse S. Jin","doi":"10.1109/ICME.2002.1035756","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035756","url":null,"abstract":"We propose a novel image representation called visual keyword histogram (VKH) for content-based indexing and retrieval. Visual keywords are domain-relevant visual prototypes (e.g. faces, foliage, buildings etc) with both perceptual appearance and textual semantics. Collectively, VKHs axe computed over spatial tessellation to represent the distribution of visual keywords in various parts of an image. To construct a vocabulary of visual keywords, an incremental neural network is deployed to learn visual keywords from examples. This allows us to build domain-specific visual vocabularies rapidly and incrementally. Last but not least, we propose a new visual query language called Query by Spatial Icons (QBSI) that allows a user to specify a query in terms of \"what\" and \"where\". A visual query term constrains whether a visual keyword should be present and a query formals chains these terms into a disjunctive normal form via logical operators. We show our approach on real and complex home photos with very promising results.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"35 1","pages":"213-216 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82060707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A performance analysis of spread-spectrum watermarking based on redundant transforms 基于冗余变换的扩频水印性能分析
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035675
Li Hua, J. Fowler
{"title":"A performance analysis of spread-spectrum watermarking based on redundant transforms","authors":"Li Hua, J. Fowler","doi":"10.1109/ICME.2002.1035675","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035675","url":null,"abstract":"Spread-spectrum watermarking, in which random noise is added to transform coefficients and detected with a correlation operator has become a preferred paradigm for many watermarking applications. This paper analyzes the performance of such a watermarking system when the underlying transform is a tight frame rather than a traditional orthonormal expansion. The analysis indicates that a tight frame offers no inherent performance advantage over an orthonormal transform in the watermark-detection process despite the well known ability of redundant transforms to accommodate greater amounts of added noise for a given distortion.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"32 1","pages":"553-556 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86947577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
An overcomplete discrete wavelet transform for video compression 一种用于视频压缩的过完备离散小波变换
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035863
N. Sebe, C. Lamba, M. Lew
{"title":"An overcomplete discrete wavelet transform for video compression","authors":"N. Sebe, C. Lamba, M. Lew","doi":"10.1109/ICME.2002.1035863","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035863","url":null,"abstract":"The translated function with any integer multiple of the sampling period is completely represented in the wavelet space by one of the overcomplete discrete wavelet transform (ODWT) members. This theoretical result leads to a new motion estimation and motion compensation scheme working in the wavelet transform domain. Our experiments, performed on real image sequences, show high quality and low bit rate performances. Moreover, by performing the motion estimation in the wavelet space a major reduction of the computational cost is achieved.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"41 1","pages":"641-644 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86506532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Let the sunshine on your screen: introducing augmented reality into interactive television 让阳光照在你的屏幕上:将增强现实技术引入互动电视
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035912
J. Stauder, P. Robert
{"title":"Let the sunshine on your screen: introducing augmented reality into interactive television","authors":"J. Stauder, P. Robert","doi":"10.1109/ICME.2002.1035912","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035912","url":null,"abstract":"The paper discusses the integration of augmented reality techniques into interactive television (ITV) and presents a new method for ensuring photometric realism. ITV services known today are email and World Wide Web (WWW) access. This article focuses on a future option for interactive television: the integration of games and virtual 3D scenes. For example, while watching football, you may be invited by a joint WWW commercial to have a look at a virtual 3D model of a sporting shoe. Displaying the sporting shoe on the screen in front of the football scene is in fact a case of augmented reality. The paper proposes a fast and simple method to extract a number of light sources from the football video that ensure a dynamic illumination of the virtual object depending on the content of the TV screen. The method works for all kinds of video content since only the light source intensities are estimated from the video signal while the light source positions are set by a fixed rule.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"34 1","pages":"837-840 vol.1"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82638626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The Certimark benchmark: architecture and future perspectives Certimark基准:架构和未来前景
Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI: 10.1109/ICME.2002.1035651
J. Vorbrüggen, François Cayre
{"title":"The Certimark benchmark: architecture and future perspectives","authors":"J. Vorbrüggen, François Cayre","doi":"10.1109/ICME.2002.1035651","DOIUrl":"https://doi.org/10.1109/ICME.2002.1035651","url":null,"abstract":"The Certimark Consortium consists of 15 partners from European industry and academia. The consortium has been developing a benchmark suite that will enable its users to evaluate digital watermarking technologies. Here, we present the basic architecture of the benchmark suite and its underlying rationale. We also provide an outlook on future plans for making the benchmark suite available outside the consortium and for its further development.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"36 1","pages":"485-488 vol.2"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86775225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信