1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175): Latest Papers

Information theoretic bounds for data hiding in compressed images
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.738945
M. Ramkumar, A. Akansu
Abstract: We present an information-theoretic approach to obtain an estimate of the number of bits that can be hidden in still images, or, the capacity of the data-hiding channel. We show how addition of the message signal in a suitable transform domain rather than the spatial domain can significantly increase the channel capacity. We compare the capacities achievable with different decompositions like DCT, DFT, Hadamard, and subband transforms.
Citations: 42
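To make the notion of a data-hiding "channel capacity" in the abstract above more concrete, here is a minimal sketch that sums the textbook Gaussian-channel capacity over independent transform bands. It is not taken from the paper: the parallel-Gaussian model, the function name `parallel_gaussian_capacity`, and the variance values are all illustrative assumptions.

```python
import math


def parallel_gaussian_capacity(bands):
    """Sum 0.5 * log2(1 + S/N) bits per coefficient over independent bands.

    `bands` is an iterable of (message_power, noise_power) pairs, one per
    transform band. Both the band model and any numbers passed in are
    assumptions for illustration, not values from the paper.
    """
    total_bits = 0.0
    for message_power, noise_power in bands:
        if noise_power <= 0:
            raise ValueError("noise power must be positive")
        # Capacity of a single Gaussian channel, in bits per coefficient.
        total_bits += 0.5 * math.log2(1.0 + message_power / noise_power)
    return total_bits


# Two hypothetical bands: one tolerates a stronger embedded signal.
example_bits = parallel_gaussian_capacity([(3.0, 1.0), (1.0, 1.0)])
print(example_bits)
```

Under this assumed model, a transform that concentrates the tolerable message power into bands with little noise raises the sum, which is one way to read the abstract's claim that embedding in a transform domain can increase capacity.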
Joint source channel coding with hybrid FEC/ARQ for buffer constrained video transmission
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.739041
R. Puri, K. Ramchandran, Antonio Ortega
Abstract: We propose an automatic repeat request (ARQ)/forward error correction (FEC) scheme for synchronous transmission of video over a binary symmetric constant rate channel. The approach consists of jointly allocating source and channel rates to video blocks from a given admissible set subject to the buffer or equivalently end-end delay constraints. The channel codes used are the popular class of powerful FEC codes known as rate-compatible punctured convolutional (RCPC) codes. The method used involves independent coding of the video units and optimization of the end-to-end expected delivered video quality. The existence of a return channel is assumed through which the decoder informs the encoder about the success/failure of the transmission. In the event of a failure, incremental parity information is sent to the decoder for correcting errors and a reallocation performed at the encoder. The simulations done point out the efficacy of the proposed scheme.
Citations: 24
Ordered statistics decoding of linear block codes for robust H.263 video transmission in AWGN channel
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.739043
Wu-Hsiang Jonas Chen, Jenq-Neng Hwang
Abstract: Soft-decision decoding of linear block codes for robust H.263 video transmission in a zero-mean, additive white Gaussian noise (AWGN) channel is investigated. We implement an effective error concealment (EC) scheme at the source decoder to reduce the annoying artifacts caused by decoding a corrupted bit stream. To alleviate the spatial and temporal error propagation, an error prevention (EP) strategy is introduced at the H.263 encoder. Simulation results show that a large portion of the peak signal-to-noise ratio (PSNR) gain is obtained by ordered statistics decoding of the received sequence. Furthermore, the residual channel coding errors are concealed and compensated by realizing the proposed EC and EP schemes.
Citations: 1
Image watermarking based on the fractal transform: a draft demonstration
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.738960
Stéphane Roche, J. Dugelay
Abstract: The aim is to present the ongoing performance of our R&D watermarking scheme software. The proposed illustrations cover a large panel of original images (in grey levels and colors), signatures and attacks. Evaluation is performed according to ratio, visibility and robustness.
Citations: 13
Limited retransmission of real-time layered multimedia
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.739045
Matthew Podolsky, M. Vetterli, S. McCanne
Abstract: In contrast to multimedia applications that involve human-to-human communication, streaming media over the Internet enjoys relaxed delay constraints. Thus, streaming media servers are at liberty to retransmit missing packets to avoid unnecessary signal corruption. While state-of-the-art media servers employ such strategies, no work to date has proposed an optimal strategy for delay-constrained retransmissions of streaming media. In this paper, we propose a framework for streaming media retransmission based on layered media representations and explore the performance advantage of integrating layered signal structure into the retransmission strategy. In our approach, the source must choose between transmitting an older layer that expires sooner and a newer layer that expires later but is more important. To arrive at the proper mix of these two extreme strategies, we derive an optimal strategy for transmitting layered data over a binary erasure channel with instantaneous feedback. To provide a quantitative performance comparison of different transmission policies, we conduct a Markov-chain analysis, which shows that the best transmission policy is time-invariant and thus does not change as the layers approach their expiration times.
Citations: 33
Linear discriminant analysis for speechreading
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-12-07 DOI: 10.1109/MMSP.1998.738938
G. Potamianos, H. Graf
Abstract: This paper investigates the use of Fisher-Rao (1965) linear discriminant analysis (LDA) as a means of visual feature extraction for hidden Markov model based automatic speechreading. For every video frame, a three-dimensional region of interest containing the speaker's mouth over a sequence of adjacent frames is lexicographically arranged into a data vector. Such vectors are then projected onto the space of the most discriminant "eigensequences", estimated by means of LDA on a training set of image sequence vectors, labeled from a set of a-priori chosen classes. The resulting projections, as well as their first and second derivatives over time, are used as features for automatic speechreading. The proposed method is applied to single-speaker, multi-speaker, and speaker-independent visual-only recognition tasks, consistently outperforming principal component analysis and discrete wavelet transform based visual features. Specific issues relevant to LDA are also discussed, namely, class selection, automatic data class labelling, and dimensionality reduction prior to LDA.
Citations: 35
Thai polysyllabic word recognition using fuzzy-neural network
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-11-30 DOI: 10.1109/MMSP.1998.738925
C. Wutiwiwatchai, S. Jitapunkul, V. Ahkuputra, E. Maneenoi, S. Luksaneeyanawin
Abstract: A fuzzy-neural network (fuzzy-NN) model was proposed for speaker-independent Thai polysyllabic word recognition. Fuzzy features converted from exact features were used to be input of multilayer perceptron (MLP) neural network. Various fuzzy membership functions on linguistic properties were used for fuzzy conversion and compared together. The binary desired outputs were used during training. 70 Thai words consist of ten numerals, the others were single-syllable, double-syllable and triple-syllable, 20 words in each group, were used for system evaluation. In order to improve recognition accuracy, number of syllable and tonal level detected were conducted for speech preclassification. The Pi fuzzy membership function provided the best recognition accuracy among other functions; trapezoidal, and triangular function. Under an optimal condition, the achieved recognition error rates were 5.6% on dependent test and 6.7% on independent test, which were respectively 3.3% and 3.4% decreasing from the conventional neural network system.
Citations: 4
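The abstract above compares triangular, trapezoidal, and Pi membership functions for converting exact features into fuzzy ones. A minimal sketch of the three shapes follows, using commonly cited definitions. The parameterizations, the function names, and the "low/medium/high" example are assumptions for illustration and may differ from what the authors used.

```python
def triangular(x, a, b, c):
    """Triangular membership: 0 outside (a, c), 1 at the peak b (a < b < c)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def trapezoidal(x, a, b, c, d):
    """Trapezoidal membership: 1 on [b, c], linear ramps on (a, b) and (c, d)."""
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    if x <= c:
        return 1.0
    return (d - x) / (d - c)


def pi_function(x, center, radius):
    """Pi membership: smooth bell that is 1 at `center` and 0 beyond `radius`."""
    dist = abs(x - center) / radius
    if dist >= 1.0:
        return 0.0
    if dist >= 0.5:
        return 2.0 * (1.0 - dist) ** 2
    return 1.0 - 2.0 * dist ** 2


def fuzzify(x, centers, radius):
    """Map one exact feature to memberships in hypothetical linguistic sets."""
    return {name: pi_function(x, c, radius) for name, c in centers.items()}


# Hypothetical linguistic properties for a feature scaled to [0, 1].
print(fuzzify(0.4, {"low": 0.0, "medium": 0.5, "high": 1.0}, radius=0.5))
```

Each exact feature becomes several membership values in [0, 1], so a fuzzified input vector is longer than the original one before it is passed to a classifier.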
Beyond query by example
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1998-09-01 DOI: 10.1145/290747.290800
S. Santini, R. Jain
Abstract: This paper considers some of the problems we found trying to extract meaning from images in database applications, and proposes some ways to solve them. We argue that the meaning of an image is an ill-defined entity, and it is not in general possible to derive from an image the meaning that the user of the database wants. Rather, we should be content with a correlation between the intended meaning and simple perceptual clues that databases can extract. Rather than working on the impossible task of extracting unambiguous meaning from images, we should provide the user with the tools he needs to drive the database in the areas of the feature space where "interesting" images are.
Citations: 88
Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm
1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175) Pub Date : 1900-01-01 DOI: 10.1109/MMSP.1998.738912
Satoshi Nakamura, E. Yamamoto, K. Shikano
Abstract: We investigate methods using the hidden Markov model (HMM) to drive a lip movement sequence with input speech. We have already investigated a mapping method based on the Viterbi decoding algorithm which converts an input speech to a lip movement sequence through the most likely HMM state sequence conducted by audio HMMs. However, the method contains a substantial problem of producing errors along incorrectly decoded HMM states. This paper newly proposes a method to re-estimate the visual parameters using the HMMs of the audio-visual joint probability under the expectation-maximization (EM) algorithm. In experiments, the proposed mapping method using the EM algorithm shows an error reduction of 26% compared to a method using the Viterbi algorithm at incorrectly decoded bi-labial consonants.
Citations: 5