2006 IEEE Workshop on Multimedia Signal Processing: Latest Articles

3D Shape Reconstruction of Moving Object By Tracking the Sparse Singular Points
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285295
H. Ebrahimnezhad, H. Ghassemian
Abstract: In this paper, we propose a method to reconstruct the 3D shape of an object from its silhouettes as it undergoes rigid motion. The moving object is captured by two cameras over time. A robust curve stereo matching algorithm is employed to extract the precise locations of some singular points in any sequence. The motion of the object is estimated by tracking these points, so that a large number of virtual cameras can be constructed for the moving object over time. Finally, the silhouette cones of all virtual cameras are intersected to extract a fine visual hull. In the proposed method, the quality of reconstruction is improved by fusing the advantages of silhouette, motion and stereo. Because it uses curve matching instead of color matching, our method is less sensitive to color adjustment between cameras and to illumination changes of the light source. Our method is also applicable to low-texture objects.
Citations: 3
JPEG Steganalysis Using Empirical Transition Matrix in Block DCT Domain
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285320
Dongdong Fu, Y. Shi, D. Zou, Guorong Xuan
Abstract: This paper presents a novel steganalysis scheme to effectively attack JPEG steganographic schemes. The proposed method exploits the correlations between block-DCT coefficients in both the intra-block and inter-block sense. We use Markov empirical transition matrices to capture these dependencies. The experimental results demonstrate that the proposed scheme is superior to existing steganalyzers in attacking OutGuess, F5, and MB1.
Citations: 71
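The entry above refers to Markov empirical transition matrices. As a rough illustration of that general idea only (not the authors' feature extractor), the sketch below estimates a first-order transition matrix from a one-dimensional integer sequence. The function name `transition_matrix`, the clipping threshold `T`, and the use of a plain 1-D sequence in place of arrays of block-DCT coefficients are all assumptions made for this example.

```python
def transition_matrix(seq, T=2):
    """Empirical first-order Markov transition matrix of an integer sequence.

    Values are clipped to [-T, T] (an assumed threshold). Row i holds the
    estimated probabilities of the next value given current value i - T.
    """
    clipped = [max(-T, min(T, v)) for v in seq]
    size = 2 * T + 1
    counts = [[0] * size for _ in range(size)]
    # Count every (current, next) pair of neighbouring values.
    for cur, nxt in zip(clipped, clipped[1:]):
        counts[cur + T][nxt + T] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        # Rows for states that never occur stay all-zero.
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix
```

In a steganalysis setting, the entries of such a matrix would typically be used as a feature vector for a classifier; how the paper forms its sequences within and across blocks is not specified in the abstract.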
A Novel Motion Compensated Frame Interpolation Based on Block-Merging and Residual Energy
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285338
Ai-Mei Huang, Truong Q. Nguyen
Abstract: In this paper, a novel motion compensated frame interpolation (MCFI) algorithm is proposed that merges blocks with unreliable motion vectors (MVs) based on their residual errors. Unlike conventional methods that find true motion using smaller blocks and a vector median filter, we propose to find one single motion vector to represent a group of adjacent macroblocks (MBs) where conventional MCFI methods are likely to fail. The proposed method is able to preserve the structure of different objects and their edge information without requiring complicated edge detection and object-based motion estimation. Experimental results show that the proposed scheme improves both visual quality and PSNR, especially in areas with different motions and at motion boundaries.
Citations: 15
On New Audio Codec Specifications
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285282
I. Varga
Abstract: This contribution presents the work in 3GPP on the standardization of a new audio codec for mobile multimedia applications, including packet-switched streaming (PSS), multimedia messaging (MMS) and multimedia broadcast/multicast service (MBMS). Design constraints, performance requirements, test plans, and selection rules were finalized first. Next, extensive subjective listening testing was conducted. The test results showed good performance for the enhanced AAC+ and AMR-WB+ candidates. Enhanced AAC+ and AMR-WB+ are recommended for the 3GPP Rel6 mobile multimedia services PSS, MMS, and MBMS. Both fixed-point and floating-point specifications are given in 3GPP in the form of C code for both encoder and decoder. Conformance testing methods are specified as well.
Citations: 0
Learning the Kernel Matrix for Superresolution
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285347
K. Ni, Sanjeev Kumar, Truong Q. Nguyen
Abstract: This paper proposes the application of learned kernels in support vector regression to superresolution in the discrete cosine transform (DCT) domain. Although previous works involve kernel learning, we examine their problem formulation in order to reformulate the semi-definite programming problem of finding the optimal kernel matrix. For the particular application to superresolution, downsampling properties derived in the DCT domain are exploited to add structure to the learning algorithm. The advantages of the proposed method over other learning-based superresolution algorithms include specificity with regard to image content, structured consideration of energy compaction, and the added degrees of freedom that regression has over classification-based algorithms.
Citations: 8
XM-flow: An Extensible Micro-flow for Multimodal Interaction
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285359
Li Li, W. Chou, Feng Liu, Fei Cao
Abstract: This paper presents a synchronization module in a multimodal dialogue system architecture based on the model-view-controller (MVC) pattern for human-computer interaction. The MVC pattern is based on a clear separation of objects into three categories: the model, for defining and maintaining data; the view, for rendering interactions based on the data; and the controller, for coordinating actions and events that affect the model and view(s). As part of our layered multimodal dialog system architecture, this synchronization module controls the synchronization of multiple modalities, such as speech, mouse and keyboard, by interpreting an XML document that incorporates SMIL and EMMA. It isolates the dialog model from complex presentations associated with different channels and user interfaces through the adoption of a generic object binding mechanism. This flexibility leads to enhanced design freedom in a multimodal dialog system architecture that supports client-based, server-based and distributed solutions.
Citations: 5
Embedded Julius: Continuous Speech Recognition Software for Microprocessor
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285334
H. Kokubo, Hiroaki Hataoka, Akinobu Lee, Tatsuya Kawahara, K. Shikano
Abstract: To extend CSR (continuous speech recognition) software to mobile environments, we have developed an embedded version of "Julius". Julius is open source CSR software and has been used by many researchers and developers in Japan as a standard decoder on PCs. Julius works as a real-time decoder on a PC. However, further reduction of computational cost is necessary to use Julius on a microprocessor. To reduce the cost of calculating pdfs (probability density functions), Julius adopts a GMS (Gaussian mixture selection) method. In this paper, we modify the GMS method to realize a continuous speech recognizer on microprocessors. This approach does not change the structure of the acoustic models, keeping it consistent with that used by conventional Julius, and enables developers to use acoustic models developed by popular modeling tools. In simulation, the proposed method achieved a 20% reduction of computational costs compared to conventional GMS and a 40% reduction compared to no GMS. Finally, the embedded version of Julius was tested on a developmental hardware platform named "T-engine". The proposed method achieved an RTF (real time factor) of 2.23, which is 79% of that without GMS, without any degradation of recognition performance.
Citations: 10
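The real time factor quoted in the entry above can be read with a little arithmetic. The sketch below assumes the usual definition of RTF (processing time divided by audio duration) and derives the RTF without GMS that the two reported figures imply; that derived value is an inference from the abstract, not a number the abstract states, and the function and variable names are illustrative.

```python
def real_time_factor(processing_seconds, audio_seconds):
    """RTF = time spent decoding / duration of the audio that was decoded."""
    return processing_seconds / audio_seconds

# Figures reported in the abstract.
rtf_modified_gms = 2.23   # RTF of the proposed method on the T-engine platform
ratio_to_no_gms = 0.79    # "79% of that of no GMS"

# Inferred, not reported: the RTF the same platform would show without GMS.
implied_rtf_no_gms = rtf_modified_gms / ratio_to_no_gms
```

Under this definition, an RTF above 1 means decoding takes longer than the audio lasts, so both figures describe slower-than-real-time operation on that hardware.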
Taxonomy in Fish Species Complexes: A Role for Multimedia Information
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285354
Huimin Chen, Shuqing Huang, H. Bart
Abstract: Biologists could make valuable use of the wealth of specimen information in natural history museum databases. "Taxonomy via the Internet" aims to build a centralized database where biologists can store, manipulate and retrieve biologically meaningful data from images of specimens and use the data to classify the specimens taxonomically. Multimedia information representation provides a new computational tool for extracting useful features from large databases of specimen images and has the potential to expedite the pace of taxonomic research. In this paper, we use a taxonomic problem involving species of suckers in the genus Carpiodes to demonstrate the utility of this method. A logistic regression classifier with a fully automated feature selection procedure is compared with the best landmark-based classifier to illustrate how image quality affects classification accuracy. We discuss the need to create a multimedia database using images of specimens from a fish collection.
Citations: 2
4D Scalable Multi-View Video Coding Using Disparity Compensated View Filtering and Motion Compensated Temporal Filtering
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285268
Jens-Uwe Garbas, U. Fecker, Tobias Tröger, André Kaup
Abstract: In this paper, a novel framework for scalable multi-view video coding is described. A well-known wavelet-based scalable coding scheme for single-view video sequences has been adopted and extended to match the specific needs of scalable multi-view video coding. Motion compensated temporal filtering (MCTF) is applied to each video sequence of each camera. The use of a wavelet lifting structure guarantees perfect invertibility of this step, and as a consequence of its open-loop architecture, SNR and temporal scalability are attained. Correlations between the temporal subbands of adjacent cameras are reduced by a novel disparity compensated view filtering (DCVF) method, which is also lifting-based and open-loop to enable view scalability. Spatial scalability and entropy coding are achieved by the JPEG2000 spatial wavelet transform and EBCOT coding, respectively. Rate allocation along the temporal-view-filtered subbands is done by means of an RD-optimal algorithm. Experimental results show the high scaling capability in terms of SNR, temporal and view scalability.
Citations: 25
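The entry above states that a wavelet lifting structure guarantees perfect invertibility. A minimal sketch of why, using an integer Haar lifting step on a list of samples: the inverse simply undoes the update and predict steps in reverse order, so the rounding inside each step cancels exactly. This example omits motion and disparity compensation entirely, and the function names are hypothetical; it illustrates the lifting principle, not the codec described in the paper.

```python
def haar_lift_forward(samples):
    """Split an even-length list of ints into low-pass and high-pass bands."""
    even, odd = samples[0::2], samples[1::2]
    # Predict step: each odd sample is predicted from its even neighbour.
    high = [o - e for e, o in zip(even, odd)]
    # Update step: even samples are adjusted by half the prediction residual.
    low = [e + (h >> 1) for e, h in zip(even, high)]
    return low, high


def haar_lift_inverse(low, high):
    """Reconstruct the original samples by reversing the lifting steps."""
    even = [l - (h >> 1) for l, h in zip(low, high)]   # undo update
    odd = [h + e for e, h in zip(even, high)]          # undo predict
    samples = [0] * (2 * len(even))
    samples[0::2], samples[1::2] = even, odd
    return samples
```

Because the inverse recomputes exactly the same `h >> 1` term that the forward pass added, reconstruction is lossless even though the update step rounds.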
Segmentation of Epipolar-Plane Image Volumes with Occlusion and Disocclusion Competition
2006 IEEE Workshop on Multimedia Signal Processing Pub Date : 2006-10-01 DOI: 10.1109/MMSP.2006.285293
Jesse Berent, P. Dragotti
Abstract: Consider a dense array of cameras uniformly distributed along a line. A solid block of 3D data can be constructed by arranging the images into a stack. This volume, also known as the epipolar-plane image volume, contains highly structured data that can be segmented for object removal, insertion and compression. In this paper, we propose a segmentation scheme that takes full advantage of the known geometry in order to model occlusions explicitly as a result of disparity. Moreover, we include this knowledge in an energy minimization scheme based on region competition with active contours. Instead of extracting layers sequentially from front to back, each layer is made to compete with the regions it is going to occlude and the ones it is going to disocclude. This enables a virtually unsupervised segmentation.
Citations: 30