{"title":"The ITEA project EUROPA, a software platform for digital CE appliances","authors":"J. Gelissen","doi":"10.1109/ICCE.2002.1013910","DOIUrl":"https://doi.org/10.1109/ICCE.2002.1013910","url":null,"abstract":"The ITEA (Information Technology for European Advancement, an eight year funding program labeled by EUREKA) project EUROPA (End User Resident Open Platform Architecture) started in October 1999 for a duration of 2.5 years in the first call of the ITEA program. The project aims at the definition of a high-end set-top-box (StB) reference architecture with extended functionality to enable next-generation services for the DVB-MHP (Digital Video Broadcast-Multimedia Home Platform) system. The next generation high-end StBs will boost the possibilities of interactive services in digital broadcasting into new dimensions. This approach includes the adoption of new standards like MPEG-4 and MPEG-2, cryptography for secure online-banking and online-shopping as well as agent technology for advanced and attractive user interfaces. The results of the project will be validated in demonstrators and the resulting reference architecture will be made publicly available. The project operates in close co-operation with standardization bodies like MPEG and DVB. The paper introduces the project in more detail with emphasis on the functionality extension aspects and the demonstrator scenario leading to the assessment of the defined functionality.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129658099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech bandwidth extension","authors":"H. Gustafsson, I. Claesson, U. Lindgren","doi":"10.1109/ICME.2001.1237845","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237845","url":null,"abstract":"A common narrow-band speech signal is expanded into a wide-band speech signal. The expanded signal gives the impression of a wide-band speech signal regardless of what type of vocoder is used in a receiver. The robust techniques suggested herein are based on speech acoustics and fundamentals of human hearing. That is the techniques extend the harmonic structure of the speech signal during voiced speech segments and introduce a linearly estimated amount of speech energy in the wide frequency-band. During unvoiced speech segments, a fricated noise may be introduced in the upper frequency-band.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124203774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A music similarity function based on signal analysis","authors":"B. Logan, Ariel Salomon","doi":"10.1109/ICME.2001.1237829","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237829","url":null,"abstract":"The present invention computer method and apparatus determines music similarity by generating a K-means (instead of Gaussian) cluster signature and a beat signature for each piece of music. The beat of the music is included in the subsequent distance measurement.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117292933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Object segmentation with affine motion similarity measure","authors":"Hong Li, Weisi Lin, B. Tye, E. Ong, C. Ko","doi":"10.1109/ICME.2001.1237853","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237853","url":null,"abstract":"Affine motion model is widely used in motion segmentation. Accurately estimating and evaluating affine parameters are two key problems in such kind of approaches. This paper tries to address these issues by presenting an effective video object segmentation method based on affine motion similarity measure. The image is firstly segmented into irregularly-shaped intensity homogenous regions. Then, relatively reliable affine parameters for each region are estimated by a robust motion estimator according to the individual coordinate system of each region. Finally, a new motion similarity measure and merge process is applied to obtain meaningful objects. Experimental results demonstrate the effectiveness of the proposed method.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123097363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design collaboration based on video user scenarios for VDSL systems","authors":"K. Yamazaki, K. Hirano","doi":"10.1109/ICME.2001.1237956","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237956","url":null,"abstract":"This paper focuses on a design method suitable for design collaboration during the creation of information appliances. For design collaboration by various professionals, the proposal method utilizes video user scenarios and VDSL (Very High- Bit-Rate DSL) technology. VDSL technology has the capability to transmit multichannel video data utilizing exciting telephone lines. The proposed design process is starting from the making video user scenario by the lead designer and he acts out the user scenario in front of a video camera to collaborate other professionals through the VDSL system. The proposed design system uses VDSL technology and an application software for video user scenarios.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"06 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127272947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Jinzenji, Hiroshi Watanabe, Shigeki Okada, N. Kobayashi
{"title":"MPEG-4 very low bit-rate video compression using sprite coding","authors":"K. Jinzenji, Hiroshi Watanabe, Shigeki Okada, N. Kobayashi","doi":"10.1109/ICME.2001.1237641","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237641","url":null,"abstract":"This paper focuses on the “sprite coding” that supports the MPEG-4 Version 1 Main profile in order to transfer “near VHS quality video “ across narrow-band transmission links such as the Internet. Automatic VOP (Video Object Plane) generation technologies are being studied as one of the most important issues of MPEG-4 object coding. This paper proposes a two-layer VOP generation scheme with some core algorithms such as GME (Global Motion Estimation), foreground moving object extraction, and background sprite generation. This paper also describes a shape information reduction method for the foreground object. The foreground object is object-coded in the , while the background sprite is coded using sprite coding in MPEG-4. We call this coding scheme “sprite mode”; MPEG-4 simple profile coding is called “normal mode”. Experiments are conducted on VOP generation and video coding with MPEG-4. We compare sprite mode to normal mode. The coding efficiency of sprite mode is several times higher than that of normal mode at the same objective image quality if the foreground ratio is within 10-15%. Given the target of very low bitrate (128kbps, 64kbp) rate coding, sprite mode achieved almost the same SNR but more than twice the frame rate compared to normal mode.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"27 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123553434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multimodal tracking for smart videoconferencing","authors":"D. Zotkin, R. Duraiswami, H. Nanda, L. Davis","doi":"10.1109/ICME.2001.1237649","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237649","url":null,"abstract":"Many interactive multimedia applications require the ability to track the 3-D motion of participants in a room. Particle filters are attractive for this since they do not require solution of the inverse problem of obtaining the state from measurements, and since the tracking can be easily extended to integrate multimodal measurements. We extend our previous work on smart videoconferencing to include a multimodal tracker of the session participants using multiple cameras and microphone arrays. We verify the correctness and robustness of the multimodal tracker using synthetic and real data. We also present practical details of how such a system can be implemented using off-the-shelf hardware and computers.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122876661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis and architecture design of JPEG2000","authors":"Liang-Gee Chen, Chung-Jr Lian, Kuanfu Chen, Hong-Hui Chen","doi":"10.1109/ICME.2001.1237693","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237693","url":null,"abstract":"Analysis and architecture design of the key modules in JPEG2000 are presented in this paper. For Discrete Wavelet Transform (DWT), a lifting based DWT core for the default 5-3 and 9-7 filters in part I of JPEG2000 is proposed. Folded architecture is adopted in DWT to reduce the hardware cost and to achieve the higher hardware utilization. For Embedded Block Coding with Optimized Truncation (EBCOT), column-based coding architecture of Tier-1 block coding engine is proposed. The context formation efficiency is increased by adopting two speedup methods. The computation cycle of the block coding engine is reduced to about 40% of previous work.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"133 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126980156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A robust and fast watermarking scheme for compressed audio","authors":"Changsheng Xu, Yongwei Zhu, D. Feng","doi":"10.1109/ICME.2001.1237687","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237687","url":null,"abstract":"This paper proposes a method to embed and extract the watermark into and from digital compressed audio. The watermark is embedded in partially uncompressed domain and the embedding scheme is high related to audio content. The watermark embedding can be done very fast. The experimental result illustrates that the embedded watermark can survive the decoding and reencoding process.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129145775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Shallow copying of multimedia sources for virtual documents","authors":"K. Hewagamage, N. Jayawardana, M. Hirakawa","doi":"10.1109/ICME.2001.1237910","DOIUrl":"https://doi.org/10.1109/ICME.2001.1237910","url":null,"abstract":"In this paper, we present a technique called shallow copying on multimedia sources that can be done in a new authoring mechanism called virtual authoring. It can be used to create new documents by customizing and integrating existing multimedia documents without replicating them. Hence this mechanism provides the architecture to develop extended information layers on top of existing collection of multimedia documents.","PeriodicalId":405589,"journal":{"name":"IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.","volume":"256 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121343255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}