2005 IEEE International Conference on Multimedia and Expo: Latest Papers, Page 4

Telling Stories with MyLifeBits

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521726

J. Gemmell, Aleks Aris, Roger Lueder

Citations: 89

A method for extracting a musical unit to phrase music data in the compressed domain of TwinVQ audio compression

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521491

Motohiro Nakanishi, M. Kobayakawa, M. Hoshi, Tadashi Ohmori

Abstract: Phrasing music data into meaningful musical pieces (e.g., bars and phrases) is an important function for analyzing music data. To realize this function, we propose a method for extracting a unit of music data (a musical unit) in the compressed domain of TwinVQ audio compression (MPEG-4 audio). Our key idea is to extract a musical unit from a sequence of autocorrelation coefficients computed in the encoding step of TwinVQ audio compression. We call this sequence the "autocorrelation sequence r". We use the k-th autocorrelation sequence $r_k$ ($k = 1, 2, \ldots, 20$) of the music data to extract a musical unit. First, we calculate the $j_k$-th autocorrelation coefficient $a_k^{j_k}$ of the k-th autocorrelation sequence $r_k$ ($j_k = 38, 39, \ldots, 208$; $k = 1, 2, \ldots, 20$). Second, to detect the peak in the sequence ($a_k^{38}, a_k^{39}, \ldots, a_k^{208}$), we apply the Laplacian filter to it. We then obtain the order $p_k$ at which the maximum differential coefficient is attained. Finally, we compute the musical unit using $p_k$. To evaluate the method, we collected 64 pieces of music data and obtained autocorrelation sequences by applying the TwinVQ encoder to each. We then applied our extraction algorithm to each autocorrelation sequence. The experimental results reveal a very good performance in the extraction of a musical unit for phrasing music data.

Citations: 6
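
The peak-detection step described in the TwinVQ abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the filter coefficients, so the 3-tap negated Laplacian kernel, the function names `autocorrelation` and `peak_order`, and the toy impulse-train signal are all assumptions. In the paper the coefficients come from the TwinVQ encoder; here they are computed directly from a synthetic signal.

```python
# Illustrative sketch only; names and kernel are assumptions, not from the paper.

FIRST_LAG, LAST_LAG = 38, 208  # lag range quoted in the abstract


def autocorrelation(signal, lag):
    """Unnormalized autocorrelation of `signal` at the given lag."""
    return sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))


def peak_order(coeffs, first_lag=FIRST_LAG):
    """Return the lag whose negated discrete Laplacian response is largest.

    `coeffs[i]` is the autocorrelation coefficient at lag `first_lag + i`.
    The response 2*a[i] - a[i-1] - a[i+1] is large at sharp local maxima.
    """
    best_lag, best_response = None, float("-inf")
    for i in range(1, len(coeffs) - 1):
        response = 2 * coeffs[i] - coeffs[i - 1] - coeffs[i + 1]
        if response > best_response:
            best_lag, best_response = first_lag + i, response
    return best_lag


# Toy input (assumption): an impulse every 50 samples stands in for a beat.
signal = [1.0 if n % 50 == 0 else 0.0 for n in range(1000)]
coeffs = [autocorrelation(signal, lag) for lag in range(FIRST_LAG, LAST_LAG + 1)]
p = peak_order(coeffs)
print(p)  # the strongest peak sits at the impulse period, lag 50
```

The abstract does not say how the musical unit is derived from the order, so the sketch stops at the detected lag.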

Fast Motion Estimation by Motion Vector Merging Procedure for H.264

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521703

Kai-Chung Hou, Mei-Juan Chen, Ching-Ting Hsu

Citations: 21

A User-Oriented Multimodal-Interface Framework for General Content-Based Multimedia Retrieval

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521547

Jinchang Ren, T. Vlachos, V. Argyriou

Citations: 2

Robust learning-based TV commercial detection

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521382

Xiansheng Hua, Lie Lu, HongJiang Zhang

Citations: 91

Urban Traffic Control: A Streaming Multimedia Approach

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521499

C. Palau, M. Esteve, J. Martínez, B. Molina, I. Pérez-Llopis

Abstract: Urban traffic control systems have based their technological infrastructure on advanced analog closed-circuit television systems (TVCC) and point-to-point links, resulting in poorly scalable and very expensive systems. The main goal of an urban traffic monitoring system is to capture, send, play and distribute video information from the streets of a city. The ongoing digitization of video networks, together with research in the field of streaming media, has led vendors to offer proprietary hardware and software solutions, leaving their customers strongly dependent on them. The existence of open standards for video encoding and of protocols for streaming media transmission over IP networks has led us to propose this system. The work presents an open urban traffic control system whose design is based on a COTS philosophy for hardware and software, as well as on open-source and standardized protocols. The proposed system is a suitable solution in terms of scalability, cost, interoperability and performance for traffic control systems. Furthermore, its architecture can be easily adapted to other video applications and tools.

Citations: 7

Performance of Multiple Description Coding in Sensor Networks with Finite Buffers

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521707

E. Baccaglini, G. Barrenetxea, B. Beferull-Lozano

Citations: 4

Emotional Speech Classification Using Gaussian Mixture Models and the Sequential Floating Forward Selection Algorithm

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521717

D. Ververidis, Constantine Kotropoulos

Abstract: Emotional speech classification can be treated as a supervised learning task in which the statistical properties of emotional speech segments are the features and the emotional styles form the labels. The Akaike criterion is used to estimate automatically the number of Gaussian densities that model the probability density function of the emotional speech features. A procedure is proposed for reducing the computational burden of cross-validation in the sequential floating forward selection algorithm; it applies the t-test to the probability of correct classification for the Bayes classifier designed for various feature sets. For the Bayes classifier, the sequential floating forward selection algorithm is found to yield a probability of correct classification 3% higher than that of the sequential forward selection algorithm, whether or not gender information is taken into account. The experimental results indicate that utterances from isolated words and sentences are more emotionally colored than those from paragraphs. Without taking gender information into account, the probability of correct classification for the Bayes classifier attains a maximum when the probability density function of the emotional speech features extracted from the aforementioned utterances is modeled as a mixture of 2 Gaussian densities.

Citations: 75
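
As a rough illustration of the feature-selection procedure named in the abstract above, the sketch below implements plain sequential floating forward selection. It is a generic textbook-style version under stated assumptions, not the authors' code: the `criterion` callable stands in for the cross-validated probability of correct classification, the t-test speed-up proposed in the paper is not included, and the feature names and scores in the toy table are invented for the example.

```python
# Illustrative sketch only; the criterion, feature names, and scores are
# hypothetical and do not come from the paper.


def sffs(features, criterion, max_size):
    """Sequential floating forward selection.

    Returns a dict mapping subset size -> (best score, best subset found).
    `criterion` takes a list of features and returns a score to maximize.
    """
    max_size = min(max_size, len(features))
    selected = []
    best = {}
    while len(selected) < max_size:
        # Inclusion: add the feature that raises the criterion the most.
        candidates = [f for f in features if f not in selected]
        add = max(candidates, key=lambda f: criterion(selected + [f]))
        selected = selected + [add]
        score = criterion(selected)
        if len(selected) not in best or score > best[len(selected)][0]:
            best[len(selected)] = (score, list(selected))
        # Conditional exclusion: drop features while doing so beats the best
        # subset already recorded at the smaller size.
        while len(selected) > 2:
            drop = max(
                selected,
                key=lambda f: criterion([g for g in selected if g != f]),
            )
            reduced = [g for g in selected if g != drop]
            reduced_score = criterion(reduced)
            if reduced_score > best[len(reduced)][0]:
                selected = reduced
                best[len(reduced)] = (reduced_score, list(reduced))
            else:
                break
    return best


# Toy score table (invented): "energy" and "rate" are weak alone, strong together.
SCORES = {
    frozenset(["pitch"]): 0.60,
    frozenset(["energy"]): 0.50,
    frozenset(["rate"]): 0.40,
    frozenset(["pitch", "energy"]): 0.65,
    frozenset(["pitch", "rate"]): 0.62,
    frozenset(["energy", "rate"]): 0.90,
    frozenset(["pitch", "energy", "rate"]): 0.95,
}


def toy_criterion(subset):
    return SCORES[frozenset(subset)]


result = sffs(["pitch", "energy", "rate"], toy_criterion, max_size=3)
print(sorted(result[2][1]))  # ['energy', 'rate']
```

On this toy table, plain forward selection would keep the pair pitch and energy (score 0.65) at size 2; the exclusion step is what lets the search back up and record energy and rate (score 0.90) instead.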

An overview of technologies for e-meeting and e-lecture

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521593

B. Erol, Ying Li

Citations: 34

Current and Emerging Topics in Sports Video Processing

2005 IEEE International Conference on Multimedia and Expo Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521476

Xinguo Yu, D. Farin

Citations: 59