Seventh IEEE International Symposium on Multimedia (ISM'05): Latest Papers, Page 6

Melody extraction on MIDI music files

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.77

Giyasettin Ozcan, Cihan Isikhan, A. Alpkocak

Citations: 34

Video broadcasting using overlay multicast

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.119

D. Milic, M. Brogle, T. Braun

Citations: 11

On the usefulness of object shape coding with MPEG-4

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.87

A. Prati, R. Cucchiara

Citations: 1

A pitch-based rapid speech segmentation for speaker indexing

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.17

Min Yang, Yingchun Yang, Zhaohui Wu

Abstract: Segmentation of continuous audio is an important processing in many applications. In speaker indexing, the reliability of speaker model depends much on segmentation. Commonly used methods are based on the Bayesian information criteria (BIC), which is however not so capable when dealing with short utterances. In this paper, we present a pitch-based speech segmentation method, which can detect frequent speaker changes accurately and rapidly. In our algorithm, pitch is introduced in speaker segmentation. Firstly, utterance segments are detected by pitch. Then distances of pitch are computed, and compared with a self-adaptable threshold. Speaker changes are finally decided among utterance segments. We applied our method and three comparative methods on the HUB4-NE broadcast data. Speaker indexing experiments have been taken following each algorithm. We also suggested two indicators as complements of false alarm and missing rate in the evaluation of segmentation. The experiment results show that our algorithm works faster and better, with most of short time speaker changes detected. Speaker indexing equal error rate of our method is 10.43%, which is much lower than 12.94%, 25.84% and 15.91% of other methods.

Citations: 12

Striping delay-sensitive packets over multiple burst-loss channels with random delays

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.110

Gene Cheung, P. Sharma, Sung-Ju Lee

Citations: 7

Generating MPEG-21 BSDL descriptions using context-related attributes

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.63

D. De Schrijver, W. De Neve, K. De Wolf, R. Van de Walle

Citations: 12

Framework and network based multimedia object management environment

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.61

Chris Germano, Taehyung Wang, A. Onoma

Citations: 0

Automatically generating user interfaces for device federations

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.38

E. Braun, M. Mühlhäuser

Citations: 2

Fast adaptive inter mode decision method in H.264 based on spatial correlation

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.58

Bin Feng, G.X. Zhu, Wen-yu Liu

Citations: 8

Web-based video editing system for sharing clips collected from multi-users

Seventh IEEE International Symposium on Multimedia (ISM'05) Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.123

Satoshi Ichimura, Y. Matsushita

Citations: 1