{"title":"Deftpack: A Robust Piece-Picking Algorithm for Scalable Video Coding in P2P Systems","authors":"R. Petrocco, Michael Eberhard, J. Pouwelse, D. Epema","doi":"10.1109/ISM.2011.52","DOIUrl":"https://doi.org/10.1109/ISM.2011.52","url":null,"abstract":"The volume of Internet video is growing, and is expected to exceed 57 percent of global consumer Internet traffic by 2014. Peer-to-Peer technology can help deliver this massive volume of traffic in a cost-efficient, scalable, and reliable manner. However, single-bit-rate streaming is not sufficient given today's device and network connection diversity. A possible solution to this problem is provided by layered coding techniques, such as Scalable Video Coding, which address this diversity by providing content in various qualities within a single bit stream. In this paper we propose a new self-adapting piece-picking algorithm for downloading layered video streams, called Deftpack. Our algorithm significantly reduces the number of stalls, minimizes the frequency of quality changes during playback, and maximizes the effective usage of the available bandwidth. Deftpack is the first algorithm specifically crafted to take all three of these quality dimensions into account simultaneously, thus increasing the overall quality of experience. Additionally, Deftpack can be integrated into BitTorrent-based P2P systems and thus has the potential for large-scale deployment. Our results from realistic swarm simulations show that Deftpack significantly outperforms previously proposed algorithms for retrieving layered content when all three quality dimensions are taken into account.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130074725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Masking Effect of Out-of-sync News Content with Different Distractors","authors":"S. Buchinger","doi":"10.1109/ISM.2011.86","DOIUrl":"https://doi.org/10.1109/ISM.2011.86","url":null,"abstract":"The major aim of this paper is to detect possible masking effects across different media types in a realistic scenario. For this purpose, we presented out-of-sync news clips containing none, one, or several additional content elements, such as a speaker only, or a narrator complemented by a picture, a video, a ticker, background music, or noise, to several evaluators. Consistent with several previous studies, we observed that out-of-sync errors are perceived much earlier when audio presentation precedes video. Furthermore, users prefer simple formats, i.e., a news speaker only, as long as the synchronization errors are low enough not to be noticed consciously. As soon as the time difference between the playback of the visual and audible streams becomes perceivable, viewers can be distracted by using a large number of different content elements. Using only one distraction element is not sufficient to produce a masking effect.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116515907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Audio Quality Assessment Improvement via Circular and Flexible Overlap","authors":"Mengyao Zhu, Jia Zheng, Xiaoqing Yu, W. Wan","doi":"10.1109/ISM.2011.17","DOIUrl":"https://doi.org/10.1109/ISM.2011.17","url":null,"abstract":"This paper proposes an improved audio quality metric via circular and flexible overlap. Based on power spectrum estimation via circular overlap, we use a novel circular-overlap sub-frame to assess highly impaired audio. The relationship between the fraction of overlap and the audio quality metric is also examined, and it is shown that the accuracy of audio quality assessment increases with the fraction of overlap. By integrating circular and flexible overlap into ITU-R BS.1387, also known as Perceptual Evaluation of Audio Quality, our method can be applied to the quality assessment of highly impaired audio.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128317276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Regions of Interest Extraction Based on Visual Saliency in Compressed Domain","authors":"L. Sui, Jing Zhang, L. Zhuo, Yuncong Yang","doi":"10.1109/ISM.2011.107","DOIUrl":"https://doi.org/10.1109/ISM.2011.107","url":null,"abstract":"Recently, the bag-of-words (BoW) model, which has been widely used in textual information processing, has been extended to many tasks in the visual domain, such as image classification, scene analysis, image annotation, and image retrieval, in the form of the bag-of-visual-words (BoVW) model. It is therefore essential to create an effective visual vocabulary. Most existing approaches create visual vocabularies from images in the pixel domain, which requires extra processing time to decompress images, since most images are stored in compressed formats. In this paper we propose to create a visual vocabulary based on the Scale Invariant Feature Transform (SIFT) descriptor in the compressed domain with the following three steps: (1) constructing low-resolution images in the compressed domain, (2) extracting SIFT descriptors from the low-resolution images, and (3) creating a visual vocabulary based on the extracted SIFT descriptors. In order to evaluate the performance of the visual words, experiments have been conducted on identifying pornographic images. Experimental results indicate that the proposed method can recognize pornographic images accurately with much reduced computational time.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126991307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Utterance Rate Feedback for Enhancing Mealtime Communication","authors":"Kyohei Ogawa, Toshiki Takeuchi, Kunihiro Nishimura, T. Tanikawa, M. Hirose","doi":"10.1109/ISM.2011.67","DOIUrl":"https://doi.org/10.1109/ISM.2011.67","url":null,"abstract":"The purpose of this research is to support users' mealtime communication by changing their utterance rate during meals. By logging and analyzing the utterance rate, or the ratio of utterance length to mealtime length, we determined the relationship between the utterance rate and the distribution of the community to which each meal companion belonged. Using this relationship, we developed a real-time feedback system that presents a pre-estimated utterance rate together with the current rate during a meal. This utterance rate is estimated from logged utterance rate data and input data about the members at the table. To evaluate our system, we asked six users to use it. We found that users almost always wanted to increase their utterance rate, and that their utterance rates increased as desired after using our feedback system.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115600244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A QoE Evaluation Methodology for HD Video Streaming Using Social Networking","authors":"B. Gardlo, M. Ries, M. Rupp, R. Jarina","doi":"10.1109/ISM.2011.43","DOIUrl":"https://doi.org/10.1109/ISM.2011.43","url":null,"abstract":"A novel methodology for QoE evaluation in the social network environment is proposed. It provides high applicability for subjective testing of multimedia services with respect to real usage scenarios. The social network environment also provides significant demographic data and the ability to reach a very large number of test subjects, while allowing specific social groups to be targeted or filtered. QoE results for HD Internet video services are presented, followed by a discussion of their statistical significance.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124112072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TSF-Slider: Combining Time- and Structure-Based Media Navigation in One Navigation Component","authors":"Sebastian Pospiech, R. Mertens, Martin E. Muller, M. Ketterl","doi":"10.1109/ISM.2011.59","DOIUrl":"https://doi.org/10.1109/ISM.2011.59","url":null,"abstract":"Most state-of-the-art interfaces for multimedia browsing come with two inherently different navigation components: a time-based slider interface and a structure-based overview. This demonstration introduces the Tempo-Structural-Fisheye-Slider (TSF-Slider). The TSF-Slider combines time- and structure-based navigation in one single navigation component. To merge time- and structure-based navigation information, a fisheye-based approach is used. The idea of fisheye visualization is extended to rescale time-based information so that it can be merged with structural information while maintaining the general advantages of fisheye visualizations: focusing on one area of interest while still maintaining a general overview. The main motivation for the TSF-Slider is, however, that it brings together the advantages of time- and structure-based navigation. The TSF-Slider is implemented in the context of the virtPresenter web lecture framework.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122990231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Similarity-Based Visualization for Image Browsing Revisited","authors":"Klaus Schöffmann, David Ahlström","doi":"10.1109/ISM.2011.76","DOIUrl":"https://doi.org/10.1109/ISM.2011.76","url":null,"abstract":"We investigate whether users' visual search performance in a commonly used grid-like arrangement of images (i.e., a storyboard) can be improved by using a similarity-based sorting of images. We propose a simple but efficient algorithm for sorting images based on their color similarity. The algorithm generates an intuitive arrangement of images and allows for general application with several different layouts (e.g., storyboard, simple row/column, 3D globe/cylinder). In contrast to previous work, which rarely presents results from user studies, we perform a fair user study and compare an interface with color-sorted images to an interface with images positioned in a random order. Both interfaces use exactly the same screen real estate and interaction means. Results show that users are 20% faster with the sorted interface.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115423652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multimodal Event Detection in User Generated Videos","authors":"Francesco Cricri, Kostadin Dabov, I. Curcio, Sujeet Mate, M. Gabbouj","doi":"10.1109/ISM.2011.49","DOIUrl":"https://doi.org/10.1109/ISM.2011.49","url":null,"abstract":"Nowadays, most camera-enabled electronic devices contain various auxiliary sensors such as accelerometers, gyroscopes, compasses, GPS receivers, etc. These sensors are often used during media acquisition to limit camera degradations such as shake, and also to provide basic tagging information such as the location used in geo-tagging. Surprisingly, exploiting the sensor-recording modality for high-level event detection has been a subject of rather limited research, further constrained to highly specialized acquisition setups. In this work, we show how these sensor modalities, alone or in combination with content-based analysis, allow inferring information about the video content. In addition, we consider a multi-camera scenario, where multiple user-generated recordings of a common scene (e.g., music concerts, public events) are available. In order to understand some higher-level semantics of the recorded media, we jointly analyze the individual video recordings and sensor measurements of the multiple users. The detected semantics include generic interesting events and some more specific events. The detection exploits correlations in the camera motion and in the audio content of multiple users. We show that the proposed multimodal analysis methods perform well on various recordings obtained in real live music performances.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122878055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Bird Species Identification for Large Number of Species","authors":"Marcelo Teider Lopes, Lucas L. Gioppo, Thiago T. Higushi, Celso A. A. Kaestner, C. Silla, Alessandro Lameiras Koerich","doi":"10.1109/ISM.2011.27","DOIUrl":"https://doi.org/10.1109/ISM.2011.27","url":null,"abstract":"In this paper we focus on the automatic identification of bird species from their recorded songs. Bird monitoring is important for several tasks, such as evaluating the quality of their living environment or monitoring dangerous situations caused by birds near airports. We deal with the bird species identification problem using signal processing and machine learning techniques. First, features are extracted from the recorded bird songs using specific audio processing; next, identification is performed according to a classical machine learning scenario, where a labeled database of previously known bird songs is employed to create a decision procedure that is used to predict the species of a new bird song. Experiments are conducted on a dataset of recorded songs of bird species that appear in a specific region. The experimental results compare the performance obtained in different situations, encompassing the complete audio signals, as recorded in the field, and short audio segments (pulses) obtained from the signals by a split procedure. The influence of the number of classes (bird species) on the identification accuracy is also evaluated.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114787808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}