2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB): Latest Articles, Page 6

Eyeball Movement Model for Lecturer Character in Speech-Driven Embodied Group Entrainment System

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.99

Yoshihiro Sejima, Tomio Watanabe, M. Jindai, Atsushi Osa

Citations: 0

Visual Quality and File Size Prediction of H.264 Videos and Its Application to Video Transcoding for the Multimedia Messaging Service and Video on Demand

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.62

Didier Joset, S. Coulombe

Pages: 321-328

Abstract: In this paper, we address the problem of adapting video files to meet terminal file size and resolution constraints while maximizing visual quality. First, two new quality estimation models are proposed, which predict quality as a function of resolution, quantization step size, and frame rate. The first model is generic and the second takes video motion into account. Then, we propose a video file size estimation model. Simulation results show a Pearson correlation coefficient (PCC) of 0.956 between the mean opinion score and our generic quality model (0.959 for the motion-conscious model). We obtain a PCC of 0.98 between actual and estimated file sizes. Using these models, we estimate the combination of parameters that yields the best video quality while meeting the target terminal's constraints. We obtain an average quality difference of 4.39% (generic model) and 3.22% (motion-conscious model) when compared with the best theoretical transcoding possible. The proposed models can be applied to video transcoding for the Multimedia Messaging Service and for video on demand services such as YouTube and Netflix.
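The parameter selection step mentioned in the abstract above can be illustrated with a small exhaustive search. This is only a sketch under stated assumptions: `estimate_quality` and `estimate_size_kb` are hypothetical placeholder formulas, not the models proposed in the paper, and the candidate grids are made up for illustration.

```python
from itertools import product


def estimate_quality(scale, qstep, fps):
    """Hypothetical stand-in for a quality model (NOT the paper's model).

    Quality rises with resolution scale and frame rate and falls as the
    quantization step size grows.
    """
    return scale * (fps / 30.0) / (1.0 + qstep / 10.0)


def estimate_size_kb(scale, qstep, fps, duration_s=10.0):
    """Hypothetical stand-in for a file size model, in kilobytes."""
    bitrate_kbps = 20000.0 * scale ** 2 * (fps / 30.0) / qstep
    return bitrate_kbps * duration_s / 8.0


def best_parameters(max_size_kb, scales, qsteps, frame_rates):
    """Return the (scale, qstep, fps) combination with the highest estimated
    quality whose estimated size fits the limit, or None if nothing fits."""
    feasible = [
        (s, q, f)
        for s, q, f in product(scales, qsteps, frame_rates)
        if estimate_size_kb(s, q, f) <= max_size_kb
    ]
    if not feasible:
        return None
    return max(feasible, key=lambda p: estimate_quality(*p))


if __name__ == "__main__":
    choice = best_parameters(300, [0.5, 1.0], [10, 20, 40, 80], [15, 30])
    print("selected (scale, qstep, fps):", choice)
```

With real estimators plugged in, the same search structure would pick the transcoding parameters for a given terminal; an exhaustive scan is practical here only because the candidate grid is small.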

Citations: 2

Evaluation of Image Browsing Interfaces for Smartphones and Tablets

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.11

Marco A. Hudelist, Klaus Schöffmann, David Ahlström

Citations: 4

Requirements for Mobile Learning Applications in Higher Education

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.94

André Klassen, Marcus Eibrink-Lunzenauer, Till Gloggler

Citations: 18

Efficient Content-Based Multimedia Retrieval Using Novel Indexing Structure in PostgreSQL

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.96

Fausto Fleites, Shu‐Ching Chen

Citations: 2

A JND Profile Based on Hierarchically Selective Attention for Images

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.50

Dongdong Zhang, Lijing Gao, D. Zang, Yaoru Sun, Jiujun Cheng

Pages: 263-266

Abstract: Most traditional just-noticeable-distortion (JND) models in the pixel domain compute the JND threshold by incorporating the spatial luminance adaptation effect and the texture contrast masking effect. Recently, with the rapid development of computable models of visual attention, researchers have started to improve the JND model by considering the visual saliency of images. A foveated spatial JND model (FSJND) was proposed that incorporates the traditional visual characteristics and the fovea characteristic of human eyes to enhance JND thresholds. However, the thresholds computed by the FSJND model may be overestimated for some high-resolution images. In this paper, we propose a new JND profile in the pixel domain, in which a multi-level modulation function is built to reflect the effect of hierarchically selective visual attention on JND thresholds. Contrast masking is also considered in our modulation function to obtain more accurate JND thresholds. Compared with the latest JND profiles, the proposed model can tolerate more distortion and has much better perceptual quality. The proposed JND model can be easily applied in many areas, such as compression and error protection.

Citations: 0

Improved Visibility of Single Hazy Images Captured in Inclement Weather Conditions

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.51

Bo-Hao Chen, Shih-Chia Huang

Citations: 7

Relevance Segmentation of Laparoscopic Videos

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.22

Bernd Münzer, Klaus Schöffmann, L. Böszörményi

Citations: 40

Speeded-Up Video Summarization Based on Local Features

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.70

Javier Iparraguirre, C. Delrieux

Citations: 13

A Video Text Detection and Tracking System

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.106

Tuoerhongjiang Yusufu, Yiqing Wang, Xiangzhong Fang

Pages: 522-529

Abstract: Faced with increasingly large-scale video databases, retrieving videos quickly and efficiently has become a crucial problem. Video text, which carries high-level semantic information, is an important source of information for this task. In this paper, we introduce a video text detection and tracking approach. With these methods we can obtain clear binary text images, which can be processed directly by OCR (Optical Character Recognition) software. Our approach includes two parts: a stroke-model-based video text detection and localization method, and a SURF (Speeded Up Robust Features) based text region tracking method. In our detection and localization approach, we use a stroke model and morphological operations to roughly identify candidate text regions, then combine the stroke map and edge response to localize text lines in each candidate text region. Several heuristics and an SVM (Support Vector Machine) are used to verify text blocks. The core of our text tracking method is a fast approximate nearest-neighbour search algorithm for extracted SURF features. The text-ending frame is determined from the number of SURF feature points, while text motion estimation is based on correct matches in adjacent frames. Experimental results on a large number of different video clips show that our approach can effectively detect and track both static and scrolling text.
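The idea of estimating text motion from feature matches in adjacent frames, as mentioned in the abstract above, can be sketched as follows. This is an illustrative assumption rather than the authors' method: it takes already matched keypoint coordinates as input (feature extraction and nearest-neighbour matching are out of scope here), and it uses the median displacement as a simple way to tolerate a few wrong matches. The function name `estimate_translation` is hypothetical.

```python
from statistics import median


def estimate_translation(matches):
    """Estimate the (dx, dy) shift of a text region between two frames.

    `matches` is a list of ((x_prev, y_prev), (x_next, y_next)) pairs of
    matched keypoint coordinates. The median of the per-match displacements
    is used so that a minority of incorrect matches does not skew the result.
    Returns None when there are no matches.
    """
    if not matches:
        return None
    dx = median(nxt[0] - prev[0] for prev, nxt in matches)
    dy = median(nxt[1] - prev[1] for prev, nxt in matches)
    return (dx, dy)


if __name__ == "__main__":
    # Three consistent matches for text scrolling left by 3 pixels,
    # plus one obviously wrong match.
    sample = [
        ((0, 0), (-3, 0)),
        ((10, 5), (7, 5)),
        ((4, 4), (1, 4)),
        ((2, 2), (40, 30)),
    ]
    print("estimated shift:", estimate_translation(sample))
```

A displacement near zero would suggest static text, while a steady non-zero shift across frames would suggest scrolling text.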

Citations: 19