2017 IEEE International Conference on Multimedia and Expo (ICME): Latest Publications

QoE enhancement through cost-effective adaptation decision process for multiple-server streaming over HTTP
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019378
Joachim Bruneau-Queyreix, Mathias Lacaud, D. Négru, J. M. Batalla, E. Borcoci
Abstract: Single-source HTTP Adaptive Streaming (HAS) protocols, such as MPEG-DASH, have become the de-facto solutions to deliver video over the Internet. By avoiding buffer stalling events, which are mainly caused by a lack of throughput at the client or server side, HAS protocols increase the end-user's Quality of Experience (QoE). We propose to extend HAS capabilities into a pragmatic DASH-compliant Multiple-Source Streaming solution (MS-Stream) that utilizes several servers simultaneously. MS-Stream offers the opportunity to obtain higher QoE by exploiting expanded bandwidth and link diversity in heterogeneous distributed streaming infrastructures, such as distributed home gateways or geographically distributed set-top boxes belonging to Over-The-Top video service providers. This paper presents a cost-effective two-phase adaptation process with dual (i.e., bitrate and number of sources) adaptation decisions made prior to each segment request, followed by in-segment download adaptation. Our approach was empirically evaluated for on-demand video streaming over the Internet. An online demonstration is also made available [1].
Citations: 11
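The abstract outlines a two-phase process: a dual (bitrate, number of sources) decision before each segment request, followed by in-segment adaptation. As a rough illustration of the pre-request phase only, here is a minimal Python sketch of one plausible decision rule; the bitrate ladder, safety margin, and buffer threshold are invented for the example and are not the paper's algorithm.

```python
# Hypothetical pre-request dual adaptation decision: pick the highest
# sustainable bitrate and the fewest servers whose aggregate estimated
# throughput covers it with a safety margin. All thresholds are assumptions.
def choose_bitrate_and_sources(server_throughputs_kbps, buffer_s,
                               bitrates_kbps=(500, 1200, 2500, 5000),
                               safety=0.8, low_buffer_s=5.0):
    servers = sorted(server_throughputs_kbps, reverse=True)
    # Be more conservative when the playback buffer is nearly empty.
    margin = safety * (0.5 if buffer_s < low_buffer_s else 1.0)
    best = (bitrates_kbps[0], 1)           # fallback: lowest bitrate, 1 source
    for bitrate in bitrates_kbps:          # ascending bitrate ladder
        aggregate, used = 0.0, 0
        for thr in servers:
            aggregate += thr
            used += 1
            if aggregate * margin >= bitrate:
                best = (bitrate, used)     # sustainable with `used` sources
                break
    return best

print(choose_bitrate_and_sources([3000, 1500, 800], buffer_s=12.0))  # (2500, 2)
```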
Efficient image sensor noise estimation via iterative re-weighted least squares
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019427
Li Dong, Jiantao Zhou, Guangtao Zhai
Abstract: Noise estimation is crucial in many image processing algorithms, such as image denoising. Conventionally, noise is assumed to be a signal-independent additive white Gaussian process. However, for the real raw data of imaging sensors, the noise is better modeled as signal-dependent. In this work, we propose an efficient image sensor noise estimation method based on iterative re-weighted least squares optimization. Specifically, the image patches are first clustered into different groups, each of which generates a data sample. To fit those observations robustly, we introduce a weighting matrix that reflects the credibility of each sample. Unfortunately, the setting of this weighting matrix in turn depends on the unknown noise parameters. We therefore develop an iterative re-weighted least squares optimization procedure in which the weighting matrix and the parameter estimates are updated alternately. Experimental results show that our method outperforms state-of-the-art works in terms of both estimation accuracy and computational efficiency.
Citations: 5
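The alternation the abstract describes (weights depend on the unknown noise parameters; parameters are re-fit under the weights) is the classic IRLS loop. Below is a minimal NumPy sketch fitting a signal-dependent model var = a*mean + b from per-cluster (mean, variance) samples; the inverse-squared-prediction weighting is a standard choice assumed here, not necessarily the paper's credibility weights.

```python
# IRLS sketch for a signal-dependent noise model var(mu) = a*mu + b,
# under the assumption of variance-function weighting w_i = 1/pred_i^2.
import numpy as np

def irls_noise_fit(means, variances, n_iter=20, eps=1e-8):
    X = np.column_stack([means, np.ones_like(means)])  # design matrix [mu, 1]
    y = variances
    w = np.ones_like(y)                                # start unweighted
    for _ in range(n_iter):
        XtW = X.T * w                                  # == X.T @ diag(w)
        theta = np.linalg.solve(XtW @ X, XtW @ y)      # weighted LS solve
        pred = X @ theta
        w = 1.0 / np.maximum(pred, eps) ** 2           # re-weight by fit
    return theta                                       # (a, b)

rng = np.random.default_rng(0)
mu = rng.uniform(0.05, 0.9, 200)
true_var = 0.01 * mu + 0.002
var = true_var * rng.chisquare(50, 200) / 50           # noisy variance samples
print(irls_noise_fit(mu, var))                         # approx. [0.01, 0.002]
```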
Deep learning for multimodal-based video interestingness prediction
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019300
Yuesong Shen, C. Demarty, Ngoc Q. K. Duong
Abstract: Predicting the interestingness of media content remains an important but challenging research subject. The difficulty comes first from the fact that, besides being a high-level semantic concept, interestingness is highly subjective, and no global definition has yet been agreed upon. This paper presents the use of up-to-date deep learning techniques for solving the task. We perform experiments with both social-driven (i.e., Flickr videos) and content-driven (i.e., videos from the MediaEval 2016 interestingness task) datasets. To account for the temporal aspect and multimodality of videos, we tested various deep neural network (DNN) architectures, including a new combination of several recurrent neural networks (RNNs) that handles several temporal samples at the same time. We then investigated different strategies for dealing with unbalanced datasets. Multimodality, as the mid-level fusion of audio and visual information, brought benefit to the task. We also established that social interestingness differs from content interestingness.
Citations: 9
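To make the "mid-level fusion plus recurrent temporal modeling" idea concrete, here is an illustrative PyTorch sketch: per-frame visual and audio features are concatenated (mid-level fusion) and pooled over time by a GRU into a single interestingness score. The feature dimensions, hidden size, and single-GRU design are assumptions; the paper combines several RNNs.

```python
# Hedged sketch of an RNN-based interestingness predictor over fused
# per-frame audio-visual features. Dimensions are illustrative only.
import torch
import torch.nn as nn

class InterestingnessRNN(nn.Module):
    def __init__(self, visual_dim=2048, audio_dim=128, hidden=256):
        super().__init__()
        # Mid-level fusion: concatenate per-frame visual and audio features.
        self.rnn = nn.GRU(visual_dim + audio_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, visual, audio):
        x = torch.cat([visual, audio], dim=-1)   # (batch, time, features)
        _, h = self.rnn(x)                       # final hidden state
        return torch.sigmoid(self.head(h[-1]))   # interestingness in [0, 1]

model = InterestingnessRNN()
score = model(torch.randn(4, 16, 2048), torch.randn(4, 16, 128))
print(score.shape)  # torch.Size([4, 1])
```

For the unbalanced-data issue the abstract mentions, one common strategy is weighting the positive class in the loss (e.g., `BCEWithLogitsLoss(pos_weight=...)` in PyTorch), though the paper's chosen strategies are not detailed in the abstract.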
Impact of video resolution changes on QoE for adaptive video streaming
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019297
Avsar Asan, W. Robitza, I. Mkwawa, Lingfen Sun, E. Ifeachor, A. Raake
Abstract: HTTP adaptive streaming (HAS) has become the de-facto standard for video streaming to ensure continuous multimedia service delivery under irregularly changing network conditions. Many studies have already investigated the detrimental impact of various playback characteristics, such as initial loading, stalling, or quality variations, on the Quality of Experience (QoE) of end users. However, dedicated studies tackling the impact of resolution adaptation are still missing. This paper presents the results of an immersive audiovisual quality assessment test comprising 84 test sequences from four different video content types, emulated with an HAS adaptation mechanism. We employed a novel approach based on the systematic creation of adaptivity conditions, which were assigned to source sequences based on their spatio-temporal characteristics. Our experiment investigates the resolution switch effect with respect to the degradations in MOS for certain adaptation patterns. We further demonstrate that the content type and resolution change patterns have a significant impact on the perception of resolution changes. These findings will help develop better QoE models and adaptation mechanisms for HAS systems in the future.
Citations: 21
Recognition and retrieval of sound events using sparse coding convolutional neural network
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019552
Chien-Yao Wang, A. Santoso, S. Mathulaprangsan, Chin-Chin Chiang, Chung-Hsien Wu, Jia-Ching Wang
Abstract: This paper proposes a novel deep convolutional neural network (CNN), called the sparse coding convolutional neural network (SC-CNN), to address the sound event recognition and retrieval task. Unlike the general CNN framework, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorization procedure of the human brain, including encoding, storage, and recollection. Sound data from the RWCP sound scene dataset, with added noise from the NOISEX-92 noise dataset, are used to compare the performance of the proposed system with state-of-the-art baselines. The experimental results indicate that the proposed SC-CNN outperforms the state-of-the-art systems in sound event recognition and retrieval. In the sound event recognition task, the proposed system achieved an accuracy of 94.6%, 100%, and 100% under 0 dB, 10 dB, and clean conditions, respectively. In the retrieval task, the proposed system improves the mAP rate of the general CNN by approximately 6%.
Citations: 10
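For readers unfamiliar with the sparse-coding component, the sketch below shows the standard encoding step (ISTA, iterative shrinkage-thresholding) that expresses a feature vector as a sparse combination of dictionary atoms. How SC-CNN couples this with its convolutional layers is not specified in the abstract; this NumPy example only illustrates the generic technique.

```python
# ISTA sketch: solve min_z 0.5*||x - D z||^2 + lam*||z||_1.
import numpy as np

def soft_threshold(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def ista_encode(D, x, lam=0.1, n_iter=100):
    """D: (feat, atoms) dictionary with unit-norm columns; x: (feat,)."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    z = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ z - x)           # gradient of the data term
        z = soft_threshold(z - grad / L, lam / L)
    return z

rng = np.random.default_rng(1)
D = rng.normal(size=(64, 256))
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms
x = 2.0 * D[:, 3] - 1.5 * D[:, 42]         # sparse ground truth: atoms 3, 42
z = ista_encode(D, x)
print(np.argsort(-np.abs(z))[:2])          # expected to recover [3, 42]
```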
Cross-media retrieval with semantics clustering and enhancement
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019310
Minfeng Zhan, L. Li, Qingming Huang, Yugui Liu
Abstract: Cross-media retrieval, which uses a text query to search for images and vice versa, has attracted wide attention in recent years. Most existing cross-media retrieval methods aim at finding a common subspace and maximizing the correlations between different modalities, but these approaches do not directly capture the underlying semantic information of the modalities. This paper proposes a novel cross-media retrieval method based on semantics clustering and enhancement, in which a semantic-preserving mapping is learned from the original space to the target semantic space. Meanwhile, in order to improve the demarcation of the semantic space, we enhance the semantic manifold by learning a dimension-invariant matrix. Our approach not only maximizes the correlation between different modalities, but also increases the discriminative ability among different categories. Experiments show that our approach outperforms popular methods on two real-world datasets.
Citations: 3
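As background for the "mapping into a target semantic space" idea, here is a deliberately simplified NumPy sketch: each modality's features are projected into a shared label space by ridge regression, after which text and image items can be ranked directly. The paper's clustering and manifold-enhancement steps are not reproduced; this only shows the baseline semantic-projection mechanism.

```python
# Simplified semantic-space projection for cross-media retrieval.
import numpy as np

def learn_semantic_map(X, Y, lam=1e-2):
    """X: (n, d) modality features; Y: (n, c) semantic targets
    (e.g., one-hot category vectors). Returns W of shape (d, c)."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ Y)

rng = np.random.default_rng(3)
X_img, X_txt = rng.normal(size=(100, 64)), rng.normal(size=(100, 32))
Y = np.eye(10)[rng.integers(0, 10, 100)]           # shared category labels
W_img, W_txt = learn_semantic_map(X_img, Y), learn_semantic_map(X_txt, Y)

# Retrieval: embed a text query and all image candidates, rank by cosine.
q = X_txt[0] @ W_txt
emb = X_img @ W_img
sims = emb @ q / (np.linalg.norm(emb, axis=1) * np.linalg.norm(q) + 1e-12)
print(sims.argsort()[::-1][:5])                    # top-5 image indices
```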
SynCam: Capturing sub-frame synchronous media using smartphones
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019430
Ishit Mehta, P. Sakurikar, R. Shah, P. J. Narayanan
Abstract: Smartphones have become the de-facto capture devices for everyday photography. Unlike traditional digital cameras, smartphones are versatile devices with auxiliary sensors, processing power, and networking capabilities. In this work, we harness the communication capabilities of smartphones and present a synchronous, coordinated multi-camera capture system. Synchronous capture is important for many image/video fusion and 3D reconstruction applications, and the proposed system provides an inexpensive and effective means to capture multi-camera media for such applications. Our coordinated capture system is based on a wireless protocol that uses NTP-based synchronization and device-specific lag compensation. It achieves sub-frame synchronization across all participating smartphones, even of heterogeneous make and model. We propose a new method based on fiducial markers displayed on an LCD screen to temporally calibrate smartphone cameras. We demonstrate the utility and versatility of this system to enhance traditional videography and to create novel visual representations such as panoramic videos, HDR videos, multi-view 3D reconstruction, multi-flash imaging, and multi-camera social media.
Citations: 0
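The building block behind NTP-based synchronization is the standard four-timestamp exchange for estimating clock offset and round-trip delay, sketched below. The device-specific capture-lag compensation the abstract mentions would be applied on top of this estimate; that part is not shown.

```python
# Standard NTP-style clock-offset estimation from one request/response pair.
def ntp_offset_and_delay(t0, t1, t2, t3):
    """t0: client send, t1: server receive, t2: server send,
    t3: client receive (all in seconds, each in its own local clock)."""
    offset = ((t1 - t0) + (t2 - t3)) / 2.0   # estimated clock offset
    delay = (t3 - t0) - (t2 - t1)            # round-trip network delay
    return offset, delay

# Example: the client clock runs 0.250 s behind the coordinator's clock.
print(ntp_offset_and_delay(t0=10.000, t1=10.270, t2=10.271, t3=10.041))
# -> (0.25, 0.04)
```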
Video salient object detection via cross-frame cellular automata
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019389
Jingfan Guo, Tongwei Ren, Lei Huang, Xingyu Liu, Ming-Ming Cheng, Gangshan Wu
Abstract: Salient object detection aims to detect the visually attractive objects in images and videos. In this paper, we propose a novel salient object detection method for videos based on cross-frame cellular automata. Given a video, we first represent the video frames with superpixels and construct a saliency propagation network among superpixels, both within a frame and between adjacent frames, based on their appearance similarities and temporal coherency. Second, we initialize the saliency map of each frame with the fusion of two saliency maps generated independently from appearance and motion features. Finally, we utilize cellular automata updating to propagate saliency among superpixels iteratively and generate coherent saliency maps with complete objects. The experimental results show that our method outperforms the state-of-the-art methods on different types of videos.
Citations: 12
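To illustrate the cellular-automata update, here is a NumPy sketch in which each superpixel's saliency is iteratively refreshed from similarity-weighted neighbors (the neighbor set could span a frame and its adjacent frames, matching the cross-frame idea). The Gaussian similarity and the fixed blending factor are simplifying assumptions relative to the paper's impact and coherence matrices.

```python
# Cellular-automata-style saliency propagation over superpixels.
import numpy as np

def propagate_saliency(S0, features, n_iter=10, sigma=0.2, alpha=0.6):
    """S0: (n,) initial saliency; features: (n, d) superpixel descriptors
    (within one frame, or stacked across adjacent frames)."""
    d2 = ((features[:, None, :] - features[None, :, :]) ** 2).sum(-1)
    F = np.exp(-d2 / (2 * sigma ** 2))       # appearance-similarity matrix
    np.fill_diagonal(F, 0.0)                 # no self-influence
    F /= F.sum(axis=1, keepdims=True)        # row-normalize
    S = S0.copy()
    for _ in range(n_iter):
        # Keep a fraction of each cell's own state; absorb the rest from
        # its similarity-weighted neighbors.
        S = alpha * S + (1 - alpha) * (F @ S)
    return S

rng = np.random.default_rng(2)
feats = np.vstack([rng.normal(0, 0.1, (20, 3)), rng.normal(1, 0.1, (30, 3))])
init = np.concatenate([np.full(20, 0.9), np.full(30, 0.1)])
print(propagate_saliency(init, feats).round(2))  # saliency stays coherent per cluster
```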
Deep multimodal network for multi-label classification
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019322
T. Chen, Shangfei Wang, Shiyu Chen
Abstract: Current multimodal deep learning approaches rarely explicitly exploit the dependencies inherent in multiple labels, which are crucial for multimodal multi-label classification. In this paper, we propose a multimodal deep learning approach for multi-label classification. Specifically, we introduce deep networks for feature representation learning and construct classifiers with an objective function that is constrained by dependencies among both labels and modalities. We further propose an effective training algorithm to learn the deep networks and classifiers jointly. Thus, we explicitly leverage the relations among labels and modalities to facilitate multimodal multi-label classification. Experiments on multi-label classification and cross-modal retrieval on the Pascal VOC dataset and the LabelMe dataset demonstrate the effectiveness of the proposed approach.
Citations: 9
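To make "an objective constrained by dependencies among labels and modalities" more tangible, here is a heavily hedged PyTorch sketch: two per-modality encoders, a cross-modality agreement penalty, and a label-consistency penalty driven by a label co-occurrence matrix. The paper's exact constraint formulation is not given in the abstract, so every term here is an illustrative stand-in.

```python
# Hypothetical joint objective coupling modalities and label dependencies.
import torch
import torch.nn as nn

class MultimodalMultiLabel(nn.Module):
    def __init__(self, img_dim=512, txt_dim=300, hidden=128, n_labels=20):
        super().__init__()
        self.img_net = nn.Sequential(nn.Linear(img_dim, hidden), nn.ReLU(),
                                     nn.Linear(hidden, n_labels))
        self.txt_net = nn.Sequential(nn.Linear(txt_dim, hidden), nn.ReLU(),
                                     nn.Linear(hidden, n_labels))

    def forward(self, img, txt):
        return self.img_net(img), self.txt_net(txt)

def joint_loss(img_logits, txt_logits, y, label_corr, beta=0.1, gamma=0.1):
    bce = nn.functional.binary_cross_entropy_with_logits
    loss = bce(img_logits, y) + bce(txt_logits, y)
    loss += beta * (img_logits - txt_logits).pow(2).mean()  # modal agreement
    # Encourage predictions consistent with label co-occurrence structure.
    p = torch.sigmoid(img_logits)
    loss += gamma * (p - p @ label_corr).pow(2).mean()
    return loss

model = MultimodalMultiLabel()
img, txt = torch.randn(8, 512), torch.randn(8, 300)
y = (torch.rand(8, 20) > 0.8).float()
corr = torch.eye(20)                  # placeholder co-occurrence matrix
print(joint_loss(*model(img, txt), y, corr).item())
```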
A deep convolutional neural network approach for complexity reduction on intra-mode HEVC
2017 IEEE International Conference on Multimedia and Expo (ICME) Pub Date: 2017-07-10 DOI: 10.1109/ICME.2017.8019316
Tianyi Li, Mai Xu, Xin Deng
Abstract: The High Efficiency Video Coding (HEVC) standard significantly reduces coding bit-rate over the preceding H.264 standard, but at the expense of extremely high encoding complexity. In fact, the coding tree unit (CTU) partition consumes a large proportion of HEVC encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, we propose in this paper a complexity reduction approach for intra-mode HEVC, which learns a deep convolutional neural network (CNN) model to predict the CTU partition instead of performing RDO. First, we establish a large-scale database with diverse patterns of CTU partition. Second, we model the partition as a three-level classification problem. Then, to solve the classification problem, we develop a deep CNN structure with various sizes of convolutional kernels and extensive trainable parameters, which can be learned from the established database. Finally, experimental results show that our approach reduces intra-mode encoding time by 62.25% and 69.06%, with a negligible Bjøntegaard delta bit-rate of 2.12% and 1.38%, over the test sequences and images respectively, superior to other state-of-the-art approaches.
Citations: 59
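The "three-level classification" framing maps naturally onto the three split decisions of a 64x64 CTU in HEVC: whether to split the CTU into 32x32 CUs, each 32x32 into 16x16, and each 16x16 into 8x8. The PyTorch sketch below shows this output structure; the convolutional kernel sizes and layer widths are assumptions, not the configuration reported in the paper.

```python
# Illustrative three-level CTU-partition classifier over a 64x64 luma block.
import torch
import torch.nn as nn

class CTUPartitionNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=8, stride=8), nn.ReLU(),  # 64 -> 8
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
        )
        self.level1 = nn.Linear(32 * 8 * 8, 1)    # split the 64x64 CTU?
        self.level2 = nn.Linear(32 * 8 * 8, 4)    # split each 32x32 CU?
        self.level3 = nn.Linear(32 * 8 * 8, 16)   # split each 16x16 CU?

    def forward(self, luma_ctu):                  # (batch, 1, 64, 64)
        f = self.features(luma_ctu).flatten(1)
        return (torch.sigmoid(self.level1(f)),
                torch.sigmoid(self.level2(f)),
                torch.sigmoid(self.level3(f)))

net = CTUPartitionNet()
p1, p2, p3 = net(torch.randn(2, 1, 64, 64))
print(p1.shape, p2.shape, p3.shape)  # (2,1) (2,4) (2,16)
```

At encoding time, thresholding these probabilities replaces the brute-force RDO search over partition candidates, which is where the reported time savings come from.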