{"title":"Challenges in Autonomous UAV Cinematography: An Overview","authors":"Ioannis Mademlis, V. Mygdalis, N. Nikolaidis, I. Pitas","doi":"10.1109/ICME.2018.8486586","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486586","url":null,"abstract":"Autonomous UAV cinematography is an active research field with exciting potential for the media industry. It bears the promise of greatly facilitating UAV shooting for various applications, while significantly reducing the costs compared to manual shooting. However, the general problem has not been clearly defined and the challenges arising from current legislation and technology restrictions have not been fully charted. A complete overview of issues related to autonomous UAV cinematography is needed, pertaining to the current situation in the field, so as to guide immediate-future research. The purpose of this paper is to lay exactly this groundwork, with the expectation of providing a global perspective to multiple domain-specific research communities. The outlined issues are partitioned into challenges deriving from ethical/legal/safety considerations and from operational/production requirements. A brief survey of current technological solutions, including their limitations, is also provided for each issue.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121500609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Multi-Metric Learning for Person Re-Identification","authors":"Yongxin Ge, Xinqian Gu, Min Chen, Hongxing Wang, Dan Yang","doi":"10.1109/ICME.2018.8486502","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486502","url":null,"abstract":"In this paper, to exploit more discriminative information from the global-body and body-parts features, we present a novel deep multi-metric learning (DMML) network for person re-identification under the triplet framework. The main novelty of our learning framework lies in two aspects: 1) Unlike most existing metric learning-based approaches, which learn only one distance metric for comparison, our DMML method aims to learn different metrics for the global-body and body-parts features respectively by using a convolutional neural network (CNN); 2) A new multi-metric loss function is proposed to train the DMML network, under which the distance of each negative pair is greater than that of each positive pair by a predefined margin, and the correlations of different metrics are maximized. Compared with previous person re-identification methods that have shown state-of-the-art performance, our DMML approach achieves competitive results on the challenging CUHK03, CUHK01, VIPeR and iLIDS datasets.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122545878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Feed-Net: Fully End-to-End Dehazing","authors":"S. Zhang, Wenqi Ren, Jian Yao","doi":"10.1109/ICME.2018.8486435","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486435","url":null,"abstract":"This paper proposes an image dehazing model built with a fully convolutional neural network (CNN), called the Fully End-to-End Dehazing Network (FEED-Net). In contrast to most previous deep learning methods, which estimate the transmission map and the atmospheric light separately, FEED-Net recovers the haze-free image directly from a hazy image via a light-weight CNN. In addition, we introduce contextual information into dehazing via dilated convolution and use dense skip connections for feature fusion, which makes end-to-end dehazing possible. Experimental results show our method outperforms state-of-the-art algorithms on both a synthetic dataset and real-world images.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128314799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two Pass Rate Control for Consistent Quality Based on Down-Sampling Video in HEVC","authors":"Yu-Yao Shen, Chih-Hung Kuo","doi":"10.1109/ICME.2018.8486544","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486544","url":null,"abstract":"Rate control plays an important role in video coding and streaming applications with bandwidth constraints. While most research focuses on improving coding efficiency, the fluctuation of video quality is seldom considered. Many rate control schemes suffer from unreliable initialization of coding parameters, which leads to seriously inconsistent quality at the beginning of a video. Besides, the hierarchical structure for frame references introduces more quality fluctuations, although it improves coding efficiency significantly. This paper presents a two-pass rate control method that aims for consistent visual quality. The video is down-sampled by a factor of four and then encoded in the first pass. A fixed Lagrange multiplier (λ) is derived from the information recorded in the first pass and then applied to all frames in the second coding pass. A QP adjustment policy is adopted to maintain a consistent quality and a constant bitrate. Experimental results show that the proposed rate control method reduces the fluctuation of video quality by an average of 94.63% compared to encoding with the HEVC Test Model (HM16.9).","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"60 30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125782013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Exposure Assessment: A Benchmark and a Deep Convolutional Neural Networks Based Model","authors":"Lijun Zhang, Lin Zhang, Xiao Liu, Ying Shen, Dongqing Wang","doi":"10.1109/ICME.2018.8486569","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486569","url":null,"abstract":"In the camera equipment manufacturing industry, exposure calibration is one of the basic steps for manufacturers to consider before launching their products to the market. To this end, a method that can objectively and automatically assess the exposure levels of images taken by a camera is highly desired. However, few studies have been conducted in this area. In this paper, we attempt to solve this issue to some extent and our contributions are twofold. Firstly, in order to facilitate the study of image exposure assessment, an Image Exposure Database (IEpsD) is established. This database contains 15,582 images with various exposure levels, and each image has an associated subjective exposure score reflecting its perceptual exposure level. Secondly, we propose a novel, highly accurate DCNN-based model, namely IEpsM (Image Exposure Metric), to predict the exposure level of a given image.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125962371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving Tiny Vehicle Detection in Complex Scenes","authors":"W. Liu, Shengcai Liao, W. Hu, Xuezhi Liang, Yan Zhang","doi":"10.1109/ICME.2018.8486507","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486507","url":null,"abstract":"Vehicle detection is still a challenge in complex traffic scenes, especially for vehicles of tiny scales. Though RCNN-based two-stage detectors have demonstrated considerably good performance, less attention has been paid to the quality of the first stage, where tiny vehicles are very likely to be missed. In this paper, we propose a deep network for accurate vehicle detection, with the main idea of using a relatively large feature map for proposal generation and keeping the spatial layout of ROI features to represent and detect tiny vehicles. However, large feature maps in lower levels of a deep network generally contain limited discriminant information. To address this, we introduce a backward feature enhancement operation, which absorbs higher-level information step by step to enhance the base feature map. By doing so, even with only 100 proposals, the resulting proposal network achieves an encouraging recall of over 99%. Furthermore, unlike the common practice of flattening features after ROI pooling, we argue that for better detection of tiny vehicles, the spatial layout of the ROI features should be preserved and fully integrated. Accordingly, we use a multi-path light-weight processing chain to effectively integrate ROI features while preserving their spatial layouts. Experiments on the challenging DETRAC vehicle detection benchmark show that the proposed method largely improves a competitive baseline (ResNet50-based Faster RCNN) by 16.5% mAP, and it outperforms all previously published and unpublished results.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126284294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dense Reconstruction from Monocular Slam with Fusion of Sparse Map-Points and Cnn-Inferred Depth","authors":"Xiang Ji, Xinchen Ye, Hongcan Xu, Haojie Li","doi":"10.1109/ICME.2018.8486548","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486548","url":null,"abstract":"Real-time monocular visual SLAM approaches, which rely on building sparse correspondences between two or more views of the scene, are capable of accurately tracking camera pose and inferring the structure of the environment. However, these methods share a common problem: the reconstructed 3D map is extremely sparse. Recently, convolutional neural networks (CNNs) have been widely used for estimating scene depth from monocular color images. As we observe, sparse map-points generated from epipolar geometry are locally accurate, while a CNN-inferred depth map contains high-level global context but produces blurry depth boundaries. Therefore, we propose a depth fusion framework that yields a dense monocular reconstruction by fully exploiting both the sparse depth samples and the CNN-inferred depth. Color key-frames are employed to guide the depth reconstruction process, avoiding smoothing over depth boundaries. Experimental results on benchmark datasets show the robustness and accuracy of our method.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127424383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Soft Clustering Guided Image Smoothing","authors":"Liangkai Li, Xiaojie Guo, Wei Feng, Jiawan Zhang","doi":"10.1109/ICME.2018.8486448","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486448","url":null,"abstract":"Image smoothing, which aims to remove unwanted textures and preserve desired structures, plays an important role in many multimedia and computer vision tasks. The key to image smoothing, despite different applications, is to distinguish the structures from the textures. This paper presents a novel image smoothing method, following the principle that, for a certain pixel, its neighbors in both space and intensity should contribute more to smoothing, while distant ones should be excluded to avoid over-smoothing. Intuitively, clustering is a good candidate for achieving this goal. However, due to rich textures and clutter within images, simply performing clustering on the input is likely to yield inaccurate assignments and thus unsatisfactory smoothing results. In addition, for our task, using traditional hard clustering techniques carries a high risk of generating staircase artifacts. To address these issues, we design a customized algorithm that, on the one hand, adopts soft clustering to assign pixels more faithfully, and on the other hand, iterates between soft clustering and smoothing so that each improves the other. Experiments on several challenging images demonstrate the efficacy of our method and its superiority over other prevailing approaches.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127495067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic Adaptation of Multimedia Presentations for Videoconferencing in Application Mobility","authors":"Francisco Javier Velázquez-García, P. Halvorsen, H. Stensland, F. Eliassen","doi":"10.1109/ICME.2018.8486565","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486565","url":null,"abstract":"Application mobility is the paradigm where users can move their running applications to heterogeneous devices in a seamless manner. This mobility involves dynamic context changes of hardware, network resources, user environment, and user preferences. In order to continue multimedia processing under these context changes, applications need to adapt not only the collection of media streams, i.e., the multimedia presentation, but also their internal configuration to work on different hardware. We present a performance analysis of adapting a videoconferencing prototype application within a proposed adaptation control loop that autonomously adapts multimedia pipelines. Results show that the time spent creating an adaptation plan and executing it is on the order of hundreds of milliseconds. Reconfiguring pipelines, compared to building them from scratch, is approximately 1000 times faster when re-utilizing already instantiated hardware-dependent components. Therefore, we conclude that the adaptation of multimedia pipelines is a feasible approach for multimedia applications that adhere to application mobility.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114756223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Magnify-Net for Multi-Person 2D Pose Estimation","authors":"Haoqian Wang, Wangpeng An, Xingzheng Wang, Lu Fang, Jiahui Yuan","doi":"10.1109/ICME.2018.8486591","DOIUrl":"https://doi.org/10.1109/ICME.2018.8486591","url":null,"abstract":"We propose a novel method for multi-person 2D pose estimation. Our model, which we refer to as Magnify-Net, zooms in on the image gradually to address the trade-off between mean average precision (mAP) and pixel error. Moreover, we compress the network efficiently with a design that increases mAP while saving processing time. It is a simple yet robust bottom-up approach consisting of one stage. The architecture is designed to detect part positions and their associations jointly via two branches of the same sequential prediction process, resulting in notable gains in performance and efficiency. Our method outperforms the previous state-of-the-art results on the challenging COCO key-points task and the MPII Multi-Person Dataset.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116894309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}