{"title":"Multi-scale exposure fusion via gradient domain guided image filtering","authors":"F. Kou, Zhengguo Li, C. Wen, Weihai Chen","doi":"10.1109/ICME.2017.8019529","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019529","url":null,"abstract":"Multi-scale exposure fusion is an efficient way to fuse differently exposed low dynamic range (LDR) images of a high dynamic range (HDR) scene into a high quality LDR image directly. It can produce images with higher quality than single-scale exposure fusion, but has a risk of producing halo artifacts and cannot preserve details in brightest or darkest regions well in the fused image. In this paper, an edge-preserving smoothing pyramid is introduced for the multi-scale exposure fusion. Benefiting from the edge-preserving property of the filter used in the algorithm, the details in the brightest/darkest regions are preserved well and no halo artifacts are produced in the fused image. The experimental results prove that the proposed algorithm produces better fused images than the state-of-the-art algorithms both qualitatively and quantitatively.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134539607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A closer look: Small object detection in faster R-CNN","authors":"C. Eggert, Stephan Brehm, Anton Winschel, D. Zecha, R. Lienhart","doi":"10.1109/ICME.2017.8019550","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019550","url":null,"abstract":"Faster R-CNN is a well-known approach for object detection which combines the generation of region proposals and their classification into a single pipeline. In this paper we apply Faster R-CNN to the task of company logo detection. Motivated by the weak performance of Faster R-CNN on small object instances, we perform a detailed examination of both the proposal and the classification stage, examining their behavior for a wide range of object sizes. Additionally, we look at the influence of feature map resolution on the performance of those stages. We introduce an improved scheme for generating anchor proposals and propose a modification to Faster R-CNN which leverages higher-resolution feature maps for small objects. We evaluate our approach on the Flicker data set improving the detection performance on small object instances.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133509759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visual relationship detection with object spatial distribution","authors":"Yaohui Zhu, Shuqiang Jiang, Xiangyang Li","doi":"10.1109/ICME.2017.8019448","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019448","url":null,"abstract":"Recently, object recognition techniques have been rapidly developed. Most of existing object recognition focused on recognizing several independent concepts. The relationship of objects is also an important problem, which shows in-depth semantic information of images. In this work, toward general visual relationship detection, we propose a method to integrate spatial distribution of object to facilitate visual relation detection. Spatial distribution can not only reflect positional relation of object but also describe structural information between objects. Spatial distributions are described with different features such as positional relation, size relation, shape relation, and so on. By combing spatial distribution features with visual and concept features, we establish a modeling method to make these three aspects working together to facilitate visual relationship detection. To evaluate the proposed method, we conduct experiments on two datasets, which are the Stanford VRD dataset, and a newly proposed larger new dataset which contains 15k images. Experimental results demonstrate that our approach is effective.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"142 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134034429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust human detection with super-pixel segmentation and random ferns classification using RGB-D camera","authors":"Luchao Tian, Mingchen Li, Guyue Zhang, Jingwen Zhao, Y. Chen","doi":"10.1109/ICME.2017.8019303","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019303","url":null,"abstract":"Efficient and robust detection of humans has received great attention during the past few decades. This paper presents a two-staged approach for human detection in RGB-D images. As the traditional sliding window-based methods for target localization are often time-consuming, we propose to use the super-pixel method in depth data to efficiently locate the plausible head-top locations in the first stage. In the second stage, we propose to use Random Ferns to seek the features by combining information from different image spaces, which can select the most discriminative features and compute simple and fast Local Binary Features (LBFs) allowing for real-time applications. We evaluate our method on three publicly available challenging datasets taken by a Kinect camera. Experimental results demonstrate that the proposed approach can robustly detect humans in complicated environments.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"31 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113931671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep networks for compressed image sensing","authors":"Wuzhen Shi, F. Jiang, Shengping Zhang, Debin Zhao","doi":"10.1109/ICME.2017.8019428","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019428","url":null,"abstract":"The compressed sensing (CS) theory has been successfully applied to image compression in the past few years as most image signals are sparse in a certain domain. Several CS reconstruction models have been recently proposed and obtained superior performance. However, there still exist two important challenges within the CS theory. The first one is how to design a sampling mechanism to achieve an optimal sampling efficiency, and the second one is how to perform the reconstruction to get the highest quality to achieve an optimal signal recovery. In this paper, we try to deal with these two problems with a deep network. First of all, we train a sampling matrix via the network training instead of using a traditional manually designed one, which is much appropriate for our deep network based reconstruct process. Then, we propose a deep network to recover the image, which imitates traditional compressed sensing reconstruction processes. Experimental results demonstrate that our deep networks based CS reconstruction method offers a very significant quality improvement compared against state-of-the-art ones.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128819747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classifying derivative works with search, text, audio and video features","authors":"Jordan B. L. Smith, Masahiro Hamasaki, Masataka Goto","doi":"10.1109/ICME.2017.8019444","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019444","url":null,"abstract":"Users of video-sharing sites often search for derivative works of music, such as live versions, covers, and remixes. Audio and video content are both important for retrieval: “karaoke” specifies audio content (instrumental version) and video content (animated lyrics). Although YouTube's text search is fairly reliable, many search results do not match the exact query. We introduce an algorithm to classify YouTube videos by category of derivative work. Based on a standard pipeline for video-based genre classification, it combines search, text, and video features with a novel set of audio features derived from audio fingerprints. A baseline approach is outperformed by the search and text features alone, and combining these with video and audio features performs best of all, reducing the audio content error rate from 25% to 15%.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116709028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A joint model for action localization and classification in untrimmed video with visual attention","authors":"Weimian Li, Wenmin Wang, Xiongtao Chen, Jinzhuo Wang, Ge Li","doi":"10.1109/ICME.2017.8019335","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019335","url":null,"abstract":"In this paper, we introduce a joint model that learns to directly localize the temporal bounds of actions in untrimmed videos as well as precisely classify what actions occur. Most existing approaches tend to scan the whole video to generate action instances, which are really inefficient. Instead, inspired by human perception, our model is formulated based on a recurrent neural network to observe different locations within a video over time. And, it is capable of producing temporal localizations by only observing a fixed number of fragments, and the amount of computation it performs is independent of input video size. The decision policy for determining where to look next is learned by REINFORCE which is powerful in non-differentiable settings. In addition, different from relevant ways, our model runs localization and classification serially, and possesses a strategy for extracting appropriate features to classify. We evaluate our model on ActivityNet dataset, and it greatly outperforms the baseline. Moreover, compared with a recent approach, we show that our serial design can bring about 9% increase in detection performance.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"138 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120843102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A study on lidar data forensics","authors":"K. Bahirat, B. Prabhakaran","doi":"10.1109/ICME.2017.8019395","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019395","url":null,"abstract":"3D LiDAR (Light Imaging Detection and Ranging) data has recently been used in a wide range of applications such as vehicle automation and crime scene reconstruction. Decision making in such applications is highly dependent on LiDAR data. Thus, it becomes crucial to authenticate the data before using it. Though authentication of 2D digital images and video has been widely studied, the area of 3D data forensic is relatively unexplored. In this paper, we investigate and identify three possible attacks on the LiDAR data. We also propose two novel forensic approaches as a countermeasure for such attacks and study their effectiveness. The first forensic approach utilises the density consistency check while the second method leverages the occlusion effect for revealing the forgery. Experimental results demonstrate the effectiveness of the proposed forgery attacks and raise the awareness against unauthenticated use of LiDAR data. The performance analyses of the proposed forensic approaches indicate that the proposed methods are very efficient and provide the detection accuracy of more than 95% for certain kinds of forgery attacks. While the forensic approach is unable to handle all forgery attacks, the study motivates to explore more sophisticated forensic methods for LiDAR data.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127344153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning deep semantic attributes for user video summarization","authors":"Ke Sun, Jiasong Zhu, Zhuo Lei, Xianxu Hou, Qian Zhang, Jiang Duan, G. Qiu","doi":"10.1109/ICME.2017.8019411","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019411","url":null,"abstract":"This paper presents a Semantic Attribute assisted video SUMmarization framework (SASUM). Compared with traditional methods, SASUM has several innovative features. Firstly, we use a natural language processing tool to discover a set of keywords from an image and text corpora to form the semantic attributes of visual contents. Secondly, we train a deep convolution neural network to extract visual features as well as predict the semantic attributes of video segments which enables us to represent video contents with visual and semantic features simultaneously. Thirdly, we construct a temporally constrained video segment affinity matrix and use a partially near duplicate image discovery technique to cluster visually and semantically consistent video frames together. These frame clusters can then be condensed to form an informative and compact summary of the video. We will present experimental results to show the effectiveness of the semantic attributes in assisting the visual features in video summarization and our new technique achieves state-of-the-art performance.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"497 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123565153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image restoration via multi-scale non-local total variation regularization","authors":"Jing Mu, Ruiqin Xiong, Xiaopeng Fan, Siwei Ma","doi":"10.1109/ICME.2017.8019463","DOIUrl":"https://doi.org/10.1109/ICME.2017.8019463","url":null,"abstract":"Total-variation (TV) regularization is widely adopted in image restoration problems to exploit the local smoothness of image. However, traditional TV regularization only models the sparsity of image gradient at the original scale. This paper introduces a multi-scale TV regularization method which models the image gradient sparsity at different scales, and constrains the gradient magnitude of different scales jointly. As different scales extract different frequency of image, our proposed multi-scale regularization method provides constraints for different frequency components. And for each scale, we adaptively estimate the gradient distribution at a particular pixel from a group of nonlocally searched similar patches. Finally, experimental results demonstrate that the proposed method outperforms the conventional TV regularization methods for image restoration.","PeriodicalId":330977,"journal":{"name":"2017 IEEE International Conference on Multimedia and Expo (ICME)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125497281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}