2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW): Latest Publications

A Comprehensive Solution for Deep-Learning Based Cargo Inspection to Discriminate Goods in Containers
Jiahang Che, Yuxiang Xing, Li Zhang
DOI: 10.1109/CVPRW.2018.00166
Abstract: In this work, we attempt to classify commodities in containers with HS (Harmonized System) codes, which is a challenging task due to the large number of categories in HS codes and their hierarchical structure based on a product's composition and economic activity. To tackle this problem, we propose an ensemble model that incorporates fine-grained image categorization, data analysis of cargo manifests, and a human-in-the-loop paradigm. Employing deep learning, we train a triplet network for fine-grained image categorization. Then, by investigating the extensive information in cargo manifests, unreasonable predictions can be filtered out. With human-in-the-loop embedded, human intelligence is integrated to validate the resulting HS codes. Moreover, an HS code semantic tree is built to trade off specificity and accuracy.
Citations: 3
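
As an illustrative aside: the fine-grained categorization step above relies on a triplet network. Below is a minimal PyTorch sketch of a standard margin-based triplet loss over L2-normalized embeddings; the authors' exact loss and embedding network are not specified in the abstract, so the margin and normalization here are assumptions.

    import torch.nn.functional as F

    def triplet_loss(anchor, positive, negative, margin=0.2):
        # L2-normalize so squared distances are comparable across batches.
        anchor = F.normalize(anchor, dim=1)
        positive = F.normalize(positive, dim=1)
        negative = F.normalize(negative, dim=1)
        d_pos = (anchor - positive).pow(2).sum(dim=1)  # same-category distance
        d_neg = (anchor - negative).pow(2).sum(dim=1)  # cross-category distance
        # Hinge: push negatives at least `margin` farther away than positives.
        return F.relu(d_pos - d_neg + margin).mean()
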
Analysis of Efficient CNN Design Techniques for Semantic Segmentation
Alexandre Briot, P. Viswanath, S. Yogamani
DOI: 10.1109/CVPRW.2018.00109
Abstract: The majority of CNN architecture design is aimed at achieving high accuracy on public benchmarks by increasing complexity. Typically, such networks are over-specified by a large margin and can be optimized by a factor of 10-100x with only a small reduction in accuracy. In spite of the increase in the computational power of embedded systems, these networks are still not suitable for embedded deployment. There is a great need to optimize for hardware and reduce network size by orders of magnitude for computer vision applications. This has led to a growing community focused on designing efficient networks. However, CNN architectures are evolving rapidly, and efficient architectures seem to lag behind. There is also a gap in understanding hardware architecture details and incorporating them into network design. The motivation of this paper is to systematically summarize efficient design techniques and provide guidelines for application developers. We also perform a case study by benchmarking various semantic segmentation algorithms for autonomous driving.
Citations: 27
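
For context, one of the most common techniques in this design space is the depthwise-separable convolution, which factorizes a dense KxK convolution into a per-channel spatial filter plus a 1x1 channel mixer, cutting multiply-accumulates roughly by a factor of K^2. A minimal PyTorch sketch of the technique class (not a block taken from the paper):

    import torch.nn as nn

    class DepthwiseSeparableConv(nn.Module):
        def __init__(self, in_ch, out_ch, stride=1):
            super().__init__()
            # Depthwise: one 3x3 filter per input channel (groups=in_ch).
            self.depthwise = nn.Conv2d(in_ch, in_ch, 3, stride=stride,
                                       padding=1, groups=in_ch, bias=False)
            # Pointwise: 1x1 convolution mixes information across channels.
            self.pointwise = nn.Conv2d(in_ch, out_ch, 1, bias=False)
            self.bn = nn.BatchNorm2d(out_ch)
            self.act = nn.ReLU(inplace=True)

        def forward(self, x):
            return self.act(self.bn(self.pointwise(self.depthwise(x))))
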
Realtime Quality Assessment of Iris Biometrics Under Visible Light
Mohsen Jenadeleh, Marius Pedersen, D. Saupe
DOI: 10.1109/CVPRW.2018.00085
Abstract: Ensuring sufficient quality of iris images acquired by handheld imaging devices in visible light poses many challenges to iris recognition systems. Many distortions affect the input iris images, and the sources and types of these distortions are unknown in uncontrolled environments. We propose a fast no-reference image quality assessment measure for predicting iris image quality in order to handle severely degraded iris images. The proposed differential sign-magnitude statistics index (DSMI) is based on statistical features of the local difference sign-magnitude transform, which are computed by comparing the local mean with the central pixel of the patch and considering only noticeable variations. The experiments, conducted with a reference iris recognition system and three visible-light datasets, showed that the quality of iris images strongly affects recognition performance. Using the proposed method as a quality filtering step improved the performance of the iris recognition system by rejecting poor-quality iris samples.
Citations: 6
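
A rough sketch of the kind of local statistic DSMI builds on: the sign and magnitude of the difference between each pixel and its local patch mean, keeping only "noticeable" variations. The patch size and threshold below are illustrative assumptions; the paper's exact transform and feature pooling are not given in the abstract.

    import numpy as np
    from scipy.ndimage import uniform_filter

    def local_difference_sign_magnitude(img, patch=7, threshold=2.0):
        img = img.astype(np.float64)
        local_mean = uniform_filter(img, size=patch)
        diff = img - local_mean
        noticeable = np.abs(diff) > threshold  # suppress imperceptible variation
        sign = np.sign(diff) * noticeable      # -1, 0, or +1 per pixel
        magnitude = np.abs(diff) * noticeable
        return sign, magnitude
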
Learning Hierarchical Models for Class-Specific Reconstruction from Natural Data
Arun C. S. Kumar, S. Bhandarkar, Mukta Prasad
DOI: 10.1109/CVPRW.2018.00153
Abstract: We propose a novel method for class-specific, single-view object detection, pose estimation, and deformable 3D reconstruction, in which a two-pronged (sparse semantic and dense shape) representation is learned automatically from natural image data. Given a new image, the method estimates camera pose and a deformable reconstruction using an effective incremental optimization. It extracts a continuous, scaled-orthographic pose (without resorting to regression and/or discretized 1D azimuth-based representations) and reconstructs a full free-form shape (rather than retrieving the closest 3D CAD shape proxy, as is typical in the state of the art). We learn the two-pronged model purely from natural image data, as automatically and faithfully as possible, reducing the human effort and bias typical of this problem. The pipeline combines data-driven, deep-learning-based semantic part learning with principled modelling and effective optimization of the problem's physics, shape deformation, pose, and occlusion. The underlying sparse (part-based) representation of the object is computationally efficient for detection and discriminative tasks, whereas the overlaid dense (skin-like) representation models and realistically renders comprehensive 3D structure, including natural deformation and occlusion. The results for the car class are visually pleasing and, importantly, outperform the state of the art quantitatively as well. Our contribution to visual scene understanding through the two-pronged object representation shows promise for more accurate 3D scene understanding in real-world applications such as virtual/mixed reality and autonomous navigation, to name a few.
Citations: 0
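
Background for the camera model named above: under scaled-orthographic projection, a 3D point is rotated, its depth coordinate is dropped, and the result is scaled and shifted in the image plane. A minimal NumPy sketch of that standard model:

    import numpy as np

    def scaled_orthographic_projection(X, R, s, t):
        # X: (N, 3) 3D points; R: (3, 3) rotation; s: scalar scale; t: (2,) shift.
        # Rotate, keep the first two coordinates (drop depth), scale, translate.
        return s * (X @ R.T)[:, :2] + t
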
SAM: Pushing the Limits of Saliency Prediction Models
M. Cornia, L. Baraldi, G. Serra, R. Cucchiara
DOI: 10.1109/CVPRW.2018.00250
Abstract: The prediction of human eye fixations has recently been gaining a lot of attention thanks to the improvements shown by deep architectures. In our work, we go beyond classical feed-forward networks for predicting saliency maps and propose a Saliency Attentive Model that incorporates neural attention mechanisms to iteratively refine predictions. Experiments demonstrate that the proposed strategy surpasses the state of the art by a considerable margin on the largest dataset available for saliency prediction. Here, we provide experimental results on other popular saliency datasets to confirm the effectiveness and generalization capabilities of our model, which enable us to reach the state of the art on all considered datasets.
Citations: 18
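
To make the iterative-refinement pattern concrete, here is a deliberately simplified PyTorch sketch in which a saliency map is refined over several steps through a learned attention gate; the paper's actual attention mechanism differs, so treat this as a stand-in for the general idea only.

    import torch
    import torch.nn as nn

    class IterativeRefiner(nn.Module):
        def __init__(self, feat_ch, steps=3):
            super().__init__()
            self.steps = steps
            self.attn = nn.Conv2d(feat_ch + 1, 1, kernel_size=1)
            self.refine = nn.Conv2d(feat_ch + 1, 1, kernel_size=3, padding=1)

        def forward(self, feats):
            b, _, h, w = feats.shape
            sal = torch.zeros(b, 1, h, w, device=feats.device)
            for _ in range(self.steps):
                # Attention gate decides which features matter at this step.
                gate = torch.sigmoid(self.attn(torch.cat([feats, sal], dim=1)))
                # Refine the current saliency map from the gated features.
                sal = torch.sigmoid(self.refine(torch.cat([feats * gate, sal], dim=1)))
            return sal
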
A Comparative Study of Real-Time Semantic Segmentation for Autonomous Driving
Mennatullah Siam, M. Gamal, Moemen Abdel-Razek, S. Yogamani, Martin Jägersand, Hong Zhang
DOI: 10.1109/CVPRW.2018.00101
Abstract: Semantic segmentation is a critical module in robotics-related applications, especially autonomous driving. Most research on semantic segmentation focuses on improving accuracy, with less attention paid to computationally efficient solutions. The majority of efficient semantic segmentation algorithms have customized optimizations without scalability, and there is no systematic way to compare them. In this paper, we present a real-time segmentation benchmarking framework and study various segmentation algorithms for autonomous driving. We implemented a generic meta-architecture via a decoupled design in which different types of encoders and decoders can be plugged in independently. We provide several example encoders, including VGG16, ResNet18, MobileNet, and ShuffleNet, and decoders, including SkipNet, UNet, and Dilation Frontend. The framework is scalable to new encoders and decoders developed in the community for other vision tasks. We performed detailed experimental analysis on the Cityscapes dataset for various combinations of encoder and decoder. The modular framework enabled rapid prototyping of a custom efficient architecture that provides a ~143x reduction in GFLOPs compared to SegNet and runs in real time at ~15 fps on an NVIDIA Jetson TX2. The source code of the framework is publicly available.
Citations: 117
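
The decoupled design is easy to picture in code: any encoder that returns multi-scale features can be paired with any decoder head. A minimal sketch of the idea (class and method names are illustrative, not the framework's actual API):

    import torch.nn as nn

    class MetaSegmenter(nn.Module):
        """Plug-and-play encoder/decoder pairing, e.g. MobileNet + SkipNet."""

        def __init__(self, encoder: nn.Module, decoder: nn.Module):
            super().__init__()
            self.encoder = encoder  # e.g. VGG16 / ResNet18 / MobileNet / ShuffleNet
            self.decoder = decoder  # e.g. SkipNet / UNet / Dilation Frontend

        def forward(self, x):
            feats = self.encoder(x)     # multi-scale feature maps, coarse to fine
            return self.decoder(feats)  # per-pixel class scores
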
Building Detection from Satellite Imagery Using a Composite Loss Function
Sergey Golovanov, R. Kurbanov, A. Artamonov, A. Davydow, S. Nikolenko
DOI: 10.1109/CVPRW.2018.00040
Abstract: In this paper, we present a LinkNet-based architecture with an SE-ResNeXt-50 encoder and a novel training strategy that relies strongly on image preprocessing and on incorporating distorted network outputs. The architecture combines a pre-trained convolutional encoder with a symmetric expanding path that enables precise localization. We show that such a network can be trained on plain RGB images with a composite loss function and achieves competitive results on the DeepGlobe challenge on building extraction from satellite images.
Citations: 19
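
The abstract does not spell out the loss terms. A common composite objective in building-footprint segmentation sums binary cross-entropy with a soft Dice term; the sketch below illustrates that pattern under stated assumptions rather than reproducing the authors' exact loss.

    import torch
    import torch.nn.functional as F

    def composite_loss(logits, target, alpha=0.5, eps=1e-6):
        # Pixel-wise binary cross-entropy on raw logits.
        bce = F.binary_cross_entropy_with_logits(logits, target)
        # Soft Dice rewards overlap between predicted mask and ground truth.
        prob = torch.sigmoid(logits)
        inter = (prob * target).sum()
        dice = 1.0 - (2.0 * inter + eps) / (prob.sum() + target.sum() + eps)
        return alpha * bce + (1.0 - alpha) * dice
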
Video Based Measurement of Heart Rate and Heart Rate Variability Spectrogram from Estimated Hemoglobin Information
Munenori Fukunishi, Kouki Kurita, Shoji Yamamoto, N. Tsumura
DOI: 10.1109/CVPRW.2018.00180
Abstract: In this paper, we propose accurate remote observation of heart rate (HR) and heart rate variability (HRV) based on hemoglobin information extracted using a detailed skin optics model. We perform experiments measuring subjects at rest and under cognitive stress, placing a polarizing filter in front of the camera to evaluate the principle of the framework. The results show that the proposed method correlates strongly with electrocardiograph (ECG) readings, which are taken as ground truth. We also evaluated robustness against illumination change in simulation. We confirmed that the proposed method obtains more accurate BVP (blood volume pulse) detection than other conventional methods, since it eliminates the shading component in the process of extracting the hemoglobin component.
Citations: 7
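
The final step of such pipelines, recovering HR from an extracted pulse signal, is standard enough to sketch: locate the dominant spectral peak within the physiological band. The hemoglobin extraction itself, the core of the paper, is not shown here.

    import numpy as np

    def heart_rate_bpm(pulse, fps, lo=0.7, hi=4.0):
        pulse = pulse - pulse.mean()                    # remove DC component
        freqs = np.fft.rfftfreq(len(pulse), d=1.0 / fps)
        power = np.abs(np.fft.rfft(pulse)) ** 2
        band = (freqs >= lo) & (freqs <= hi)            # ~42-240 beats per minute
        peak = freqs[band][np.argmax(power[band])]
        return peak * 60.0
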
Deep Super Resolution for Recovering Physiological Information from Videos
Daniel J. McDuff
DOI: 10.1109/CVPRW.2018.00185
Abstract: Imaging photoplethysmography (iPPG) allows remote measurement of vital signs from the human skin. In some applications the skin region of interest may occupy only a small number of pixels (e.g., if an individual is a large distance from the imager). We present a novel pipeline for iPPG using an image super-resolution preprocessing step that can reduce the mean absolute error in heart rate prediction by over 30%. Furthermore, deep-learning-based image super-resolution outperforms standard interpolation methods. Our method can be used in conjunction with any existing iPPG algorithm to estimate physiological parameters. It is particularly promising for the analysis of low-resolution and spatially compressed videos, where the pulse signal would otherwise be too weak.
Citations: 38
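
The pipeline shape described above is simple to state: super-resolve each frame of the skin region, then hand the upscaled frames to any existing iPPG algorithm. In this sketch, sr_model and ippg_algorithm are hypothetical placeholders for a trained super-resolution network and a pulse-extraction routine.

    def ippg_with_super_resolution(frames, sr_model, ippg_algorithm):
        # Super-resolve each low-resolution skin ROI before pulse extraction.
        upscaled = [sr_model(frame) for frame in frames]
        return ippg_algorithm(upscaled)  # e.g. returns a pulse signal / HR estimate
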
Persistent Memory Residual Network for Single Image Super Resolution
Rongzhen Chen, Yanyun Qu, Kun Zeng, Jinkang Guo, Cuihua Li, Yuan Xie
DOI: 10.1109/CVPRW.2018.00125
Abstract: Progress has been witnessed in single-image super-resolution where the low-resolution images are simulated by bicubic downsampling. However, for complex image degradations in the wild, such as downsampling, blurring, noise, and geometric deformation, existing super-resolution methods do not work well. Inspired by a persistent memory network that has proven effective in image restoration, we implement the core idea of human memory in a deep residual convolutional neural network. Two types of memory blocks are designed for the NTIRE2018 challenge. We embed them in the framework of the enhanced deep super-resolution network (EDSR), the NTIRE2017 champion method, replacing its residual blocks. The first type of memory block is a residual module: one memory block contains four residual modules with four residual blocks, followed by a gate unit that adaptively selects the features to store. The second type is a residual dilated convolutional block, which contains seven dilated convolution layers linked to a gate unit. The two proposed models not only improve super-resolution performance but also mitigate image degradation from noise and blurring. Experimental results on the DIV2K dataset demonstrate that our models achieve better performance than EDSR.
Citations: 15
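
A hedged PyTorch sketch of the gated-memory idea: several residual blocks whose intermediate outputs are concatenated and fused by a 1x1 "gate" convolution that learns which features to keep. The exact block counts and gate design in the paper may differ.

    import torch
    import torch.nn as nn

    class ResidualBlock(nn.Module):
        def __init__(self, ch):
            super().__init__()
            self.body = nn.Sequential(
                nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(ch, ch, 3, padding=1))

        def forward(self, x):
            return x + self.body(x)

    class MemoryBlock(nn.Module):
        def __init__(self, ch, n_units=4):
            super().__init__()
            self.units = nn.ModuleList([ResidualBlock(ch) for _ in range(n_units)])
            # Gate: 1x1 conv adaptively weights which unit outputs to store.
            self.gate = nn.Conv2d(ch * n_units, ch, kernel_size=1)

        def forward(self, x):
            outs = []
            for unit in self.units:
                x = unit(x)
                outs.append(x)
            return self.gate(torch.cat(outs, dim=1))
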