2021 International Conference on Visual Communications and Image Processing (VCIP)最新文献_第8页

MPEG Immersive Video tools for Light Field Head Mounted Displays MPEG沉浸式视频工具的光场头戴式显示器

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675317

Daniele Bonatto, Grégoire Hirt, Alexander Kvasov, Sarah Fachada, G. Lafruit

引用次数: 2

Learning in Compressed Domain for Faster Machine Vision Tasks 基于压缩域的快速机器视觉学习

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675369

Jinming Liu, Heming Sun, J. Katto

引用次数: 7

Evaluation Of Bitrate Ladders For Versatile Video Coder 多用途视频编码器的位率阶梯评价

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675425

Reda Kaafarani, Médéric Blestel, Thomas Maugey, M. Ropert, A. Roumy

引用次数: 4

Multi-camera system for placing the viewer between the players of a live sports match: Blind Review 多摄像机系统，放置观众之间的球员之间的实况体育比赛:盲目审查

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675336

引用次数: 0

Kalman filter-based prediction refinement and quality enhancement for geometry-based point cloud compression 基于卡尔曼滤波的几何点云压缩预测改进与质量增强

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675412

Lu Wang, Jianfeng Sun, Hui Yuan, R. Hamzaoui, Xiaohui Wang

引用次数: 1

Attention-guided Convolutional Neural Network for Lightweight JPEG Compression Artifacts Removal 轻量级JPEG压缩伪影去除的注意引导卷积神经网络

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675320

Gang Zhang, Haoquan Wang, Yedong Wang, Haijie Shen

引用次数: 0

CRC-Based Multi-Error Correction of H.265 Encoded Videos in Wireless Communications 无线通信中基于crc的H.265编码视频多纠错

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675400

Vivien Boussard, S. Coulombe, F. Coudoux, P. Corlay, Anthony Trioux

引用次数: 2

Cross-Block Difference Guided Fast CU Partition for VVC Intra Coding 跨块差分引导的VVC内编码快速CU划分

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675409

Hewei Liu, Shuyuan Zhu, Ruiqin Xiong, Guanghui Liu, B. Zeng

引用次数: 4

Action Recognition Improved by Correlations and Attention of Subjects and Scene 基于主体和场景相关性和注意力的动作识别

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675340

Manh-Hung Ha, O. Chen

{"title":"Action Recognition Improved by Correlations and Attention of Subjects and Scene","authors":"Manh-Hung Ha, O. Chen","doi":"10.1109/VCIP53242.2021.9675340","DOIUrl":"https://doi.org/10.1109/VCIP53242.2021.9675340","url":null,"abstract":"Comprehensive activity understanding of multiple subjects in a video requires subject detection, action identification, and behavior interpretation as well as the interactions among subjects and background. This work develops the action recognition of subject(s) based on the correlations and interactions of the whole scene and subject(s) by using the Deep Neural Network (DNN). The proposed DNN consists of 3D Convolutional Neural Network (CNN), Spatial Attention (SA) generation layer, mapping convolutional fused-depth layer, Transformer Encoder (TE), and two fully connected layers with late fusion for final classification. Especially, the attention mechanisms in SA and TE are implemented to find out meaningful action information on spatial and temporal domains for enhancing recognition performance, respectively. The experimental results reveal that the proposed DNN shows the superior accuracies of 97.8%, 98.4% and 85.6% in the datasets of traffic police, UCF101-24 and JHMDB-21, respectively. Therefore, our DNN is an outstanding classifier for various action recognitions involving one or multiple subjects.","PeriodicalId":114062,"journal":{"name":"2021 International Conference on Visual Communications and Image Processing (VCIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131363603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Nearly Reversible Image-to-Image Translation Using Joint Inter-Frame Coding and Embedding 基于联合帧间编码和嵌入的近可逆图像到图像的转换

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI: 10.1109/VCIP53242.2021.9675370

Xinzhu Cao, Yuanzhi Yao, Nenghai Yu

引用次数: 0