{"title":"Preconditioned Diffusion Multitask Clustering Graph Filters","authors":"Ying-Shin Lai, F. Chen, Tiantian Wang","doi":"10.1145/3512388.3512438","DOIUrl":"https://doi.org/10.1145/3512388.3512438","url":null,"abstract":"In this work, we are interested in the design of node-variant FIR graph filters, in which the graph filter estimates the filter coefficients from the stream data. Considering the estimation of filter coefficients as a task, we introduce concept of the multitask into graph filters. The filter coefficients can be divided into different clusters, and the cooperation between clusters is beneficial. Then, a multitask graph diffusion LMS algorithm is proposed. In order to improve convergence speed and performance, a multitask graph diffusion preconditioned algorithm is proposed. The simulation results verify the feasibility of algorithms.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117130683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RPViT: Vision Transformer Based on Region Proposal","authors":"Jing Ge, Qianxiang Wang, Jiahui Tong, Guangyu Gao","doi":"10.1145/3512388.3512421","DOIUrl":"https://doi.org/10.1145/3512388.3512421","url":null,"abstract":"Vision Transformers constantly absorb the characteristics of convolutional neural networks to solve its shortcomings in translational invariance and scale invariance. However, dividing the image by a simple grid often destroys the position and scale features in the image at the beginning of the network. In this paper, we propose a vision transformer based on region proposal, which obtains the inductive bias in a simple way. Specifically, RPViT achieves locality and scale-invariance by extracting regions with locality using a traditional region proposal algorithm and deflating objects of different scales to the same scale by a bilinear interpolation algorithm. In addition, to enable the network to fully utilize and encode diverse candidate objects, a multi-class token approach based on orthogonalization is proposed and applied. Experiments on ImageNet demonstrate that RPViT outperforms baseline converters and related work.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"414 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122793735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Marking based on Deep Learning","authors":"Yunxiang Liu, Jianlin Zhu, Xinxin Yuan, Chunya Wang","doi":"10.1145/3512388.3512410","DOIUrl":"https://doi.org/10.1145/3512388.3512410","url":null,"abstract":"In order to solve the problem of a lot of time and energy being wasted when the marking teacher corrects the test papers, this paper proposes an automated test paper correction system based on deep learning. The system is roughly divided into three modules: text extraction from test papers, text encoding and text matching. Using the method of combining DB and CRNN to extract text from the test paper, it has a high accuracy of text recognition; and respectively uses the BERT pre-training model and cosine similarity as the method of text encoding and text matching. The experimental results prove that the automated test paper The average result of the correction system and the scoring teacher's score is only 0.5, which achieves an excellent evaluation effect.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130618912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Noise-robust Feature Fusion Model Combining Non-local Attention for Material Recognition","authors":"Chuanbo Zhou, Guoan Yang, Zhengzhi Lu, Deyang Liu, Yong Yang","doi":"10.1145/3512388.3512450","DOIUrl":"https://doi.org/10.1145/3512388.3512450","url":null,"abstract":"Material recognition, as an important task of computer vision, is hugely challenging, due to large intra-class variances and small inter-class variances between material images. To address those recognition problems, multi-scale feature fusion methods based on deep convolutional neural networks are presented, which has been widely studied in recent years. However, the past research works paid too much attention to the local features of the image, while ignoring the non-local features that are also crucial for fine image recognition tasks such as material recognition. In this paper, Non-local Attentional Feature Fusion Network (NLA-FFNet) is proposed that combines local and non-local feature of images to improve the feature representation capability. Firstly, we utilize the pre-trained deep convolutional neural network to extract the image feature. Secondly, a Multilayer Non-local Attention (MNLA) block is designed to generate a non-local attention map which represents the long-range dependencies between features of different positions. Therefore, it can achieve stronger noise-robustness of model and better ability to represent fine features. Finally, combined our Multilayer Non-local Attention block with bilinear pooling which has been proved to be effective for feature fusion, we propose a deep neural network framework, NLA-FFNet, with noise-robust multi-layer feature fusion. 
Experiment prove that our model can achieve a competitive classification accuracy in material image recognition, and has stronger noise-robustness at the same time.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128085000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visualization of Plant Leaf Classification Process Based on Multi-Layer Network Model","authors":"Ziyi Wang, Hongjun Li","doi":"10.1145/3512388.3512430","DOIUrl":"https://doi.org/10.1145/3512388.3512430","url":null,"abstract":"Tree recognition is a ubiquitous topic in the field of artificial intelligence, and the identification of leaf types is one of the important ways in analyzing tree species. Based on the focalized attention mechanism and feature fusion strategy, we in this paper establish a multi-layer network model classifier: Leaf-AMNet to classify the Liriodendron leaves and Ginkgo leaves. In order to explore the \"black box\" problem of deep learning, we visualize the classification process of Leaf-AMNet. Experimental results show that Leaf-AMNet achieves high accuracy on Leaf Dataset and shows potential information via visual effects.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"25 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127247124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MAU-Net: A Multiscale Attention Encoder-decoder Network for Liver and Liver-tumor Segmentation","authors":"Le Liu, Jian Su, HuLin Liu, Weiqiang Zhao, Xiaogang Du, Tao Lei","doi":"10.1145/3512388.3512418","DOIUrl":"https://doi.org/10.1145/3512388.3512418","url":null,"abstract":"U-Net and improved U-Nets suffer from two problems for liver and liver-tumor segmentation. The first is that skip connections in encoder-decoder networks bring interference information. The second is that the convolutional kernel with the fixed receptive field does not match the liver-tumor with changing shape and position. To address the above problems, we propose a multiscale attention encoder-decoder network (MAU-Net) for liver and liver-tumor segmentation. First, MAU-Net employs self-attentive gating guidance module in the skip connection to suppresses irrelevant regions. Secondly, MAU-Net employs a multi-branch feature fusion module to extract multiscale features for the segmentation of liver-tumor. We evaluate the proposed method on the public LiTS dataset. The experimental results show that the average dice of liver and liver-tumor segmentation by MAU-Net are 96.11% and 86.90%, respectively. Experiments demonstrate that MAU-Net is superior to state-of-the-art networks for liver and liver-tumor segmentation.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129948289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SHTVS: Shot-level based Hierarchical Transformer for Video Summarization","authors":"Yubo An, Shenghui Zhao","doi":"10.1145/3512388.3512427","DOIUrl":"https://doi.org/10.1145/3512388.3512427","url":null,"abstract":"In this paper, a Shot-level based Hierarchical Transformer for Video Summarization (SHTVS) is proposed for supervised video summarization. Different from most existing methods that employ bidirectional long short-term memory or use self-attention to replace certain components while keeping their overall structure in place, our methods show that a pure Transformer with video feature sequences as its input can achieve competitive performance in video summarization. In addition, to make better use of the multi-shot characteristic in a video, each video feature sequence is firstly split into shot-level feature sequences with kernel temporal segmentation, and then fed into shot-level Transformer encoder to learn shot-level representations. Finally, shot-level representations and original video feature sequence are integrated for the frame-level Transformer encoder to predict frame-level importance scores. Extensive experimental results on two benchmark datasets (SumMe and TVSum) prove the effectiveness of our methods.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125309255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SpikeFormer: Image Reconstruction from the Sequence of Spike Camera Based on Transformer","authors":"Chen She, Laiyun Qing","doi":"10.1145/3512388.3512399","DOIUrl":"https://doi.org/10.1145/3512388.3512399","url":null,"abstract":"The recently invented retina-inspired spike camera produces asynchronous binary spike streams to record the dynamic light intensity variation process. This paper develops a novel image reconstruction method, called SpikeFormer, which reconstructs the dynamic scene from binary spike streams in a supervised learning strategy. We construct the training dataset which composes of spike streams and corresponding ground truth images by simulating the working mechanism of spike camera. Spike noises are also taken into consideration in the simulator. Firstly, the input spike stream is encoded as an enlarged binary image by interlacing temporal and spatial information. Then the binary image is inputted to the SpikeFormer to recover the dynamic scene. SpikeFormer adopts Transformer architecture which includes an encoder and a decoder. In particular, we propose a hierarchical architecture encoder to exploit multi-scale temporal and spatial features progressively. The decoder aggregates information from different stages to incorporate both local and global attention. Multi-task loss including reconstruction loss, perception loss, edge loss, and temporal consistency loss are combined to restrict the model. 
Extensive experimental results demonstrate that the proposed framework achieves encouraging results in details reconstruction and noise alleviation.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116739326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SU-UNet: A Novel Self-Updating Network for Hepatic Vessel Segmentation in CT Images","authors":"Yang Liu, Xukun Zhang, Haopeng Kuang, Zhongwei Yang, Shichao Yan, Peng Zhai, Lihua Zhang","doi":"10.1145/3512388.3512420","DOIUrl":"https://doi.org/10.1145/3512388.3512420","url":null,"abstract":"Hepatectomy is currently one of the most commonly used treatment methods for malignant liver tumors. It is of great significance to clinical surgery to perform accurate hepatic vessel segmentation in preoperative CT images. However, due to the complex structure of hepatic vessels and low contrast in the CT images, it is difficult for experienced doctors to perform accurate manual labeling. Based on this, the labels of the existing public datasets are noisy. In this paper, we propose a double UNet structure based on the soft-constraint method to more accurately segment the vessels from the noisy annotation dataset. First, two different Unet output different segmentation predictions. Then a Self-updating module (SUM) is designed to optimize the noisy vessel label based on segmentation predictions so that the optimized label can better guide the network training. This method can guide the network to get better segmentation predictions. Extensive experiments using a noisy public dataset demonstrate the superiority of our method.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116779492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian-based Security Distributed Estimation","authors":"Tiantian Wang, Feng Chen, Ying-Shin Lai","doi":"10.1145/3512388.3512445","DOIUrl":"https://doi.org/10.1145/3512388.3512445","url":null,"abstract":"In recent years, the distributed estimation of wireless sensor networks has been widely studied, but there are often security threats in practical applications. For example, attackers damage data information in different ways and reduce the performance of network estimation. In order to solve this problem, this paper proposes an algorithm framework of attack detection based on distributed LMS. The algorithm classifies the states of adjacent nodes, and then realizes attack detection through Bayesian criterion. An adaptive detection threshold is proposed to improve the detection performance. The reliable information of the last time is used to replace the detected lossy information and fuse to ensure the performance of the algorithm. Finally, the simulation results of several algorithms under different attack models are given to prove the effectiveness of the algorithm.","PeriodicalId":434878,"journal":{"name":"Proceedings of the 2022 5th International Conference on Image and Graphics Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130172940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}