2023 18th International Conference on Machine Vision and Applications (MVA): Latest Publications

Padding Investigations for CNNs in Scene Parsing Tasks
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10216084
Yu-Hui Huang, M. Proesmans, L. Gool
{"title":"Padding Investigations for CNNs in Scene Parsing Tasks","authors":"Yu-Hui Huang, M. Proesmans, L. Gool","doi":"10.23919/MVA57639.2023.10216084","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10216084","url":null,"abstract":"Zero padding is widely used in convolutional neural networks (CNNs) to prevent the size of feature maps diminishing too fast. However, it has been claimed to disturb the statistics at the border [9]. In this work, we compare various padding methods for the scene parsing task and propose an alternative padding method (CApadding) by extending the image to alleviate the border issue. Experiments on Cityspaces [2] and Deep-Globe [3] show that models with the proposed padding method achieves higher mean Intersection-Over-Union (IoU) than the zero padding based models.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123120411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
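The padding comparison the abstract describes can be prototyped with the padding modes built into standard deep learning frameworks. Below is a minimal PyTorch sketch, not the authors' code, that contrasts how different `padding_mode` choices change the border statistics of a convolution output; the CApadding method itself is not reproduced here.

```python
# Minimal sketch (not the paper's code): comparing built-in padding modes in
# PyTorch for a single conv layer, and how each affects border statistics.
import torch
import torch.nn as nn

x = torch.randn(1, 3, 64, 64)  # dummy image batch

for mode in ["zeros", "reflect", "replicate", "circular"]:
    conv = nn.Conv2d(3, 8, kernel_size=3, padding=1, padding_mode=mode)
    with torch.no_grad():
        y = conv(x)
    # Compare border statistics against the interior to see the padding effect.
    border_mean = y[..., 0, :].mean().item()
    interior_mean = y[..., 1:-1, 1:-1].mean().item()
    print(f"{mode:10s} border mean={border_mean:+.4f} interior mean={interior_mean:+.4f}")
```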
Using Unconditional Diffusion Models in Level Generation for Super Mario Bros
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215856
Hyeon Joon Lee, E. Simo-Serra
{"title":"Using Unconditional Diffusion Models in Level Generation for Super Mario Bros","authors":"Hyeon Joon Lee, E. Simo-Serra","doi":"10.23919/MVA57639.2023.10215856","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215856","url":null,"abstract":"This study introduces a novel methodology for generating levels in the iconic video game Super Mario Bros. using a diffusion model based on a UNet architecture. The model is trained on existing levels, represented as a categorical distribution, to accurately capture the game’s fundamental mechanics and design principles. The proposed approach demonstrates notable success in producing high-quality and diverse levels, with a significant proportion being playable by an artificial agent. This research emphasizes the potential of diffusion models as an efficient tool for procedural content generation and highlights their potential impact on the development of new video games and the enhancement of existing games through generated content.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129835064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
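As a rough illustration of the level representation the abstract mentions, the sketch below shows how a tile-based level can be encoded as a categorical grid in the channel dimension, which is the input format a UNet-style denoiser would consume. The tile vocabulary size and level dimensions are invented for illustration and do not come from the paper.

```python
# Minimal sketch (assumptions, not the paper's implementation): one-hot encoding
# a tile-based level into channels, and decoding model output back to tiles.
import torch
import torch.nn.functional as F

NUM_TILES = 10            # hypothetical tile vocabulary size (air, ground, pipe, ...)
H, W = 14, 28             # hypothetical level height and width in tiles

level_ids = torch.randint(0, NUM_TILES, (1, H, W))          # (B, H, W) integer tiles
level_onehot = F.one_hot(level_ids, NUM_TILES).float()      # (B, H, W, C)
level_onehot = level_onehot.permute(0, 3, 1, 2)             # (B, C, H, W) for a UNet

# After reverse diffusion, continuous outputs are mapped back to tiles by argmax.
denoised = level_onehot + 0.1 * torch.randn_like(level_onehot)   # stand-in for model output
decoded_ids = denoised.argmax(dim=1)                              # (B, H, W)
print(decoded_ids.shape)
```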
Enhancing Retail Product Recognition: Fine-Grained Bottle Size Classification
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215699
Katarina Tolja, M. Subašić, Z. Kalafatić, S. Lončarić
{"title":"Enhancing Retail Product Recognition: Fine-Grained Bottle Size Classification","authors":"Katarina Tolja, M. Subašić, Z. Kalafatić, S. Lončarić","doi":"10.23919/MVA57639.2023.10215699","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215699","url":null,"abstract":"In this paper, we propose two innovative approaches to tackle the key challenges in product size classification, with a specific focus on bottles. Our research is particularly interesting as we leverage the bottle cap as a reference object, which allows bottle size classification to overcome challenges in the distance between the capturing device and the retail shelf, viewing angle, and arrangement of bottles on the shelves. We showcase the usage of the reference object in explicit and implicit novel approaches and discuss the benefits and limitations of the proposed methods.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130027986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
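One way to read the "explicit" use of the reference object is to classify size from the ratio between the detected bottle height and the cap height, which is largely invariant to camera distance. The sketch below is our own illustrative guess at this idea; the box format and thresholds are assumptions, not values from the paper.

```python
# Minimal sketch: coarse bottle-size label from the bottle/cap height ratio.
# Thresholds and box format are illustrative assumptions.
def size_from_ratio(bottle_box, cap_box, thresholds=(4.0, 6.0)):
    """Boxes are (x1, y1, x2, y2) in pixels; returns a coarse size label."""
    bottle_h = bottle_box[3] - bottle_box[1]
    cap_h = cap_box[3] - cap_box[1]
    ratio = bottle_h / max(cap_h, 1e-6)
    if ratio < thresholds[0]:
        return "small"
    elif ratio < thresholds[1]:
        return "medium"
    return "large"

print(size_from_ratio((100, 50, 160, 470), (115, 50, 145, 110)))  # ratio = 7.0 -> "large"
```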
Image Impression Estimation by Clustering People with Similar Tastes
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10216055
Banri Kojima, Takahiro Komamizu, Yasutomo Kawanishi, Keisuke Doman, I. Ide
{"title":"Image Impression Estimation by Clustering People with Similar Tastes","authors":"Banri Kojima, Takahiro Komamizu, Yasutomo Kawanishi, Keisuke Doman, I. Ide","doi":"10.23919/MVA57639.2023.10216055","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10216055","url":null,"abstract":"This paper proposes a method for estimating impressions from images according to the personal attributes of users so that they can find the desired images based on their tastes. Our previous work, which considered gender and age as personal attributes, showed promising results, but it also showed that users sharing these attributes do not necessarily share similar tastes. Therefore, other attributes should be considered to capture the personal tastes of each user well. However, taking more attributes into account leads to a problem in which insufficient amounts of data are served to classifiers due to the explosion of the number of combinations of attributes. To tackle this problem, we propose an aggregation-based method to condense training data for impression estimation while considering personal attribute information. For evaluation, a dataset of 4,000 carpet images annotated with 24 impression words was prepared. Experimental results showed that the use of combinations of personal attributes improved the accuracy of impression estimation, which indicates the effectiveness of the proposed approach.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122015977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
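A hedged sketch of the aggregation idea: group users with similar tastes (here, naive k-means on their rating vectors, purely as a stand-in) and condense their annotations into one training target per cluster. The data shapes and clustering choice are assumptions for illustration, not the authors' pipeline.

```python
# Minimal sketch: cluster users by their impression ratings and aggregate the
# labels per cluster to condense the training data. Shapes are invented.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
num_users, num_images = 50, 40
ratings = rng.random((num_users, num_images))        # per-user impression scores

clusters = KMeans(n_clusters=5, n_init=10, random_state=0).fit_predict(ratings)

# Aggregate labels inside each cluster: the condensed training target per image.
condensed = np.stack([ratings[clusters == c].mean(axis=0) for c in range(5)])
print(condensed.shape)   # (5, 40): one averaged impression vector per taste cluster
```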
Low-Level Feature Aggregation Networks for Disease Severity Estimation of Coffee Leaves
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215626
Takuhiro Okada, Yuantian Huang, Guoqing Hao, S. Iizuka, K. Fukui
{"title":"Low-Level Feature Aggregation Networks for Disease Severity Estimation of Coffee Leaves","authors":"Takuhiro Okada, Yuantian Huang, Guoqing Hao, S. Iizuka, K. Fukui","doi":"10.23919/MVA57639.2023.10215626","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215626","url":null,"abstract":"This paper presents a deep learning-based approach for the severity classification of coffee leaf diseases. Coffee leaf diseases are one of the significant problems in the coffee industry, where estimating the health status of coffee leaves based on their appearance is crucial in the production process. However, there have been few studies on this task, and cases of misclassification have been reported due to the inability to detect slight color differences when classifying the disease severity. In this work, we propose a low-level feature aggregation technique for neural network-based classifiers to capture the discolored distribution of the entire coffee leaf, which effectively supports discrimination of the severity. This feature aggregation is achieved by incorporating attention mechanisms in the shallow layers of the network that extract low-level features such as color. The attention mechanism in the shallow layers provides the network with information on global dependencies of the color features of the leaves, allowing the network to more easily identify the disease severity. We use an efficient computational technique for the attention modules to reduce memory and computational cost, which enables us to introduce the attention mechanisms in large-sized feature maps in the shallow layers. We conduct in-depth validation experiments on the coffee leaf disease datasets and demonstrate the effectiveness of our proposed model compared to state-of-the-art image classification models in accurately classifying the severity of coffee leaf diseases.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126971515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
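To make the "attention in shallow layers" idea concrete, the sketch below adds a lightweight global-context (squeeze-and-excitation style) attention to a high-resolution shallow feature map. This is a generic stand-in chosen for cheapness on large feature maps, not the paper's efficient attention module.

```python
# Minimal sketch (our own stand-in, not the paper's module): channel attention
# driven by a globally pooled descriptor, applied to a shallow feature map so
# the network can relate color-like low-level features across the whole leaf.
import torch
import torch.nn as nn

class ShallowGlobalAttention(nn.Module):
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)          # global color/intensity summary
        self.fc = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        # Reweight each channel by its global descriptor; cheap even on large
        # shallow feature maps because attention is per-channel, not per-pixel.
        return x * self.fc(self.pool(x))

feat = torch.randn(2, 32, 128, 128)                  # shallow-layer feature map
print(ShallowGlobalAttention(32)(feat).shape)
```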
Small Object Detection for Birds with Swin Transformer
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10216093
Da Huo, Marc A. Kastner, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, I. Ide
{"title":"Small Object Detection for Birds with Swin Transformer","authors":"Da Huo, Marc A. Kastner, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, I. Ide","doi":"10.23919/MVA57639.2023.10216093","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10216093","url":null,"abstract":"Object detection is the task of detecting objects in an image. In this task, the detection of small objects is particularly difficult. Other than the small size, it is also accompanied by difficulties due to blur, occlusion, and so on. Current small object detection methods are tailored to small and dense situations, such as pedestrians in a crowd or far objects in remote sensing scenarios. However, when the target object is small and sparse, there is a lack of objects available for training, making it more difficult to learn effective features. In this paper, we propose a specialized method for detecting a specific category of small objects; birds. Particularly, we improve the features learned by the neck; the sub-network between the backbone and the prediction head, to learn more effective features with a hierarchical design. We employ Swin Transformer to upsample the image features. Moreover, we change the shifted window size for adapting to small objects. Experiments show that the proposed Swin Transformer-based neck combined with CenterNet can lead to good performance by changing the window sizes. We further find that smaller window sizes (default 2) benefit mAPs for small object detection.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114070609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
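The effect of the window size can be seen directly from Swin-style window partitioning: with a window of 2, each attention block only mixes a 2x2 neighbourhood of the feature map. The sketch below is generic partitioning code for illustration, not the paper's neck or an existing library implementation.

```python
# Minimal sketch: partition a feature map into non-overlapping windows, the
# unit over which Swin-style attention operates. Smaller windows -> finer,
# more local attention, which the abstract reports helps small objects.
import torch

def window_partition(x, window_size):
    """x: (B, H, W, C) -> (num_windows*B, window_size*window_size, C)."""
    B, H, W, C = x.shape
    x = x.view(B, H // window_size, window_size, W // window_size, window_size, C)
    windows = x.permute(0, 1, 3, 2, 4, 5).reshape(-1, window_size * window_size, C)
    return windows

feat = torch.randn(1, 32, 32, 96)          # hypothetical neck feature map (B, H, W, C)
for ws in (2, 4, 8):
    print(ws, window_partition(feat, ws).shape)
```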
Grid Sample Based Temporal Iteration and Compactness-coefficient Distance for High Frame and Ultra-low Delay SLIC Segmentation System
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215797
Yuan Li, Tingting Hu, Ryuji Fuchikami, T. Ikenaga
{"title":"Grid Sample Based Temporal Iteration and Compactness-coefficient Distance for High Frame and Ultra-low Delay SLIC Segmentation System","authors":"Yuan Li, Tingting Hu, Ryuji Fuchikami, T. Ikenaga","doi":"10.23919/MVA57639.2023.10215797","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215797","url":null,"abstract":"High frame rate and ultra-low delay vision systems, which process 1000 FPS videos within 1 ms/frame delay, play an increasingly important role in fields such as robotics and factory automation. Among them, an image segmentation system is necessary as segmentation is a crucial pre-processing step for various applications. Recently many existing researches focus on superpixel segmentation, but few of them attempt to reach high processing speed. To achieve this target, this paper proposes: (A) Grid sample based temporal iteration, which leverages the high frame rate video property to distribute iterations into the temporal domain, ensuring the entire system is within one frame delay. Additionally, grid sample is proposed to add initialization information to temporal iteration for the stability of superpixels. (B) Compactness-coefficient distance is proposed to add information of the entire superpixel instead of only using the information of the center point. The evaluation results demonstrate that the proposed superpixel segmentation system achieves boundary recall and under-segmentation error comparable to the original SLIC superpixel segmentation system. For label consistency, the proposed system is more than 0.02 higher than the original system.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124104848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
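For context, the sketch below shows the baseline SLIC assignment distance that the proposed compactness-coefficient distance extends: colour distance combined with spatial distance weighted by a compactness term m and grid interval S. The paper's whole-superpixel variant is not reproduced here.

```python
# Minimal sketch of the standard SLIC pixel-to-center distance (not the paper's
# proposed distance): D = sqrt(d_c^2 + (d_s / S)^2 * m^2).
import numpy as np

def slic_distance(pixel_lab, pixel_xy, center_lab, center_xy, m=10.0, S=16.0):
    d_c = np.linalg.norm(np.asarray(pixel_lab) - np.asarray(center_lab))   # colour (CIELAB)
    d_s = np.linalg.norm(np.asarray(pixel_xy) - np.asarray(center_xy))     # spatial
    return np.sqrt(d_c**2 + (d_s / S)**2 * m**2)

print(slic_distance((50, 10, 10), (34, 40), (52, 12, 9), (32, 32), m=10.0, S=16.0))
```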
Unsupervised Fall Detection on Edge Devices
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215993
Takuya Nakabayashi, H. Saito
{"title":"Unsupervised Fall Detection on Edge Devices","authors":"Takuya Nakabayashi, H. Saito","doi":"10.23919/MVA57639.2023.10215993","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215993","url":null,"abstract":"Automatic fall detection is a crucial task in healthcare as falls pose a significant risk to the health of elderly individuals. This paper presents a lightweight acceleration-based fall detection method that can be implemented on edge devices. The proposed method uses Autoencoders, a type of unsupervised learning, within the framework of anomaly detection, allowing for network training without requiring extensive labeled fall data. One of the challenges in fall detection is the difficulty in collecting fall data. However, our proposed method can overcome this limitation by training the neural network without fall data, using the anomaly detection framework of Autoencoders. Additionally, this method employs an extremely lightweight Autoencoder that can run independently on an edge device, eliminating the need to transmit data to a server and minimizing privacy concerns. We conducted experiments comparing the performance of our proposed method with that of a baseline method using a unique fall detection dataset. Our results confirm that our method outperforms the baseline method in detecting falls with higher accuracy.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121579551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
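The general recipe described here, train on normal data only and flag high reconstruction error, can be sketched with a tiny autoencoder over flattened accelerometer windows. Window length, layer sizes, and the threshold rule below are illustrative assumptions, not the paper's configuration.

```python
# Minimal sketch: train a lightweight autoencoder on normal activity windows and
# flag a window as a suspected fall when its reconstruction error is high.
import torch
import torch.nn as nn

WINDOW = 50 * 3                      # e.g. 50 samples x 3 axes, flattened

model = nn.Sequential(               # deliberately small for edge devices
    nn.Linear(WINDOW, 32), nn.ReLU(),
    nn.Linear(32, 8), nn.ReLU(),     # bottleneck
    nn.Linear(8, 32), nn.ReLU(),
    nn.Linear(32, WINDOW),
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

normal = torch.randn(256, WINDOW)    # stand-in for normal (non-fall) windows
for _ in range(20):                  # short training loop on normal data only
    recon = model(normal)
    loss = nn.functional.mse_loss(recon, normal)
    opt.zero_grad(); loss.backward(); opt.step()

threshold = loss.item() * 3.0        # crude threshold derived from training error
test = torch.randn(1, WINDOW) * 5.0  # an abnormal-looking window
err = nn.functional.mse_loss(model(test), test).item()
print("fall suspected" if err > threshold else "normal")
```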
Weakly-Supervised Deep Image Hashing based on Cross-Modal Transformer
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10216160
Ching-Ching Yang, W. Chu, S. Dubey
{"title":"Weakly-Supervised Deep Image Hashing based on Cross-Modal Transformer","authors":"Ching-Ching Yang, W. Chu, S. Dubey","doi":"10.23919/MVA57639.2023.10216160","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10216160","url":null,"abstract":"Weakly-supervised image hashing emerges recently because web images associated with contextual text or tags are abundant. Text information weakly-related to images can be utilized to guide the learning of a deep hashing network. In this paper, we propose Weakly-supervised deep Hashing based on Cross-Modal Transformer (WHCMT). First, cross-scale attention between image patches is discovered to form more effective visual representations. A baseline transformer is also adopted to find self-attention of tags and form tag representations. Second, the cross-modal attention between images and tags is discovered by the proposed cross-modal transformer. Effective hash codes are then generated by embedding layers. WHCMT is tested on semantic image retrieval, and we show new state-of-the-art results can be obtained for the MIRFLICKR-25K dataset and NUS-WIDE dataset.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129882408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
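The hash-code generation step common to deep hashing methods, including the embedding layers mentioned here, can be sketched as a tanh head during training and sign binarisation at retrieval time. The cross-modal transformer that would produce the fused embeddings is not reproduced; the dimensions below are placeholders.

```python
# Minimal sketch: continuous embeddings -> soft codes via tanh (differentiable
# for training) -> binary codes via sign (used for retrieval).
import torch
import torch.nn as nn

HASH_BITS = 64
embed_dim = 256

hash_head = nn.Sequential(nn.Linear(embed_dim, HASH_BITS), nn.Tanh())

fused_embedding = torch.randn(4, embed_dim)      # stand-in for image/tag features
soft_codes = hash_head(fused_embedding)          # in (-1, 1), differentiable
binary_codes = torch.sign(soft_codes)            # {-1, +1} codes used for retrieval

# Hamming-like distance via inner product of binary codes.
dist = 0.5 * (HASH_BITS - binary_codes @ binary_codes.t())
print(binary_codes.shape, dist.shape)
```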
ViTVO: Vision Transformer based Visual Odometry with Attention Supervision
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215538
Chu-Chi Chiu, Hsuan-Kung Yang, Hao-Wei Chen, Yu-Wen Chen, Chun-Yi Lee
{"title":"ViTVO: Vision Transformer based Visual Odometry with Attention Supervision","authors":"Chu-Chi Chiu, Hsuan-Kung Yang, Hao-Wei Chen, Yu-Wen Chen, Chun-Yi Lee","doi":"10.23919/MVA57639.2023.10215538","DOIUrl":"https://doi.org/10.23919/MVA57639.2023.10215538","url":null,"abstract":"In this paper, we develop a Vision Transformer based visual odometry (VO), called ViTVO. ViTVO introduces an attention mechanism to perform visual odometry. Due to the nature of VO, Transformer based VO models tend to overconcentrate on few points, which may result in a degradation of accuracy. In addition, noises from dynamic objects usually cause difficulties in performing VO tasks. To overcome these issues, we propose an attention loss during training, which utilizes ground truth masks or self supervision to guide the attention maps to focus more on static regions of an image. In our experiments, we demonstrate the superior performance of ViTVO on the Sintel validation set, and validate the effectiveness of our attention supervision mechanism in performing VO tasks.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127923758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
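One plausible form of the attention supervision described in the abstract is to penalise attention mass that falls on dynamic-object pixels given a mask. The loss below is our assumption of such a formulation, not necessarily the paper's exact loss.

```python
# Minimal sketch: penalise the fraction of (normalised) attention that lands on
# dynamic-object regions indicated by a ground-truth or self-supervised mask.
import torch

def attention_supervision_loss(attn_map, dynamic_mask, eps=1e-6):
    """attn_map: (B, H, W) non-negative attention; dynamic_mask: (B, H, W) in {0, 1},
    1 where the pixel belongs to a dynamic object."""
    attn = attn_map / (attn_map.sum(dim=(1, 2), keepdim=True) + eps)  # normalise per image
    return (attn * dynamic_mask).sum(dim=(1, 2)).mean()               # mass on dynamic regions

attn = torch.rand(2, 24, 24)
mask = (torch.rand(2, 24, 24) > 0.8).float()
print(attention_supervision_loss(attn, mask))
```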