2023 18th International Conference on Machine Vision and Applications (MVA): Latest Publications

MFFPN: an Anchor-Free Method for Patent Drawing Object Detection
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10216017
Yu-Hsien Chen, Chih-Yi Chiu
Abstract: A patent document may contain meaningful drawings that can be used for image retrieval. However, labeling drawing locations manually is time-consuming. Since this task is similar to object detection, object detection techniques can be employed to facilitate it. In this paper, we propose a new anchor-free object detection method for this purpose. The proposed method comprises two parts: a max filtering feature pyramid network (MFFPN) and a dilated sample selection loss (DSSL). With MFFPN, we replace the feature pyramid network and path aggregation network with 3D max pooling for multi-scale feature fusion. With DSSL, we adaptively select training samples according to the ground-truth size. Experimental results show that the proposed method achieves better performance than state-of-the-art anchor-free methods on a Taiwan patent dataset.
Citations: 0
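The abstract above describes fusing multi-scale features by max pooling across pyramid levels rather than with an FPN/PAN. The following is a minimal sketch of that idea; the nearest-neighbour upsampling, the toy feature-map sizes, and the element-wise max over levels are illustrative assumptions, not details taken from the paper.

```python
# Hypothetical sketch of multi-scale fusion via max pooling across pyramid
# levels, in the spirit of MFFPN as described in the abstract. Upsampling
# scheme and map sizes are assumptions for illustration only.

def upsample_nearest(fmap, out_h, out_w):
    """Nearest-neighbour upsampling of a 2D feature map (list of lists)."""
    in_h, in_w = len(fmap), len(fmap[0])
    return [[fmap[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
            for r in range(out_h)]

def max_fuse(pyramid):
    """Resize every pyramid level to the finest resolution, then take the
    element-wise max across levels -- i.e. max pooling along the 'level'
    (depth) axis of the stacked 3D volume."""
    out_h, out_w = len(pyramid[0]), len(pyramid[0][0])
    resized = [upsample_nearest(f, out_h, out_w) for f in pyramid]
    return [[max(f[r][c] for f in resized) for c in range(out_w)]
            for r in range(out_h)]

# Two toy levels: a fine 4x4 map and a coarse 2x2 map.
fine = [[1, 2, 3, 4],
        [5, 6, 7, 8],
        [9, 10, 11, 12],
        [13, 14, 15, 16]]
coarse = [[20, 0],
          [0, 0]]
fused = max_fuse([fine, coarse])
print(fused[0][0])  # -> 20: top-left takes the coarse level's larger activation
```

The max across levels lets whichever scale responds most strongly dominate the fused map, which is one plausible reading of why a pooling-based fusion can stand in for the learned lateral connections of an FPN.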
ASD-EVNet: An Ensemble Vision Network based on Facial Expression for Autism Spectrum Disorder Recognition
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215688
Assil Jaby, Md Baharul Islam, Md Atiqur Rahman Ahad
Abstract: Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder that affects individuals' social interaction, communication, and behavior. Early diagnosis and intervention are critical for the well-being and development of children with ASD. Available methods for diagnosing ASD are either of limited accuracy or require significant time and resources. We aim to enhance the precision of ASD diagnosis by utilizing facial expressions, a readily accessible and minimally time-consuming signal. This paper presents the ASD Ensemble Vision Network (ASD-EVNet) for recognizing ASD based on facial expressions. The model utilizes three Vision Transformer (ViT) architectures, pre-trained on ImageNet-21K and fine-tuned on the ASD dataset. We also develop an extensive facial-expression-based ASD dataset for children (FADC). The ensemble learning model is then created by combining the predictions of the three ViT models and feeding them to a classifier. Our experiments demonstrate that the proposed ensemble learning model outperforms existing approaches and achieves state-of-the-art results in detecting ASD based on facial expressions.
Citations: 0
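The core ensemble step in the abstract is combining the predictions of three ViT backbones. A minimal sketch of one common combination rule, probability averaging, is below; note this is an assumption for illustration — the paper feeds the combined predictions to a learned classifier, and the two-class setup and toy scores here are invented.

```python
# Hedged sketch of prediction combination for an ensemble of three models.
# Averaging is one standard rule; the paper's actual classifier on top of
# the combined predictions may differ.

def ensemble_predict(prob_sets):
    """Average per-model probability vectors and return (argmax class, avg)."""
    n_models = len(prob_sets)
    n_classes = len(prob_sets[0])
    avg = [sum(p[c] for p in prob_sets) / n_models for c in range(n_classes)]
    return max(range(n_classes), key=lambda c: avg[c]), avg

# Toy softmax outputs of three ViT models for one face image
# (class 0 = non-ASD, class 1 = ASD -- labels are illustrative).
vit_outputs = [[0.7, 0.3], [0.6, 0.4], [0.3, 0.7]]
label, avg = ensemble_predict(vit_outputs)
print(label, avg)
```

Averaging damps the influence of any single model's overconfident mistake, which is the usual motivation for ensembling several differently fine-tuned backbones.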
YOLOv5 with Mixed Backbone for Efficient Spatio-Temporal Hand Gesture Localization and Recognition
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-23 DOI: 10.23919/MVA57639.2023.10215605
Luis Acevedo-Bringas, Gibran Benitez-Garcia, J. Olivares-Mercado, Hiroki Takahashi
Abstract: Spatio-temporal Hand Gesture Localization and Recognition (SHGLR) refers to analyzing the spatial and temporal aspects of hand movements to detect and identify hand gestures in a video. Current state-of-the-art approaches for SHGLR use large, complex architectures with high computational cost. To address this issue, we present a new efficient method based on a mixed backbone for YOLOv5, chosen because it is a lightweight, one-stage framework. The mixed backbone combines 2D and 3D convolutions to obtain temporal information from previous frames. The proposed method performs SHGLR on videos efficiently by inflating specific convolutions of the backbone while keeping a computational cost similar to that of the conventional YOLOv5. We conduct experiments on the IPN Hand dataset because of its challenging, continuous hand gestures. Our proposed method achieves a frame mAP@0.5 of 66.52% with a 6-frame clip input, outperforming conventional YOLOv5 by 7.89% and demonstrating the effectiveness of our approach.
Citations: 0
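The abstract says specific 2D convolutions of the backbone are "inflated" into 3D ones. The sketch below shows the standard inflation trick popularized by I3D — replicate a 2D kernel along a new temporal axis and rescale — as one plausible reading; the exact initialization used in the paper is an assumption here.

```python
# Hedged sketch of kernel "inflation": turning a 2D convolution kernel into
# a 3D one over a clip of frames. Replicating and dividing by the temporal
# depth makes the 3D conv initially reproduce the 2D output on a static clip.

def inflate_kernel(kernel2d, depth):
    """Replicate a 2D kernel `depth` times along a new temporal axis,
    rescaling each copy by 1/depth."""
    return [[[w / depth for w in row] for row in kernel2d]
            for _ in range(depth)]

k2d = [[1.0, 2.0],
       [3.0, 4.0]]
k3d = inflate_kernel(k2d, depth=3)

# Summing the inflated kernel over the temporal axis recovers the original
# 2D kernel, so a static video initially yields the pretrained 2D response.
total = [[sum(k3d[t][r][c] for t in range(3)) for c in range(2)]
         for r in range(2)]
print(total)
```

This property is what lets an inflated backbone reuse 2D pretrained weights while gaining access to temporal context from previous frames.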
An X3D Neural Network Analysis for Runner's Performance Assessment in a Wild Sporting Environment
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-22 DOI: 10.23919/MVA57639.2023.10215918
David Freire-Obregón, J. Lorenzo-Navarro, Oliverio J. Santana, D. Hernández-Sosa, M. C. Santana
Abstract: We present a transfer learning analysis of expanded 3D (X3D) neural networks in a sporting environment. Inspired by action quality assessment methods in the literature, our method uses an action recognition network to estimate athletes' cumulative race time (CRT) during an ultra-distance competition. We evaluate X3D, a family of action recognition networks that expand a small 2D image classification architecture along multiple network axes, including space, time, width, and depth. We demonstrate that the resulting neural network provides remarkable performance on short input footage, with a mean absolute error of 12.5 minutes when estimating the CRT for runners who have been active for 8 to 20 hours. Our most significant finding is that X3D achieves state-of-the-art performance while requiring almost seven times less memory than previous work.
Citations: 0
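The headline number in the abstract is a mean absolute error (MAE) of 12.5 minutes on CRT regression. For reference, a minimal sketch of how that metric is computed; the toy race times below are made up, not data from the paper.

```python
# Mean absolute error between predicted and ground-truth cumulative race
# times (CRT), the evaluation metric quoted in the abstract.

def mean_absolute_error(pred, true):
    """Average absolute deviation between predictions and ground truth."""
    return sum(abs(p - t) for p, t in zip(pred, true)) / len(pred)

pred_crt = [480.0, 615.0, 1180.0]   # predicted CRT in minutes (toy values)
true_crt = [470.0, 630.0, 1200.0]   # ground-truth CRT in minutes (toy values)
print(mean_absolute_error(pred_crt, true_crt))  # -> 15.0
```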
BandRe: Rethinking Band-Pass Filters for Scale-Wise Object Detection Evaluation
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-21 DOI: 10.23919/MVA57639.2023.10216132
Yosuke Shinya
Abstract: Scale-wise evaluation of object detectors is important for real-world applications. However, existing metrics are either coarse or not sufficiently reliable. In this paper, we propose novel scale-wise metrics that strike a balance between fineness and reliability, using a filter bank consisting of triangular and trapezoidal band-pass filters. We conduct experiments with two methods on two datasets and show that the proposed metrics can highlight the differences between the methods and between the datasets. Code is available at https://github.com/shinya7y/UniverseNet.
Citations: 1
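The filter bank in the abstract is built from triangular and trapezoidal band-pass filters over object scale. Below is an illustrative sketch of the triangular case; the band edges and the choice of scale measure (e.g., sqrt of box area) are assumptions for illustration, not the paper's actual filter bank — see the linked repository for the real definition.

```python
# Illustrative triangular band-pass filter over object scale: each
# detection contributes to a scale band with a weight that peaks at the
# band center and falls to zero at the band edges. Band edges here are
# assumed values, not the paper's.

def triangular_weight(scale, lo, center, hi):
    """Weight in [0, 1] for an object of the given scale within the band
    (lo, hi), peaking at `center`."""
    if scale <= lo or scale >= hi:
        return 0.0
    if scale <= center:
        return (scale - lo) / (center - lo)
    return (hi - scale) / (hi - center)

# A 32-pixel object sits at the peak of a 16-64 band; a 24-pixel object
# contributes with half weight.
print(triangular_weight(32, 16, 32, 64))  # -> 1.0
print(triangular_weight(24, 16, 32, 64))  # -> 0.5
```

Compared with the hard scale brackets of COCO-style small/medium/large AP, such soft band weights avoid abrupt changes in a metric when an object's size crosses a threshold.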
MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-18 DOI: 10.23919/MVA57639.2023.10215935
Yuki Kondo, N. Ukita, Takayuki Yamaguchi, Haoran Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yuelong Xia, Chien-Yao Wang, Chun-Yi Lee, Da Huo, Marc A. Kastner, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, I. Ide, Yosuke Shinya, Xinyao Liu, Guang Liang, S. Yasui
Abstract: Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image appearances of small objects. This paper proposes a new SOD dataset consisting of 39,070 images including 137,121 bird instances, called the Small Object Detection for Spotting Birds (SOD4SB) dataset, and details the challenge held with it. In total, 223 participants joined the challenge, and the award-winning methods are briefly introduced. The dataset, the baseline code, and the website for evaluation on the public test set are publicly available.
Citations: 3
TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-07-03 DOI: 10.23919/MVA57639.2023.10215774
Marija Ivanovska, Vitomir Štruc, J. Pers
Abstract: Artificial intelligence applications enable farmers to optimize crop growth and production while reducing costs and environmental impact. Computer vision-based algorithms in particular are commonly used for fruit segmentation, enabling in-depth analysis of harvest quality and accurate yield estimation. In this paper, we propose TomatoDIFF, a novel diffusion-based model for semantic segmentation of on-plant tomatoes. When evaluated against other competitive methods, our model demonstrates state-of-the-art (SOTA) performance, even in challenging environments with highly occluded fruits. Additionally, we introduce Tomatopia, a new, large, and challenging dataset of greenhouse tomatoes comprising high-resolution RGB-D images and pixel-level annotations of the fruits. The source code of TomatoDIFF and Tomatopia is available at https://github.com/MIvanovska/TomatoDIFF.
Citations: 2
Lifelong Change Detection: Continuous Domain Adaptation for Small Object Change Detection in Everyday Robot Navigation
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-06-28 DOI: 10.23919/MVA57639.2023.10215686
Koji Takeda, Kanji Tanaka, Yoshimasa Nakamura
Abstract: Ground-view change detection, a recently emerging research area in robotics, suffers from ill-posedness because of visual uncertainty combined with complex nonlinear perspective projection. To regularize the ill-posedness, commonly applied supervised learning methods (e.g., CSCD-Net) rely on manually annotated, high-quality, object-class-specific priors. In this work, we consider general application domains where no manual annotation is available and present a fully self-supervised approach. The proposed approach adopts the powerful and versatile idea that object changes detected during everyday robot navigation can be reused as additional priors to improve future change detection tasks. Furthermore, a robustified framework is implemented and verified experimentally in a new, challenging, practical application scenario: ground-view small object change detection.
Citations: 1
Automatic Reconstruction of Semantic 3D Models from 2D Floor Plans
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-06-02 DOI: 10.23919/MVA57639.2023.10215746
Astrid Barreiro, Mariusz Trzeciakiewicz, A. Hilsmann, P. Eisert
Abstract: Digitalization of existing buildings and the creation of 3D BIM models for them has become crucial for many tasks. Of particular importance are floor plans, which contain information about building layouts and are vital for processes such as construction, maintenance, or refurbishing. However, this data is not always available in digital form, especially for older buildings constructed before CAD tools were widely available, or it lacks semantic information. Digitizing such information usually requires an expert to reconstruct the layouts by hand, a cumbersome and error-prone process. In this paper, we present a pipeline for reconstructing vectorized 3D models from scanned 2D plans, aiming to increase the efficiency of this process. The presented method achieves state-of-the-art results on the public CubiCasa5k dataset [8] and shows good generalization to different types of plans. Our vectorization approach is particularly effective, outperforming previous methods.
Citations: 0
Tackling Face Verification Edge Cases: In-Depth Analysis and Human-Machine Fusion Approach
2023 18th International Conference on Machine Vision and Applications (MVA) Pub Date : 2023-04-17 DOI: 10.23919/MVA57639.2023.10216168
Martin Knoche, G. Rigoll
Abstract: Nowadays, face recognition systems surpass human performance on several datasets. However, there are still edge cases that the machine cannot correctly classify. This paper investigates the effect of combining machine and human operators in the face verification task. First, we look closely at the edge cases for several state-of-the-art models to discover challenging settings common across datasets. Then, we conduct a study on these selected tasks with 60 human participants and provide an extensive analysis. Finally, we demonstrate that combining machine and human decisions can further improve the performance of state-of-the-art face verification systems on various benchmark datasets. Code and data are publicly available on GitHub.
Citations: 0