2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)最新文献_第3页

FLNet: Graph Constrained Floor Layout Generation FLNet:图形约束地板布局生成

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859350

Abhinav Upadhyay, Alpana Dubey, Veenu Arora, Mani Suma Kuriakose, Shaurya Agarawal

引用次数: 1

3D-DSPnet: Product Disassembly Sequence Planning 3D-DSPnet:产品拆卸顺序规划

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859434

Abhinav Upadhyay, Bharat Ladrecha, Alpana Dubey, Suma Mani Kuriakose, P. Goenka

引用次数: 0

CDTNET: Cross-Domain Transformer Based on Attributes for Person Re-Identification CDTNET:基于属性的跨域人员再识别转换器

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859330

Mengyuan Guan, Suncheng Xiang, Ting Liu, Yuzhuo Fu

引用次数: 1

CPS: Full-Song and Style-Conditioned Music Generation with Linear Transformer 使用线性变压器的全歌曲和风格条件音乐生成

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859286

Weipeng Wang, Xiaobing Li, Cong Jin, Di Lu, Qingwen Zhou, Tie Yun

引用次数: 2

Fire and Gun Detection Based on Sematic Embeddings 基于语义嵌入的火力和火炮检测

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859303

Yunbin Deng, Ryan Campbell, Piyush Kumar

引用次数: 1

Bottleneck Detection in Crowded Video Scenes Utilizing Lagrangian Motion Analysis Via Density and Arc Length Measures 基于密度和弧长测量的拉格朗日运动分析在拥挤视频场景中的瓶颈检测

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859348

Maik Simon, Erik Bochinski, Markus Küchhold, T. Sikora

引用次数: 1

Decentralized Federated Learning with Enhanced Privacy Preservation 增强隐私保护的去中心化联邦学习

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859507

Sheng-Po Tseng, Jan-Yue Lin, Wei-Chien Cheng, L. Yeh, Chih-Ya Shen

引用次数: 1

Surveillance Video Anomaly Detection with Feature Enhancement and Consistency Frame Prediction 基于特征增强和一致性帧预测的监控视频异常检测

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859414

Beiji Zou, Min Wang, Lingzi Jiang, Yue Zhang, Shu Liu

引用次数: 0

Multi-Augmentation for Efficient Self-Supervised Visual Representation Learning 基于多增强的高效自监督视觉表征学习

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859465

Van-Nhiem Tran, Chi-En Huang, Shenyao Liu, Kai-Lin Yang, Timothy Ko, Yung-Hui Li

{"title":"Multi-Augmentation for Efficient Self-Supervised Visual Representation Learning","authors":"Van-Nhiem Tran, Chi-En Huang, Shenyao Liu, Kai-Lin Yang, Timothy Ko, Yung-Hui Li","doi":"10.1109/ICMEW56448.2022.9859465","DOIUrl":"https://doi.org/10.1109/ICMEW56448.2022.9859465","url":null,"abstract":"In recent years, self-supervised learning has been studied to deal with the limitation of available labeled-dataset. Among the major components of self-supervised learning, the data augmentation pipeline is one key factor in enhancing the resulting performance. However, most researchers manually designed the augmentation pipeline, and the limited collections of transformation may cause the lack of robustness of the learned feature representation. In this work, we proposed Multi-Augmentations for Self-Supervised Representation Learning (MA-SSRL), which fully searched for various augmentation policies to build the entire pipeline to improve the robustness of the learned feature representation. MA-SSRL successfully learns the invariant feature representation and presents an efficient, effective, and adaptable data augmentation pipeline for self-supervised pre-training on different distribution and domain datasets. MA-SSRL outperforms the previous state-of-the-art methods on transfer and semi-supervised benchmarks while requiring fewer training epochs. Code available on GitHub1.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129542332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

GolfPose: Golf Swing Analyses with a Monocular Camera Based Human Pose Estimation GolfPose:高尔夫挥杆分析与单目相机为基础的人体姿态估计

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI: 10.1109/ICMEW56448.2022.9859415

Zhongyu Jiang, Haorui Ji, Samuel Menaker, Jenq-Neng Hwang

{"title":"GolfPose: Golf Swing Analyses with a Monocular Camera Based Human Pose Estimation","authors":"Zhongyu Jiang, Haorui Ji, Samuel Menaker, Jenq-Neng Hwang","doi":"10.1109/ICMEW56448.2022.9859415","DOIUrl":"https://doi.org/10.1109/ICMEW56448.2022.9859415","url":null,"abstract":"With the rapid developments of computer vision and deep learning technologies, artificial intelligence takes a more and more important role in sports analyses. In this paper, to attain the objective of automated golf swing analyses, we propose a lightweight temporal-based 2D human pose estimation (HPE) method, called GolfPose, which achieves improved performance than the state-of-the-art image-based HPE methods. Unlike traditional image-based methods, our temporal-based method, designed for efficient and effective golf swing analyses, takes advantage of the temporal information to improve the estimation accuracy of fast-moving and partially self-occluded keypoints. Furthermore, in order to make sure the golf swing analyses can run on mobile devices, we optimize the model architecture to achieve real-time inference. With around 10% of the parameters and half of the GFLOPs used in the state-of-the-art HRNet, our proposed GolfPose model can achieve 9.16 mean pixel error (MPE) in our golf swing dataset, compared with 9.20 MPE for HRNet. Furthermore, the proposed temporal-based method, facilitated with golf club detection(GCD), significantly improves the accuracy of keypoints on the golf club from 13.98 to 9.21 MPE.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129730450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4