2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW): Latest Publications

Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00060
Yinliang Qiu, Zhiyu Li, Jing Wang
Abstract: Obtaining an individual head-related transfer function (HRTF) is an important step in rendering binaural immersive audio, as an individual HRTF provides a more realistic experience than a generic one. For more accurate predictions, we propose a multi-stage model that performs individual HRTF prediction based on anthropometric data, combining global and local features across its stages. In the first stage, a light gradient boosting machine (LightGBM) is chosen as the decision-tree model to predict the HRTF from anthropometric data and different angles. In the second stage, a Transformer encoder learns the global relationships between frequency points. Experimental results show that the multi-stage model outperforms a single model: the spectral distortion of its predictions is smaller, illustrating the model's effectiveness.
Citations: 0
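The abstract reports results in terms of spectral distortion, the RMS difference in dB between predicted and measured magnitude responses across frequency points. A minimal sketch of that metric (the function name and sample values are illustrative, not taken from the paper):

```python
import math

def spectral_distortion(h_pred, h_true):
    """RMS difference, in dB, between two magnitude responses
    sampled at the same frequency points."""
    assert len(h_pred) == len(h_true)
    total = 0.0
    for p, t in zip(h_pred, h_true):
        diff_db = 20.0 * math.log10(p / t)  # per-frequency error in dB
        total += diff_db ** 2
    return math.sqrt(total / len(h_pred))

# Identical responses give zero distortion.
print(spectral_distortion([1.0, 0.5, 0.25], [1.0, 0.5, 0.25]))  # 0.0
```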
Blind Quality Assessment of Point Clouds Based on 3D Co-Occurrence Statistics
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00084
Souheib Riache, M. Larabi, Mohamed Deriche
Abstract: While there has been considerable progress in quality assessment for various types of media, evaluating the quality of point clouds remains a major challenge due to the complexity of the associated applications and the nature of the content. To address this issue, this paper proposes a novel point cloud quality assessment metric based on 3D co-occurrence statistics. The proposed approach involves a voxelization strategy, where the concept of a co-occurrence matrix is extended to 3D to compute the occurrence of a pair of voxels in the 26 possible directions. Selected Haralick features are then computed and concatenated based on the selected color space. A regression step maps the features to the ground truth, represented by the subjective scores associated with the point cloud models. Experimental results show the effectiveness of using 3D co-occurrence statistics for point cloud quality assessment (CO-PCQA): the proposed metric outperforms most recent full-reference and no-reference quality metrics reported in the literature.
Citations: 0
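The core idea, extending a gray-level co-occurrence matrix to 3D over the 26 voxel neighbour directions, can be sketched in a few lines. This is an illustrative reconstruction, not the authors' code; the sparse-dictionary voxel representation is an assumption:

```python
from itertools import product
from collections import Counter

# The 26 neighbour directions of a voxel (every offset except the zero vector).
DIRECTIONS = [d for d in product((-1, 0, 1), repeat=3) if d != (0, 0, 0)]

def cooccurrence(voxels):
    """Count, per direction, how often each pair of voxel values occurs.

    `voxels` maps (x, y, z) -> a quantised value (e.g. a luminance level).
    Returns {direction: Counter{(value_a, value_b): count}}.
    """
    matrices = {d: Counter() for d in DIRECTIONS}
    for (x, y, z), v in voxels.items():
        for dx, dy, dz in DIRECTIONS:
            nb = voxels.get((x + dx, y + dy, z + dz))
            if nb is not None:
                matrices[(dx, dy, dz)][(v, nb)] += 1
    return matrices

grid = {(0, 0, 0): 1, (1, 0, 0): 2}
m = cooccurrence(grid)
print(m[(1, 0, 0)][(1, 2)])  # 1: value 1 has a value-2 neighbour along +x
```

Haralick features (contrast, homogeneity, and so on) would then be computed from each direction's normalized matrix.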
Efficient Low Light Video Enhancement Based on Improved Retinex Algorithms
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00094
Sung-Ling Lee, Shih-Hsuan Yang
Abstract: Videos shot in low-light environments suffer from low contrast and high noise. In this paper, an improved zero-reference low-light video enhancement technique based on the Retinex model is presented. The proposed method improves existing Retinex approaches in several respects. First, image features extracted by a VGG network are employed as part of the input to the generator of the Retinex parameters, increasing temporal stability. Second, a deformable convolution kernel is used to enhance spatial correlation. Third, the optical flow between frames is approximated as a combination of affine linear transformations to reduce complexity. Compared with state-of-the-art low-light enhancement algorithms, the proposed method achieves more favorable and stable image quality in terms of PSNR and SSIM, with short processing time.
Citations: 0
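The Retinex model underlying this line of work factors an image into reflectance times illumination, so enhancement amounts to estimating the illumination with a smoothing filter and removing it in the log domain. A toy single-scale sketch on a 1-D intensity signal (the box-filter illumination estimate is a simplification, not the paper's learned parameter generator):

```python
import math

def box_blur(signal, radius=1):
    """Crude illumination estimate: a box filter over the signal."""
    n = len(signal)
    out = []
    for i in range(n):
        lo, hi = max(0, i - radius), min(n, i + radius + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out

def retinex_enhance(signal, radius=1, eps=1e-6):
    """Single-scale Retinex on a 1-D intensity signal:
    log(reflectance) = log(image) - log(illumination)."""
    illum = box_blur(signal, radius)
    return [math.log(s + eps) - math.log(l + eps)
            for s, l in zip(signal, illum)]

dark = [0.05, 0.06, 0.05, 0.30, 0.05]  # low-light row with one bright pixel
print(retinex_enhance(dark))
```

On a uniformly dark signal the output is flat near zero; the bright pixel stands out as the largest reflectance, which is the contrast-restoring behaviour Retinex methods rely on.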
Leveraging Knowledge Graphs for CheapFakes Detection: Beyond Dataset Evaluation
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00024
Minh-Son Dao, K. Zettsu
Abstract: The proliferation of the internet and the availability of vast amounts of information have given rise to the critical and pressing issue of fake news. Among its various forms, cheapfakes are particularly effective at deceiving people. Existing research on cheapfakes detection has primarily focused on analyzing the context and correlation between textual and visual information but has largely overlooked the significance of external knowledge. As a result, most previous approaches, apart from the baseline of the ICME'23 Grand Challenge on Detecting Cheapfakes, have relied heavily on evaluating the dataset itself to improve performance. Despite achieving impressive results on public test datasets, these approaches often perform poorly in real-world scenarios because of their overreliance on the given dataset. In this study, we propose a novel approach that utilizes knowledge graphs to address the lack of external knowledge. Unlike previous approaches, our proposal does not directly alter or depend on the public test dataset to enhance performance, which can otherwise result in significant overfitting. Our proposed approach achieved an accuracy of 83.52% on Task 1, surpassing the baseline by 1.7%, and an accuracy of 84% on Task 2, outperforming the best result from the previous challenge by 8%.
Citations: 0
A Multimodal Approach for Evaluating Algal Bloom Severity Using Deep Learning
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00097
Fei Zhao, Chengcui Zhang, Sheikh Abujar
Abstract: Harmful algal blooms (HABs) can have detrimental impacts on aquatic ecosystems, human health, and the economy. This paper presents a novel multimodal deep learning approach for assessing the severity of HABs, which will help in taking the measures necessary to mitigate their negative impacts. Unlike other state-of-the-art (SOTA) methods, the proposed method leverages three modalities (satellite imagery, elevation, and temperature data) to capture algal information. In particular, it uses an Attention-UNet-based encoder for the satellite and elevation data and a BiLSTM encoder for the temperature data to extract effective feature embeddings from the respective modalities. In addition, we propose a geometric mean-based multimodal focal loss that modulates the loss contribution of each modality as a function of its confidence. Our approach outperforms SOTA unimodal and ensemble methods on the Tick Tick Bloom (TTB) dataset, achieving a region-averaged root mean squared error (RA-RMSE) of 0.8165.
Citations: 0
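The abstract does not spell out the geometric mean-based multimodal focal loss, but one plausible reading is a focal-style modulating factor driven by the geometric mean of the per-modality confidences, so samples that every modality already finds easy are down-weighted. A hedged sketch under that assumption (the function name, the summed cross-entropy form, and the gamma value are all illustrative, not from the paper):

```python
import math

def geo_mean_focal_loss(probs, gamma=2.0):
    """probs: each modality's predicted probability of the true class.
    The geometric mean of the confidences acts as the joint confidence
    that modulates a summed cross-entropy, focal-loss style."""
    g = math.prod(probs) ** (1.0 / len(probs))   # joint confidence in [0, 1]
    ce = -sum(math.log(p) for p in probs)        # summed cross-entropy
    return (1.0 - g) ** gamma * ce

easy = geo_mean_focal_loss([0.95, 0.9, 0.92])
hard = geo_mean_focal_loss([0.3, 0.4, 0.2])
print(easy < hard)  # True: confident samples contribute far less
```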
Multi-Models from Computer Vision to Natural Language Processing for Cheapfakes Detection
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00023
Thanh-Son Nguyen, Minh-Triet Tran
Abstract: Cheapfakes can compromise the integrity of information and erode trust in multimedia content, making their detection critical. Identifying out-of-context misuse of media is essential to prevent the spread of misinformation and to ensure that news and information are presented accurately and ethically. In this paper, we focus on Task 1 of the Grand Challenge on Detecting Cheapfakes at ICME 2023, which involves detecting whether a triplet consisting of an image and two captions is out of context. We propose a new, robust approach for detecting cheapfakes, which are instances of image reuse with different captions. Our approach leverages multiple models from computer vision and natural language processing, such as named entity recognition, image captioning, and natural language inference. In our experiments, the proposed multi-model method achieves an accuracy of 78.6%, the highest among the candidates on the hidden test set. Overall, our approach demonstrates a promising solution for detecting cheapfakes and safeguarding the integrity of multimedia content. Our source code is publicly available at https://github.com/thanhson28/icme2023.git.
Citations: 0
Analysis of Physical Phenomena in Golf Swing
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00052
Sheng-Kai Chen, Tzu-Yu Liu, Yan-Di Liu, H. Shih
Abstract: The rapid development of artificial intelligence in recent years has led to its increasing application in sports, while people's interest in watching golf tournaments has also grown. This study aims to detect a golfer's posture, club, and clubhead, and to analyze each stage of the swing. We detect the club position during the swing and draw its trajectory; we then use a double pendulum system and the Lagrangian equation to explain the physical phenomena of the swing phase. This not only improves the training quality of players but also enables golfers to use data-analytic methods to diagnose their swing problems in real time. Swing speed can also be improved when players and coaches take these physical characteristics into account.
Citations: 0
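The double pendulum model mentioned here typically treats the arms as the upper link and the club as the lower link; its Lagrangian equations of motion reduce to the standard double-pendulum angular accelerations. A small sketch integrating them with explicit Euler (the masses, lengths, and release angles are placeholder values, not measured golfer data):

```python
import math

G = 9.81  # gravitational acceleration, m/s^2

def accelerations(th1, th2, w1, w2, m1=1.0, m2=0.3, l1=0.6, l2=1.0):
    """Angular accelerations of a double pendulum (upper link = arms,
    lower link = club), from the standard Lagrangian equations of motion."""
    d = th1 - th2
    den = 2 * m1 + m2 - m2 * math.cos(2 * th1 - 2 * th2)
    a1 = (-G * (2 * m1 + m2) * math.sin(th1)
          - m2 * G * math.sin(th1 - 2 * th2)
          - 2 * math.sin(d) * m2 * (w2 ** 2 * l2 + w1 ** 2 * l1 * math.cos(d))
          ) / (l1 * den)
    a2 = (2 * math.sin(d) * (w1 ** 2 * l1 * (m1 + m2)
                             + G * (m1 + m2) * math.cos(th1)
                             + w2 ** 2 * l2 * m2 * math.cos(d))
          ) / (l2 * den)
    return a1, a2

def step(state, dt=1e-3):
    """One explicit-Euler step of the state (th1, th2, w1, w2)."""
    th1, th2, w1, w2 = state
    a1, a2 = accelerations(th1, th2, w1, w2)
    return (th1 + w1 * dt, th2 + w2 * dt, w1 + a1 * dt, w2 + a2 * dt)

# Release from the top of the backswing and let gravity accelerate the club.
s = (2.0, 2.5, 0.0, 0.0)
for _ in range(200):
    s = step(s)
print(s)
```

Fitting the detected club trajectory to such a model is what lets the swing phases be explained in terms of angular velocity transfer from arms to club.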
Differential Melody Generation Based on Time Series Prediction
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00067
Xiang Xu, Wei Zhong, Yi Zou, Long Ye, Qin Zhang
Abstract: Long-term melody generation may encounter challenges such as inadequate melodic variation, resulting in monotony, or unreasonable melodic variation. In this work, we introduce time series prediction and propose Music-FED, a method for generating more creative and harmonic melodies. The proposed approach adopts the first-order difference to describe relative melodic motion and designs a temporal music representation that makes the model more easily aware of the temporal hierarchy of notes. It then learns the distribution of melodic motion variation with a time series prediction-based model in a non-autoregressive manner. Objective and subjective evaluations demonstrate that Music-FED can, to a certain extent, generate pop-music melodies with high harmony and rich content.
Citations: 0
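The first-order-difference representation in the abstract encodes a melody as intervals between successive notes rather than absolute pitches, which makes the learned motion patterns transposition-invariant. A minimal sketch (the MIDI note numbers are illustrative):

```python
def to_intervals(pitches):
    """First-order difference: relative motion instead of absolute pitch."""
    return [b - a for a, b in zip(pitches, pitches[1:])]

def from_intervals(start, intervals):
    """Reconstruct absolute pitches from a start note and its intervals."""
    melody = [start]
    for step in intervals:
        melody.append(melody[-1] + step)
    return melody

motif = [60, 62, 64, 62, 60]        # MIDI: C-D-E-D-C
moves = to_intervals(motif)          # [2, 2, -2, -2]
print(from_intervals(67, moves))     # same contour transposed to start on G
```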
Spatial-Temporal Consistency Refinement Network for Dynamic Point Cloud Frame Interpolation
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00080
Lancao Ren, Lili Zhao, Zhuoqun Sun, Zhipeng Zhang, Jianwen Chen
Abstract: Point cloud frame interpolation aims to improve the frame rate of a point cloud sequence by synthesizing intermediate frames between consecutive frames. Most existing works use only the scene flow or features, without fully exploring their local geometric context or temporal correlation, which results in inaccurate local structural details or motion estimation. In this paper, we organically combine scene flows and features to propose a two-stage network based on residual learning that can generate spatially and temporally consistent interpolated frames. In Stage 1, we propose a spatial-temporal warping module that effectively integrates multi-scale local and global spatial features and temporal correlation into a fused feature, which is then transformed into a coarse interpolated frame. In Stage 2, we introduce a residual-learning structure to perform spatial-temporal consistency refinement. A temporal-aware feature aggregation module is proposed, which helps the network adaptively adjust the contributions of spatial features from the input frames and predicts point-wise offsets to compensate for coarse estimation errors. Experimental results demonstrate that our method achieves state-of-the-art performance on most benchmarks under various interpolation modes. Code is available at https://github.com/renlancao/SR-Net.
Citations: 0
Video-Based Point Cloud Compression Using Density-Based Variable Size Hexahedrons
2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI: 10.1109/ICMEW59549.2023.00032
Faranak Tohidi, M. Paul
Abstract: Video-based point cloud compression (V-PCC) is the most advanced standard for compressing dynamic point clouds, a recently created media format. V-PCC relies on creating patches and converting the 3D input into 2D frames so that existing video coding can be applied. However, creating patches according to the normals and packing the irregular projected patches produce many unoccupied pixels in the 2D frames, degrading temporal prediction and the efficiency of 2D video coding. In addition, unoccupied pixels increase the size of the 2D frames and, accordingly, the required bitrate. This paper introduces variable-size hexahedron segmentation as an alternative to patch creation, reducing the number of unoccupied pixels in the 2D frames. Furthermore, different areas of a point cloud are treated differently according to their density, so that points are captured more accurately. This paper investigates combinations of different hexahedron sizes, and the experimental results demonstrate that the proposed method, with appropriate hexahedron sizes, outperforms V-PCC.
Citations: 0
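The variable-size hexahedron idea can be illustrated with an octree-style recursion: a cell is subdivided until it holds few enough points, so dense regions end up covered by smaller hexahedrons than sparse ones. This sketch is a simplified stand-in for the paper's segmentation (the thresholds and the cubic-box assumption are illustrative):

```python
def split_by_density(points, box, max_points=2, min_size=0.25):
    """Recursively split an axis-aligned box into 8 children until each
    hexahedron holds at most `max_points` points (or reaches `min_size`),
    so dense regions get smaller cells than sparse ones."""
    (x0, y0, z0), (x1, y1, z1) = box
    inside = [p for p in points
              if x0 <= p[0] < x1 and y0 <= p[1] < y1 and z0 <= p[2] < z1]
    size = x1 - x0
    if len(inside) <= max_points or size <= min_size:
        return [(box, inside)] if inside else []
    mx, my, mz = (x0 + x1) / 2, (y0 + y1) / 2, (z0 + z1) / 2
    cells = []
    for lo_x, hi_x in ((x0, mx), (mx, x1)):
        for lo_y, hi_y in ((y0, my), (my, y1)):
            for lo_z, hi_z in ((z0, mz), (mz, z1)):
                cells += split_by_density(
                    inside, ((lo_x, lo_y, lo_z), (hi_x, hi_y, hi_z)),
                    max_points, min_size)
    return cells

# Three clustered points plus one isolated point: the dense cluster ends up
# in a small cell, the isolated point in a larger one.
pts = [(0.1, 0.1, 0.1), (0.15, 0.1, 0.1), (0.12, 0.14, 0.1), (0.8, 0.8, 0.8)]
cells = split_by_density(pts, ((0.0, 0.0, 0.0), (1.0, 1.0, 1.0)))
print(len(cells))
```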