Title: Cheap-Fake Detection with LLM Using Prompt Engineering
Authors: Guangyang Wu, Weijie Wu, Xiaohong Liu, Kele Xu, Tianjiao Wan, Wenyi Wang
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00025
Published: 2023-06-05
Abstract: The misuse of real photographs with conflicting image captions in news items is an example of the out-of-context (OOC) misuse of media. To detect OOC media, individuals must determine the accuracy of the statement and evaluate whether the triplet (i.e., the image and two captions) relates to the same event. This paper presents a novel learnable approach for detecting OOC media in the ICME'23 Grand Challenge on Detecting Cheapfakes. The proposed method is based on the COSMOS structure, which assesses the coherence between an image and its captions, as well as between the two captions. We enhance the baseline algorithm by incorporating a Large Language Model (LLM), GPT-3.5, as a feature extractor. Specifically, we use prompt engineering to build a robust and reliable feature extractor on top of GPT-3.5. The extractor captures the correlation between the two captions and integrates effectively into the COSMOS baseline model, allowing a deeper understanding of the relationship between captions. By incorporating this module, we demonstrate the potential for significant improvements in cheapfake detection performance. The methodology also holds promise for applications such as natural language processing, image captioning, and text-to-image synthesis. A Docker image for submission is available at https://hub.docker.com/repository/docker/mulns/acmmmcheapfakes.

{"title":"Half Title Page","authors":"","doi":"10.1109/icmew59549.2023.00001","DOIUrl":"https://doi.org/10.1109/icmew59549.2023.00001","url":null,"abstract":"","PeriodicalId":111482,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130097309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semi-Supervised Federated Learning for Keyword Spotting","authors":"Enmao Diao, Eric W. Tramel, Jie Ding, Tao Zhang","doi":"10.1109/ICMEW59549.2023.00087","DOIUrl":"https://doi.org/10.1109/ICMEW59549.2023.00087","url":null,"abstract":"Keyword Spotting (KWS) is a critical aspect of audio-based applications on mobile devices and virtual assistants. Recent developments in Federated Learning (FL) have significantly expanded the ability to train machine learning models by utilizing the computational and private data resources of numerous distributed devices. However, existing FL methods typically require that devices possess accurate ground-truth labels, which can be both expensive and impractical when dealing with local audio data. In this study, we first demonstrate the effectiveness of Semi-Supervised Federated Learning (SSL) and FL for KWS. We then extend our investigation to Semi-Supervised Federated Learning (SSFL) for KWS, where devices possess completely unlabeled data, while the server has access to a small amount of labeled data. We perform numerical analyses using state-of-the-art SSL, FL, and SSFL techniques to demonstrate that the performance of KWS models can be significantly improved by leveraging the abundant unlabeled heterogeneous data available on devices.","PeriodicalId":111482,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127499651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Prompt What You Need: Enhancing Segmentation in Rainy Scenes with Anchor-Based Prompting
Authors: Xiaoyuan Guo, Xiang Wei, Q. Su, Hui-Huang Zhao, Shunli Zhan
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00019
Published: 2023-05-06
Abstract: Semantic segmentation in rainy scenes is a challenging task due to the complex environment, class distribution imbalance, and limited annotated data. To address these challenges, we propose a novel framework that combines semi-supervised learning with a pre-trained segmentation foundation model to achieve superior performance. Specifically, the semi-supervised model serves as the basis for generating raw semantic segmentation results, while also guiding the pre-trained foundation model, via entropy-based anchor prompts, to compensate for knowledge gaps. In addition, to minimize the impact of irrelevant segmentation masks generated by the pre-trained foundation model, we propose a mask filtering and fusion mechanism that refines the raw semantic segmentation results according to the principle of minimum risk. The proposed framework achieves superior segmentation performance on the Rainy WCity dataset and won first prize in the STRAIN subtrack of the ICME 2023 Grand Challenges.

Title: Learn How to Prune Pixels for Multi-View Neural Image-Based Synthesis
Authors: Marta Milovanović, Enzo Tartaglione, Marco Cagnazzo, F. Henry
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00034
Published: 2023-05-05
Abstract: Image-based rendering techniques stand at the core of an immersive user experience, as they generate novel views from a set of multiple input images. Since they have shown good performance in terms of objective and subjective quality, the research community devotes great effort to their improvement. However, the large volume of data that must be rendered at the receiver's side hinders applications in limited-bandwidth environments and prevents their use in real-time applications. We present LeHoPP, a method for input-pixel pruning that examines the importance of each input pixel with respect to the rendered view and avoids using irrelevant pixels. Even without retraining the image-based rendering network, our approach shows a good trade-off between synthesis quality and pixel rate. When tested in the general neural rendering framework, LeHoPP gains between 0.9 dB and 3.6 dB on average compared to other pruning baselines.

Title: Conditional and Residual Methods in Scalable Coding for Humans and Machines
Authors: Anderson de Andrade, Alon Harell, Yalda Foroutan, Ivan V. Bajić
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00040
Published: 2023-05-04
Abstract: We present methods for conditional and residual coding in the context of scalable coding for humans and machines. Our focus is on optimizing the rate-distortion performance of the reconstruction task using the information available in the computer-vision task. We include an information analysis of both approaches to provide baselines, and we also propose an entropy model suitable for conditional coding with increased modelling capacity and similar tractability to previous work. We apply these methods to image reconstruction, using in one instance representations created for semantic segmentation on the Cityscapes dataset, and in another, representations created for object detection on the COCO dataset. In both experiments, the conditional and residual methods perform similarly, with the resulting rate-distortion curves contained within our baselines.

{"title":"Exploiting Inductive Bias in Transformer for Point Cloud Classification and Segmentation","authors":"Zihao Li, Pan Gao, Hui Yuan, Ran Wei, M. Paul","doi":"10.1109/ICMEW59549.2023.00031","DOIUrl":"https://doi.org/10.1109/ICMEW59549.2023.00031","url":null,"abstract":"Discovering inter-point connection for efficient high-dimensional feature extraction from point coordinate is a key challenge in processing point cloud. Most existing methods focus on designing efficient local feature extractors while ignoring global connection, or vice versa. In this paper, we design a new Inductive Bias-aided Transformer (IBT) method to learn 3D inter-point relations, which considers both local and global attentions. Specifically, considering local spatial coherence, local feature learning is performed through Relative Position Encoding and Attentive Feature Pooling. We incorporate the learned locality into the Transformer module. The local feature affects value component in Transformer to modulate the relationship between channels of each point, which can enhance self-attention mechanism with locality based channel interaction. We demonstrate its superiority experimentally on classification and segmentation tasks. The code is available at: https://github.com/jiamang/IBT","PeriodicalId":111482,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126506126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: An Order-Complexity Model for Aesthetic Quality Assessment of Homophony Music Performance
Authors: Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Jialin Sun
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00061
Published: 2023-04-23
Abstract: Although computational aesthetics evaluation has made progress in many fields, its application to music performance remains largely unexplored. At present, subjective evaluation is still the primary method in music aesthetics research, but it consumes considerable human and material resources. In addition, music performances generated by AI remain mechanical, monotonous, and lacking in beauty. To guide the generation of AI music performances, and to improve the playing of human performers, this paper uses Birkhoff's aesthetic measure to propose an objective measurement of beauty. The main contributions are as follows: first, we put forward an objective aesthetic evaluation method to measure the aesthetics of a music performance; second, we propose 10 basic music features and 4 aesthetic music features. Experiments show that our method performs well on performance assessment.

Title: XGC-VQA: A Unified Video Quality Assessment Model for User, Professionally, and Occupationally-Generated Content
Authors: Xinhui Huang, Chunyi Li, A. Bentaleb, Roger Zimmermann, Guangtao Zhai
Venue: 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
DOI: https://doi.org/10.1109/ICMEW59549.2023.00081
Published: 2023-03-24
Abstract: With the rapid growth in the amount and variety of Internet video data, a unified Video Quality Assessment (VQA) model is needed to guide video communication toward better perceptual quality. To meet the accompanying real-time and universality requirements, this study proposes a VQA model built on a classification of User Generated Content (UGC), Professionally Generated Content (PGC), and Occupationally Generated Content (OGC). In the time domain, the model uses non-uniform sampling, since each content type places its temporal importance on different parts of the video with respect to perceptual quality. In the spatial domain, centralized downsampling is performed before the VQA process via a patch splicing/sampling mechanism, lowering complexity enough for real-time assessment. Experimental results show that the proposed method achieves a median correlation of 0.7 while keeping computation time below 5 s for all three content types, so the communication experience of UGC, PGC, and OGC can be optimized together.

{"title":"A Perceptual Quality Assessment Exploration for AIGC Images","authors":"Zicheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai","doi":"10.1109/ICMEW59549.2023.00082","DOIUrl":"https://doi.org/10.1109/ICMEW59549.2023.00082","url":null,"abstract":"AI Generated Content (AIGC) has gained widespread attention with the increasing efficiency of deep learning in content creation. AIGC, created with the assistance of artificial intelligence technology, includes various forms of content, among which the AI-generated images (AGIs) have brought significant impact to society and have been applied to various fields such as entertainment, education, social media, etc. However, due to hardware limitations and technical proficiency, the quality of AIGC images (AGIs) varies, necessitating refinement and filtering before practical use. Consequently, there is an urgent need for developing objective models to assess the quality of AGIs. Unfortunately, no research has been carried out to investigate the perceptual quality assessment for AGIs specifically. Therefore, in this paper, we first discuss the major evaluation aspects such as technical issues, AI artifacts, unnaturalness, discrepancy, and aesthetics for AGI quality assessment. Then we present the first perceptual AGI quality assessment database, AGIQA-1K, which consists of 1,080 AGIs generated from diffusion models. A well-organized subjective experiment is followed to collect the quality labels of the AGIs. Finally, we conduct a benchmark experiment to evaluate the performance of current image quality assessment (IQA) models. The database is released on https://github.com/lcysyzxdxc/AGIQA-1k-Database.","PeriodicalId":111482,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125937237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}