Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems

2023 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology (IAICT). Pub Date: 2023-01-05. DOI: 10.1109/IAICT59002.2023.10205695

Michael Cahyadi, M. Rafi, William Shan, J. Moniaga, Henry Lucky

Citations: 1

Abstract

An image generation system generates images from text prompts. Given the large-scale use of automatic image generation, mainly by diffusion-based systems, the quality of the generated output needs to be evaluated. We conduct a qualitative analysis of the accuracy and fidelity of two image generation systems, DALL-E 2 and Luna, which differ greatly in their training datasets, algorithms, prompt handling, and output scaling. We employ a qualitative benchmarking methodology and find that DALL-E 2 outperforms Luna significantly in terms of both alignment and fidelity.
