Connective Viewpoints of Signal-to-Noise Diffusion Models

arXiv - CS - Neural and Evolutionary Computing Pub Date : 2024-08-08 DOI:arxiv-2408.04221

Khanh Doan, Long Tung Vuong, Tuan Nguyen, Anh Tuan Bui, Quyen Tran, Thanh-Toan Do, Dinh Phung, Trung Le

引用次数: 0

Abstract

Diffusion models (DM) have become fundamental components of generative models, excelling across various domains such as image creation, audio generation, and complex data interpolation. Signal-to-Noise diffusion models constitute a diverse family covering most state-of-the-art diffusion models. While there have been several attempts to study Signal-to-Noise (S2N) diffusion models from various perspectives, there remains a need for a comprehensive study connecting different viewpoints and exploring new perspectives. In this study, we offer a comprehensive perspective on noise schedulers, examining their role through the lens of the signal-to-noise ratio (SNR) and its connections to information theory. Building upon this framework, we have developed a generalized backward equation to enhance the performance of the inference process.

查看原文本刊更多论文

信噪扩散模型的关联观点

扩散模型（DM）已成为生成模型的基本组成部分，在图像创建、音频生成和复杂数据插值等多个领域表现出色。信噪比扩散模型是一个多样化的模型系列，涵盖了大多数最先进的扩散模型。虽然已经有很多人尝试从不同角度研究信噪比（S2N）扩散模型，但仍然需要一项连接不同观点和探索新视角的全面研究。在本研究中，我们从信噪比（SNR）及其与信息论的联系的角度，全面审视了噪声调度器的作用。在此框架基础上，我们开发了一种广义的后向方程，以提高推理过程的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Neural and Evolutionary Computing

自引率

0.00%

发文量