扩散模型和生成式人工智能:框架、应用和挑战

IF 12.1 2区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Pranjal Kumar
{"title":"扩散模型和生成式人工智能:框架、应用和挑战","authors":"Pranjal Kumar","doi":"10.1007/s11831-025-10266-z","DOIUrl":null,"url":null,"abstract":"<div><p>Diffusion Models (DMs) have recently emerged as a highly effective category of deep generative models, achieving exceptional results in various domains, including image synthesis, video generation, and molecule design. This survey provides a comprehensive analysis of the expanding body of research on this topic. The primary objective of this study is to investigate the architecture and requirements of generative artificial intelligence systems. Initially, an analysis of the prerequisites and frontier ideas for the implementation of generative AI systems is performed. To clarify the operational mechanisms of the methodology, the design choices of DMs are thoroughly examined, covering aspects such as refinement, parallel generation, editing, in-painting, and cross-domain generation. This study extensively reviews fundamental DMs and their diverse applications in fields such as computer vision (CV), natural language processing (NLP), image synthesis, and interdisciplinary applications (scene generation, 3D vision, video modeling, medical image diagnosis, time-series analysis, audio generation, 3D molecule generation etc.) in other scientific domains. A comparative study for all the works that use generative AI methods for various downstream tasks in each domain is performed. A comprehensive study on datasets is also carried out. Finally, it discusses the limitations of current methods, as well as the need for additional techniques and future directions in order to make meaningful progress in this area.</p></div>","PeriodicalId":55473,"journal":{"name":"Archives of Computational Methods in Engineering","volume":"32 7","pages":"4049 - 4092"},"PeriodicalIF":12.1000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion Models and Generative Artificial Intelligence: Frameworks, Applications and Challenges\",\"authors\":\"Pranjal Kumar\",\"doi\":\"10.1007/s11831-025-10266-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Diffusion Models (DMs) have recently emerged as a highly effective category of deep generative models, achieving exceptional results in various domains, including image synthesis, video generation, and molecule design. This survey provides a comprehensive analysis of the expanding body of research on this topic. The primary objective of this study is to investigate the architecture and requirements of generative artificial intelligence systems. Initially, an analysis of the prerequisites and frontier ideas for the implementation of generative AI systems is performed. To clarify the operational mechanisms of the methodology, the design choices of DMs are thoroughly examined, covering aspects such as refinement, parallel generation, editing, in-painting, and cross-domain generation. This study extensively reviews fundamental DMs and their diverse applications in fields such as computer vision (CV), natural language processing (NLP), image synthesis, and interdisciplinary applications (scene generation, 3D vision, video modeling, medical image diagnosis, time-series analysis, audio generation, 3D molecule generation etc.) in other scientific domains. A comparative study for all the works that use generative AI methods for various downstream tasks in each domain is performed. A comprehensive study on datasets is also carried out. Finally, it discusses the limitations of current methods, as well as the need for additional techniques and future directions in order to make meaningful progress in this area.</p></div>\",\"PeriodicalId\":55473,\"journal\":{\"name\":\"Archives of Computational Methods in Engineering\",\"volume\":\"32 7\",\"pages\":\"4049 - 4092\"},\"PeriodicalIF\":12.1000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archives of Computational Methods in Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11831-025-10266-z\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Computational Methods in Engineering","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s11831-025-10266-z","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

扩散模型(Diffusion Models, DMs)最近成为深度生成模型的一个非常有效的类别,在包括图像合成、视频生成和分子设计在内的各个领域取得了卓越的成果。这项调查提供了对这一主题的不断扩大的研究机构的全面分析。本研究的主要目的是研究生成式人工智能系统的架构和需求。首先,分析了实现生成式人工智能系统的先决条件和前沿思想。为了阐明该方法的操作机制,对dm的设计选择进行了彻底的检查,涵盖了细化、并行生成、编辑、绘图和跨域生成等方面。本研究广泛回顾了基本的DMs及其在计算机视觉(CV)、自然语言处理(NLP)、图像合成以及其他科学领域的跨学科应用(场景生成、3D视觉、视频建模、医学图像诊断、时间序列分析、音频生成、3D分子生成等)中的各种应用。对每个领域的各种下游任务使用生成式人工智能方法的所有作品进行了比较研究。对数据集进行了全面的研究。最后,它讨论了现有方法的局限性,以及需要额外的技术和未来的方向,以便在这一领域取得有意义的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Diffusion Models and Generative Artificial Intelligence: Frameworks, Applications and Challenges

Diffusion Models and Generative Artificial Intelligence: Frameworks, Applications and Challenges

Diffusion Models (DMs) have recently emerged as a highly effective category of deep generative models, achieving exceptional results in various domains, including image synthesis, video generation, and molecule design. This survey provides a comprehensive analysis of the expanding body of research on this topic. The primary objective of this study is to investigate the architecture and requirements of generative artificial intelligence systems. Initially, an analysis of the prerequisites and frontier ideas for the implementation of generative AI systems is performed. To clarify the operational mechanisms of the methodology, the design choices of DMs are thoroughly examined, covering aspects such as refinement, parallel generation, editing, in-painting, and cross-domain generation. This study extensively reviews fundamental DMs and their diverse applications in fields such as computer vision (CV), natural language processing (NLP), image synthesis, and interdisciplinary applications (scene generation, 3D vision, video modeling, medical image diagnosis, time-series analysis, audio generation, 3D molecule generation etc.) in other scientific domains. A comparative study for all the works that use generative AI methods for various downstream tasks in each domain is performed. A comprehensive study on datasets is also carried out. Finally, it discusses the limitations of current methods, as well as the need for additional techniques and future directions in order to make meaningful progress in this area.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
19.80
自引率
4.10%
发文量
153
审稿时长
>12 weeks
期刊介绍: Archives of Computational Methods in Engineering Aim and Scope: Archives of Computational Methods in Engineering serves as an active forum for disseminating research and advanced practices in computational engineering, particularly focusing on mechanics and related fields. The journal emphasizes extended state-of-the-art reviews in selected areas, a unique feature of its publication. Review Format: Reviews published in the journal offer: A survey of current literature Critical exposition of topics in their full complexity By organizing the information in this manner, readers can quickly grasp the focus, coverage, and unique features of the Archives of Computational Methods in Engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信