Diffusion Models and Generative Artificial Intelligence: Frameworks, Applications and Challenges
Author: Pranjal Kumar
Journal: Archives of Computational Methods in Engineering, volume 32, issue 7, pages 4049-4092
Publication date: 2025-04-02
Publication type: Journal Article
DOI: 10.1007/s11831-025-10266-z
URL: https://link.springer.com/article/10.1007/s11831-025-10266-z
Impact factor: 12.1 (JCR Q1, Computer Science, Interdisciplinary Applications)
Diffusion models (DMs) have recently emerged as a highly effective class of deep generative models, achieving exceptional results in domains including image synthesis, video generation, and molecule design. This survey analyzes the expanding body of research on the topic. Its primary objective is to investigate the architecture and requirements of generative artificial intelligence systems. It first analyzes the prerequisites and frontier ideas for implementing generative AI systems. To clarify how the methodology operates, it then examines the design choices of DMs, covering refinement, parallel generation, editing, in-painting, and cross-domain generation. The survey reviews fundamental DMs and their applications in computer vision (CV), natural language processing (NLP), and image synthesis, as well as interdisciplinary applications in other scientific domains (scene generation, 3D vision, video modeling, medical image diagnosis, time-series analysis, audio generation, and 3D molecule generation, among others). For each domain, it compares the works that apply generative AI methods to downstream tasks, and it also surveys datasets. Finally, it discusses the limitations of current methods and the additional techniques and future directions needed to make meaningful progress in this area.
About the journal:
Archives of Computational Methods in Engineering
Aim and Scope:
Archives of Computational Methods in Engineering serves as an active forum for disseminating research and advanced practices in computational engineering, particularly focusing on mechanics and related fields. The journal emphasizes extended state-of-the-art reviews in selected areas, a unique feature of its publication.
Review Format:
Reviews published in the journal offer:
A survey of current literature
Critical exposition of topics in their full complexity