Lihua Lu, Ruyang Li, Xiaohui Zhang, Hui Wei, Guoguang Du, Binqiang Wang
{"title":"Advances in text-guided 3D editing: a survey","authors":"Lihua Lu, Ruyang Li, Xiaohui Zhang, Hui Wei, Guoguang Du, Binqiang Wang","doi":"10.1007/s10462-024-10937-6","DOIUrl":null,"url":null,"abstract":"<div><p>In 3D Artificial Intelligence Generated Content (AIGC), compared with generating 3D assets from scratch, editing extant 3D assets satisfies user prompts, allowing the creation of diverse and high-quality 3D assets in a time and labor-saving manner. More recently, text-guided 3D editing that modifies 3D assets guided by text prompts is user-friendly and practical, which evokes a surge in research within this field. In this survey, we comprehensively investigate recent literature on text-guided 3D editing in an attempt to answer two questions: What are the methodologies of existing text-guided 3D editing? How has current progress in text-guided 3D editing gone so far? Specifically, we focus on text-guided 3D editing methods published in the past 4 years, delving deeply into their frameworks and principles. We then present a fundamental taxonomy in terms of the editing strategy, optimization scheme, and 3D representation. Based on the taxonomy, we review recent advances in this field, considering factors such as editing scale, type, granularity, and perspective. In addition, we highlight four applications of text-guided 3D editing, including texturing, style transfer, local editing of scenes, and insertion editing, to exploit further the 3D editing capacities with in-depth comparisons and discussions. Depending on the insights achieved by this survey, we discuss open challenges and future research directions. We hope this survey will help readers gain a deeper understanding of this exciting field and foster further advancements in text-guided 3D editing.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"57 12","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10937-6.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-10937-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In 3D Artificial Intelligence Generated Content (AIGC), compared with generating 3D assets from scratch, editing extant 3D assets satisfies user prompts, allowing the creation of diverse and high-quality 3D assets in a time and labor-saving manner. More recently, text-guided 3D editing that modifies 3D assets guided by text prompts is user-friendly and practical, which evokes a surge in research within this field. In this survey, we comprehensively investigate recent literature on text-guided 3D editing in an attempt to answer two questions: What are the methodologies of existing text-guided 3D editing? How has current progress in text-guided 3D editing gone so far? Specifically, we focus on text-guided 3D editing methods published in the past 4 years, delving deeply into their frameworks and principles. We then present a fundamental taxonomy in terms of the editing strategy, optimization scheme, and 3D representation. Based on the taxonomy, we review recent advances in this field, considering factors such as editing scale, type, granularity, and perspective. In addition, we highlight four applications of text-guided 3D editing, including texturing, style transfer, local editing of scenes, and insertion editing, to exploit further the 3D editing capacities with in-depth comparisons and discussions. Depending on the insights achieved by this survey, we discuss open challenges and future research directions. We hope this survey will help readers gain a deeper understanding of this exciting field and foster further advancements in text-guided 3D editing.
在三维人工智能生成内容(AIGC)中,与从头开始生成三维资产相比,编辑现有的三维资产可以满足用户的提示,从而以省时省力的方式创建多样化和高质量的三维资产。最近,以文本提示为指导修改三维资产的文本指导三维编辑既友好又实用,从而引发了这一领域的研究热潮。在本调查中,我们全面调查了近期有关文本引导 3D 编辑的文献,试图回答两个问题:现有文本引导 3D 编辑的方法有哪些?文本引导的三维编辑目前进展如何?具体而言,我们将重点关注过去 4 年中发表的文本引导 3D 编辑方法,深入探讨其框架和原理。然后,我们从编辑策略、优化方案和三维表示等方面提出了一个基本分类法。基于该分类法,我们回顾了该领域的最新进展,并考虑了编辑规模、类型、粒度和视角等因素。此外,我们还重点介绍了文本引导的三维编辑的四种应用,包括贴图、风格转换、场景局部编辑和插入编辑,通过深入的比较和讨论进一步开发三维编辑能力。根据本次调查所获得的启示,我们讨论了有待解决的挑战和未来的研究方向。我们希望本调查报告能帮助读者更深入地了解这一令人兴奋的领域,并促进文本引导的三维编辑技术的进一步发展。
期刊介绍:
Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.