特征辐射场（FeRF）：利用深度神经网络进行图像合成的多层次特征融合方法

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2024-09-18 DOI:10.1016/j.asoc.2024.112262

Jubo Chen , Xiaosheng Yu , Chengdong Wu , Xiaolei Tian , Ke Xu

{"title":"特征辐射场（FeRF）：利用深度神经网络进行图像合成的多层次特征融合方法","authors":"Jubo Chen , Xiaosheng Yu , Chengdong Wu , Xiaolei Tian , Ke Xu","doi":"10.1016/j.asoc.2024.112262","DOIUrl":null,"url":null,"abstract":"<div><div>Neural Radiance Field (NeRF) has brought revolutionary changes to the field of image synthesis with its unique ability to generate highly realistic multi-view consistent images from a neural scene representation. However, current NeRF-based methods still largely depend on multiple, precisely posed images, especially for complex or dynamic scenes, limiting their versatility. Furthermore, some recent strategies attempt to integrate simple feature extraction networks with volume rendering techniques to reduce multi-view dependence but create blurry outputs, highlighting the need for more sophisticated feature handling to unlock NeRF's full potential. In this paper, we propose an image synthesis method named FeRF, distinguished by its capacity to perform comprehensive feature extraction on individual unposed images and facilitate feature fusion at any stage. Additionally, we present an \"elaborated-feature generation network\" (EGN) composed of four modules, which is configured with two advanced feature extraction modules aimed at precisely refining and processing subtle, complex visual features from a single image. Given that the core objective of FeRF is the precise capture and processing of intricate features from the input images, we innovatively incorporated precisely designed attention mechanisms into the network architecture to optimize and highlight the importance of key feature attributes, thereby effectively enhancing their contribution to subsequent volume rendering processes. Extensive experimentation validates the outstanding qualitative and quantitative performance of our proposed network structure. In comparison to current image feature-based generalized image synthesis methods, it achieves superior reconstruction quality and level of detail.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature radiance fields (FeRF): A multi-level feature fusion method with deep neural network for image synthesis\",\"authors\":\"Jubo Chen , Xiaosheng Yu , Chengdong Wu , Xiaolei Tian , Ke Xu\",\"doi\":\"10.1016/j.asoc.2024.112262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Neural Radiance Field (NeRF) has brought revolutionary changes to the field of image synthesis with its unique ability to generate highly realistic multi-view consistent images from a neural scene representation. However, current NeRF-based methods still largely depend on multiple, precisely posed images, especially for complex or dynamic scenes, limiting their versatility. Furthermore, some recent strategies attempt to integrate simple feature extraction networks with volume rendering techniques to reduce multi-view dependence but create blurry outputs, highlighting the need for more sophisticated feature handling to unlock NeRF's full potential. In this paper, we propose an image synthesis method named FeRF, distinguished by its capacity to perform comprehensive feature extraction on individual unposed images and facilitate feature fusion at any stage. Additionally, we present an \\\"elaborated-feature generation network\\\" (EGN) composed of four modules, which is configured with two advanced feature extraction modules aimed at precisely refining and processing subtle, complex visual features from a single image. Given that the core objective of FeRF is the precise capture and processing of intricate features from the input images, we innovatively incorporated precisely designed attention mechanisms into the network architecture to optimize and highlight the importance of key feature attributes, thereby effectively enhancing their contribution to subsequent volume rendering processes. Extensive experimentation validates the outstanding qualitative and quantitative performance of our proposed network structure. In comparison to current image feature-based generalized image synthesis methods, it achieves superior reconstruction quality and level of detail.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494624010366\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494624010366","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

神经辐照场（NeRF）具有从神经场景表征生成高度逼真的多视角一致图像的独特能力，为图像合成领域带来了革命性的变化。然而，目前基于 NeRF 的方法在很大程度上仍依赖于多张精确摆放的图像，尤其是复杂或动态场景，从而限制了其通用性。此外，最近的一些策略试图将简单的特征提取网络与体积渲染技术相结合，以减少对多视图的依赖，但却产生了模糊的输出结果，这突出表明需要更复杂的特征处理才能释放 NeRF 的全部潜力。在本文中，我们提出了一种名为 "FeRF "的图像合成方法，其特点是能够对单个未摆放的图像进行综合特征提取，并在任何阶段促进特征融合。此外，我们还提出了一个由四个模块组成的 "精细特征生成网络"（EGN），其中配置了两个高级特征提取模块，旨在精确提炼和处理单张图像中细微、复杂的视觉特征。鉴于 FeRF 的核心目标是从输入图像中精确捕捉和处理复杂的特征，我们创新性地将精确设计的关注机制纳入网络架构，以优化和突出关键特征属性的重要性，从而有效提高它们对后续体积渲染过程的贡献。广泛的实验验证了我们提出的网络结构在质量和数量上的卓越性能。与目前基于图像特征的广义图像合成方法相比，它能获得更高的重建质量和细节水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Feature radiance fields (FeRF): A multi-level feature fusion method with deep neural network for image synthesis

Neural Radiance Field (NeRF) has brought revolutionary changes to the field of image synthesis with its unique ability to generate highly realistic multi-view consistent images from a neural scene representation. However, current NeRF-based methods still largely depend on multiple, precisely posed images, especially for complex or dynamic scenes, limiting their versatility. Furthermore, some recent strategies attempt to integrate simple feature extraction networks with volume rendering techniques to reduce multi-view dependence but create blurry outputs, highlighting the need for more sophisticated feature handling to unlock NeRF's full potential. In this paper, we propose an image synthesis method named FeRF, distinguished by its capacity to perform comprehensive feature extraction on individual unposed images and facilitate feature fusion at any stage. Additionally, we present an "elaborated-feature generation network" (EGN) composed of four modules, which is configured with two advanced feature extraction modules aimed at precisely refining and processing subtle, complex visual features from a single image. Given that the core objective of FeRF is the precise capture and processing of intricate features from the input images, we innovatively incorporated precisely designed attention mechanisms into the network architecture to optimize and highlight the importance of key feature attributes, thereby effectively enhancing their contribution to subsequent volume rendering processes. Extensive experimentation validates the outstanding qualitative and quantitative performance of our proposed network structure. In comparison to current image feature-based generalized image synthesis methods, it achieves superior reconstruction quality and level of detail.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.