PA-INR: Parallel adapter-based storage of edited implicit neural representation

IF 3.1 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Visual Communication and Image Representation Pub Date : 2025-06-05 DOI:10.1016/j.jvcir.2025.104499

Mingyi Ma , Xinzui Wang , Yan Wang , Tongda Xu , Fucheng Cao , Shichen Su

{"title":"PA-INR: Parallel adapter-based storage of edited implicit neural representation","authors":"Mingyi Ma , Xinzui Wang , Yan Wang , Tongda Xu , Fucheng Cao , Shichen Su","doi":"10.1016/j.jvcir.2025.104499","DOIUrl":null,"url":null,"abstract":"<div><div>INRs (Implicit Neural Representations) have demonstrated strong potential in the field of image representation. Unlike traditional discrete representation methods, INRs map pixel coordinates to RGB values using a neural network, where data can be stored in a continuous form. Due to the infeasibility of interpreting INRs during training, current methods cannot effectively store associated INR weights, such as network weights before and after image editing. This results in the same storage space required to store two associated INR weights as for storing two unrelated images. To address this issue, we propose a method based on Parallel Adapter, which efficiently stores multiple INRs through model fine-tuning. By storing the residuals of different INRs in parallel Adapter structures, the storage space required for multiple associated INRs is significantly reduced. Furthermore, by parallelizing and merging Adapter structures, our method achieves functionality similar to storing and merging editing histories. We conducted experiments on both images and videos, demonstrating that our method is fully compatible with existing weight training methods and methods for generating weights from hypernetworks. And with our method, it is possible to directly utilize a meta-network to generate residuals between INRs, allowing for a generalized direct editing while preserving the original INR structure.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"111 ","pages":"Article 104499"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

INRs (Implicit Neural Representations) have demonstrated strong potential in the field of image representation. Unlike traditional discrete representation methods, INRs map pixel coordinates to RGB values using a neural network, where data can be stored in a continuous form. Due to the infeasibility of interpreting INRs during training, current methods cannot effectively store associated INR weights, such as network weights before and after image editing. This results in the same storage space required to store two associated INR weights as for storing two unrelated images. To address this issue, we propose a method based on Parallel Adapter, which efficiently stores multiple INRs through model fine-tuning. By storing the residuals of different INRs in parallel Adapter structures, the storage space required for multiple associated INRs is significantly reduced. Furthermore, by parallelizing and merging Adapter structures, our method achieves functionality similar to storing and merging editing histories. We conducted experiments on both images and videos, demonstrating that our method is fully compatible with existing weight training methods and methods for generating weights from hypernetworks. And with our method, it is possible to directly utilize a meta-network to generate residuals between INRs, allowing for a generalized direct editing while preserving the original INR structure.

查看原文本刊更多论文

PA-INR：基于并行适配器的编辑隐式神经表示存储

内隐神经表征（INRs）在图像表征领域显示出强大的潜力。与传统的离散表示方法不同，INRs使用神经网络将像素坐标映射到RGB值，其中数据可以以连续形式存储。由于在训练过程中解释INR的不可行性，目前的方法不能有效地存储相关的INR权值，如图像编辑前后的网络权值。这导致存储两个相关的INR权重所需的存储空间与存储两个不相关的图像所需的存储空间相同。为了解决这个问题，我们提出了一种基于并行适配器的方法，该方法通过模型微调有效地存储多个inr。通过将不同inr的残差存储在并行适配器结构中，可以显著减少多个相关inr所需的存储空间。此外，通过并行化和合并适配器结构，我们的方法实现了类似于存储和合并编辑历史的功能。我们对图像和视频进行了实验，证明我们的方法与现有的权重训练方法和从超网络生成权重的方法完全兼容。使用我们的方法，可以直接利用元网络来生成INR之间的残差，允许在保留原始INR结构的同时进行广义的直接编辑。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Visual Communication and Image Representation 工程技术-计算机：软件工程

CiteScore

5.40

自引率

11.50%

发文量

188

审稿时长

9.9 months

期刊介绍： The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.