Mingyi Ma , Xinzui Wang , Yan Wang , Tongda Xu , Fucheng Cao , Shichen Su
{"title":"PA-INR: Parallel adapter-based storage of edited implicit neural representation","authors":"Mingyi Ma , Xinzui Wang , Yan Wang , Tongda Xu , Fucheng Cao , Shichen Su","doi":"10.1016/j.jvcir.2025.104499","DOIUrl":null,"url":null,"abstract":"<div><div>INRs (Implicit Neural Representations) have demonstrated strong potential in the field of image representation. Unlike traditional discrete representation methods, INRs map pixel coordinates to RGB values using a neural network, where data can be stored in a continuous form. Due to the infeasibility of interpreting INRs during training, current methods cannot effectively store associated INR weights, such as network weights before and after image editing. This results in the same storage space required to store two associated INR weights as for storing two unrelated images. To address this issue, we propose a method based on Parallel Adapter, which efficiently stores multiple INRs through model fine-tuning. By storing the residuals of different INRs in parallel Adapter structures, the storage space required for multiple associated INRs is significantly reduced. Furthermore, by parallelizing and merging Adapter structures, our method achieves functionality similar to storing and merging editing histories. We conducted experiments on both images and videos, demonstrating that our method is fully compatible with existing weight training methods and methods for generating weights from hypernetworks. And with our method, it is possible to directly utilize a meta-network to generate residuals between INRs, allowing for a generalized direct editing while preserving the original INR structure.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"111 ","pages":"Article 104499"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
INRs (Implicit Neural Representations) have demonstrated strong potential in the field of image representation. Unlike traditional discrete representation methods, INRs map pixel coordinates to RGB values using a neural network, where data can be stored in a continuous form. Due to the infeasibility of interpreting INRs during training, current methods cannot effectively store associated INR weights, such as network weights before and after image editing. This results in the same storage space required to store two associated INR weights as for storing two unrelated images. To address this issue, we propose a method based on Parallel Adapter, which efficiently stores multiple INRs through model fine-tuning. By storing the residuals of different INRs in parallel Adapter structures, the storage space required for multiple associated INRs is significantly reduced. Furthermore, by parallelizing and merging Adapter structures, our method achieves functionality similar to storing and merging editing histories. We conducted experiments on both images and videos, demonstrating that our method is fully compatible with existing weight training methods and methods for generating weights from hypernetworks. And with our method, it is possible to directly utilize a meta-network to generate residuals between INRs, allowing for a generalized direct editing while preserving the original INR structure.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.