广义隐式神经表示的相位mlp

Weifeng Chen, Hui Ding, Bo Li, Bin Liu
{"title":"广义隐式神经表示的相位mlp","authors":"Weifeng Chen, Hui Ding, Bo Li, Bin Liu","doi":"10.1145/3573834.3574503","DOIUrl":null,"url":null,"abstract":"Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.","PeriodicalId":345434,"journal":{"name":"Proceedings of the 4th International Conference on Advanced Information Science and System","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Phase-MLP for Generalizable Implicit Neural Representations\",\"authors\":\"Weifeng Chen, Hui Ding, Bo Li, Bin Liu\",\"doi\":\"10.1145/3573834.3574503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.\",\"PeriodicalId\":345434,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3573834.3574503\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Advanced Information Science and System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573834.3574503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

内隐神经表征(INR)以其强大的连续表征能力在深度学习研究者中掀起了一个高潮。利用正弦激活或傅立叶位置嵌入,inr克服了无法重建不同域(如语音、图像或三维形状)高频信号的问题。然而,许多INR研究只能用高频信息表示一个物体或一个场景,多实例会导致性能急剧下降。在这项工作中,我们提出了一种新的相位mlp,它不仅可以恢复不同的实例,而且可以保持高质量的内容。该网络将目标信号对应的相位信息作为输入,与经过编码的位置信号结合,重构原始数据。此外,我们提出了一种基于基础结构的多级相位mlp,以保持更大实例的保真度,同时将相位信息限制在少量。公共图像和视频的实验结果表明,我们提出的方法优于最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Phase-MLP for Generalizable Implicit Neural Representations
Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信