{"title":"广义隐式神经表示的相位mlp","authors":"Weifeng Chen, Hui Ding, Bo Li, Bin Liu","doi":"10.1145/3573834.3574503","DOIUrl":null,"url":null,"abstract":"Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.","PeriodicalId":345434,"journal":{"name":"Proceedings of the 4th International Conference on Advanced Information Science and System","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Phase-MLP for Generalizable Implicit Neural Representations\",\"authors\":\"Weifeng Chen, Hui Ding, Bo Li, Bin Liu\",\"doi\":\"10.1145/3573834.3574503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.\",\"PeriodicalId\":345434,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3573834.3574503\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Advanced Information Science and System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573834.3574503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Phase-MLP for Generalizable Implicit Neural Representations
Implicit neural representation(INR) has lifted a climax among deep learning researchers for its powerful continuous representation. With Sine activation or Fourier position embedding, INRs overcome the problem that could not reconstruct high-frequency signals in different domains, such as voice, image or 3D shape. However, many of INR researches can merely represent one object or one scene with high-frequency information, multiple instances will bring a sharp decline of performance. In this work, we propose a newly phase-MLP which can not only recover diverse instances but also keep high quality content. Our network takes phase information which corresponding to target signal as input and combine it with encoded position signals to reconstruct the original data. Moreover, we propose a multi-level phase-MLP base on the infrastructure to retain fidelity for bigger instance while limiting the phase information in the low amount. Experimental results on public images and videos demonstrate our proposed approach outperforms the state-of-the-art methods.