NVS-GAN: Benefit of generative adversarial network on novel view synthesis

International Journal of Intelligent Networks Pub Date : 2024-01-01 DOI:10.1016/j.ijin.2024.04.002

H.S. Shrisha , V. Anupama

{"title":"NVS-GAN: Benefit of generative adversarial network on novel view synthesis","authors":"H.S. Shrisha , V. Anupama","doi":"10.1016/j.ijin.2024.04.002","DOIUrl":null,"url":null,"abstract":"<div><p>The methodology to generate new views for an object from provided input object view is called Novel View Synthesis (NVS). Humans imagine novel views through prior knowledge gathered through their lifetime. NVS-GAN predicts the novel views through computation. Literature survey reveals that there are limited NVS models with low Trainable Parameter Count (TPC) and low model size. Also, a study on the effect of different loss functions on NVS models was lacking. Lowering the TPC indicates less computational steps for the model to predict the output, therefore desirable. Combined with a low model size, the proposed model will become more suitable for deployment in diverse devices having limited resources for computation. Application of right combination of loss functions yield better accuracy. To address these research gaps, NVS-GAN is proposed. NVS-GAN is a Generative Adversarial Network (GAN) approach which yields NVS-Generator which performs NVS. NVS-Generator incorporates identity skip connections, bilinear sampling module, Depthwise Separable Convolution (DSC) as design features and results in low TPC, model size. In addition to discriminator loss, NVS-GAN is trained with different combinations of loss functions i.e. Mean Absolute Error (MAE) loss, Structural Similarity Index Measure (SSIM) loss, Huber loss on chair and car objects of ShapeNet dataset. The performance of NVS-Generator on test set measured in terms of MAE and SSIM is tabulated and analysed. The performance is compared with existing NVS models. The proposed NVS-GAN experiment recorded reduction in NVS-Generator TPC in 37 %–54.6 % range and reduction in model size between 37.2 % and 47.6 % range. NVS-Generator reduced MAE upto 55 % and improved SSIM upto 4 % than existing models. Summarily, NVS-GAN increased model performance and made the model “lightweight”.</p></div>","PeriodicalId":100702,"journal":{"name":"International Journal of Intelligent Networks","volume":"5 ","pages":"Pages 184-195"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666603024000186/pdfft?md5=1c1cfb2444eb7781ad1ce312521adfae&pid=1-s2.0-S2666603024000186-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Networks","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666603024000186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The methodology to generate new views for an object from provided input object view is called Novel View Synthesis (NVS). Humans imagine novel views through prior knowledge gathered through their lifetime. NVS-GAN predicts the novel views through computation. Literature survey reveals that there are limited NVS models with low Trainable Parameter Count (TPC) and low model size. Also, a study on the effect of different loss functions on NVS models was lacking. Lowering the TPC indicates less computational steps for the model to predict the output, therefore desirable. Combined with a low model size, the proposed model will become more suitable for deployment in diverse devices having limited resources for computation. Application of right combination of loss functions yield better accuracy. To address these research gaps, NVS-GAN is proposed. NVS-GAN is a Generative Adversarial Network (GAN) approach which yields NVS-Generator which performs NVS. NVS-Generator incorporates identity skip connections, bilinear sampling module, Depthwise Separable Convolution (DSC) as design features and results in low TPC, model size. In addition to discriminator loss, NVS-GAN is trained with different combinations of loss functions i.e. Mean Absolute Error (MAE) loss, Structural Similarity Index Measure (SSIM) loss, Huber loss on chair and car objects of ShapeNet dataset. The performance of NVS-Generator on test set measured in terms of MAE and SSIM is tabulated and analysed. The performance is compared with existing NVS models. The proposed NVS-GAN experiment recorded reduction in NVS-Generator TPC in 37 %–54.6 % range and reduction in model size between 37.2 % and 47.6 % range. NVS-Generator reduced MAE upto 55 % and improved SSIM upto 4 % than existing models. Summarily, NVS-GAN increased model performance and made the model “lightweight”.

查看原文本刊更多论文

NVS-GAN：生成式对抗网络对新型视图合成的益处

根据提供的输入对象视图为对象生成新视图的方法称为新视图合成（NVS）。人类通过在一生中收集的先验知识来想象新视图。NVS-GAN 通过计算预测新视图。文献调查显示，可训练参数数（TPC）低、模型规模小的 NVS 模型有限。此外，还缺乏关于不同损失函数对 NVS 模型影响的研究。降低 TPC 表示模型预测输出的计算步骤减少，因此是可取的。结合低模型大小，所提出的模型将更适合部署在计算资源有限的各种设备中。应用正确的损失函数组合可获得更高的精度。针对这些研究空白，我们提出了 NVS-GAN。NVS-GAN 是一种生成对抗网络（GAN）方法，可生成执行 NVS 的 NVS 生成器。NVS-Generator 将身份跳转连接、双线性采样模块、深度可分离卷积（DSC）作为设计特征，从而实现了较低的 TPC 和模型大小。除判别损失外，NVS-GAN 还采用了不同的损失函数组合进行训练，如平均绝对误差（MAE）损失、结构相似性指数测量（SSIM）损失、ShapeNet 数据集椅子和汽车对象的 Huber 损失。以 MAE 和 SSIM 为衡量标准，对 NVS 生成器在测试集上的性能进行了列表和分析。该性能与现有的 NVS 模型进行了比较。根据提议的 NVS-GAN 实验记录，NVS-生成器的 TPC 降低了 37 %-54.6 %，模型大小减少了 37.2 %-47.6 %。与现有模型相比，NVS-Generator 的 MAE 降低了 55%，SSIM 提高了 4%。总之，NVS-GAN 提高了模型性能，使模型变得 "轻量级"。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Intelligent Networks

CiteScore

12.00

自引率

0.00%

发文量