{"title":"面向2d到3d单视图重建的资源意识高性能模型","authors":"Suraj Bidnur, Dhruv Srikanth, Sanjeev Gurugopinath","doi":"10.1109/TENCON54134.2021.9707193","DOIUrl":null,"url":null,"abstract":"We propose two transfer learning-based deep neural network architectures for 2D-to-3D single-view image reconstruction, with an emphasis on low computational resources for training and high reconstruction performance. The proposed models, namely AE-Dense and 3D-SkipNet use DenseNet and ResNet architectures in the encoder, with additional skip connections. Through extensive experimental study on the 3D ShapeNets database, we show that the proposed models outperform state-of-the-art models, namely Pix2Vox and 3D-R2N2, in terms of intersection over union (IoU) metric. In particular, the AE-Dense offers the highest IoU, while the 3D-SkipNet yields a significant reduction in memory and training time, compared to Pix2Vox and 3D-R2N2.","PeriodicalId":405859,"journal":{"name":"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)","volume":"352 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Resource-Conscious High-Performance Models for 2D-to-3D Single-View Reconstruction\",\"authors\":\"Suraj Bidnur, Dhruv Srikanth, Sanjeev Gurugopinath\",\"doi\":\"10.1109/TENCON54134.2021.9707193\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose two transfer learning-based deep neural network architectures for 2D-to-3D single-view image reconstruction, with an emphasis on low computational resources for training and high reconstruction performance. The proposed models, namely AE-Dense and 3D-SkipNet use DenseNet and ResNet architectures in the encoder, with additional skip connections. Through extensive experimental study on the 3D ShapeNets database, we show that the proposed models outperform state-of-the-art models, namely Pix2Vox and 3D-R2N2, in terms of intersection over union (IoU) metric. In particular, the AE-Dense offers the highest IoU, while the 3D-SkipNet yields a significant reduction in memory and training time, compared to Pix2Vox and 3D-R2N2.\",\"PeriodicalId\":405859,\"journal\":{\"name\":\"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)\",\"volume\":\"352 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON54134.2021.9707193\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON54134.2021.9707193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Resource-Conscious High-Performance Models for 2D-to-3D Single-View Reconstruction
We propose two transfer learning-based deep neural network architectures for 2D-to-3D single-view image reconstruction, with an emphasis on low computational resources for training and high reconstruction performance. The proposed models, namely AE-Dense and 3D-SkipNet use DenseNet and ResNet architectures in the encoder, with additional skip connections. Through extensive experimental study on the 3D ShapeNets database, we show that the proposed models outperform state-of-the-art models, namely Pix2Vox and 3D-R2N2, in terms of intersection over union (IoU) metric. In particular, the AE-Dense offers the highest IoU, while the 3D-SkipNet yields a significant reduction in memory and training time, compared to Pix2Vox and 3D-R2N2.