{"title":"Optimal Network Models for Reconstructing 3D Point Cloud from a Single 2D Image","authors":"Huang Chen, Chuen-Horng Lin, Yan-Yu Lin","doi":"10.1109/SNPD54884.2022.10051796","DOIUrl":null,"url":null,"abstract":"This study proposes a series of models for reconstructing 3D point clouds from a single 2D image to obtain the best network model. Four and six improved models are proposed for the encoder and decoder of 3D-LMNet and 3D-PSRNet, respectively, which combine various modules of such encoders and decoders and analyze the relationship of each parameter to the network layer. Optimal allocation parameters are proposed, and four training types are presented for the encoder and decoder to obtain the best model. The model adds a fifth convolution layer to the 3D-PSRNet coding layer. This layer has 512 layers. The convolution size is set to 5 × 5 and the stride is 2. The proposed model does not require professional hardware equipment and cumbersome manual procedures.","PeriodicalId":425462,"journal":{"name":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD54884.2022.10051796","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a series of models for reconstructing 3D point clouds from a single 2D image to obtain the best network model. Four and six improved models are proposed for the encoder and decoder of 3D-LMNet and 3D-PSRNet, respectively, which combine various modules of such encoders and decoders and analyze the relationship of each parameter to the network layer. Optimal allocation parameters are proposed, and four training types are presented for the encoder and decoder to obtain the best model. The model adds a fifth convolution layer to the 3D-PSRNet coding layer. This layer has 512 layers. The convolution size is set to 5 × 5 and the stride is 2. The proposed model does not require professional hardware equipment and cumbersome manual procedures.