{"title":"基于两阶段多层次深度网络的农业多视角三维重建","authors":"Li Guo, Yi-ping Shi, Dinfei Jin","doi":"10.5121/csit.2023.130610","DOIUrl":null,"url":null,"abstract":"To address the problems appearing in multi-view three-dimensional (3D) reconstruction, such as the improvement of the accuracy and completeness of the 3D reconstructed images, a two-stage multi-level depth network is proposed. In the stage 1 of the proposed network, several convolutional block attention modules (CBAMs) are applied in the lateral connections of the feature pyramid network (FPN). This is targeted to enhance the spatial and channel relativity of the different hierarchical feature maps so as to bring more semantic information. In the stage 2, the obtained multi-scale feature maps in the stage 1 are tackled by a set of cascaded processing procedures, such as adaptive propagation, single-trees transform, and matching cost computation. As a result, a depth map could be generated and then be further refined in the processing. Comparing with other state-of-the-art methods, the subjective and objective experiments based on the DTU dataset show that our method performs better result in completeness meanwhile maintaining a considerable overall metric. The investigation of applying the proposed method for reconstructing agricultural crop images was carried out, which is based on a set of self-collected images. The experiment shows that a suitable human visual perception for the images could be obtained.","PeriodicalId":110134,"journal":{"name":"Advanced Information Technologies and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-View Three-Dimensional Reconstruction based on a Two-Stage Multi-Level Depth Network for Agriculture Applications\",\"authors\":\"Li Guo, Yi-ping Shi, Dinfei Jin\",\"doi\":\"10.5121/csit.2023.130610\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problems appearing in multi-view three-dimensional (3D) reconstruction, such as the improvement of the accuracy and completeness of the 3D reconstructed images, a two-stage multi-level depth network is proposed. In the stage 1 of the proposed network, several convolutional block attention modules (CBAMs) are applied in the lateral connections of the feature pyramid network (FPN). This is targeted to enhance the spatial and channel relativity of the different hierarchical feature maps so as to bring more semantic information. In the stage 2, the obtained multi-scale feature maps in the stage 1 are tackled by a set of cascaded processing procedures, such as adaptive propagation, single-trees transform, and matching cost computation. As a result, a depth map could be generated and then be further refined in the processing. Comparing with other state-of-the-art methods, the subjective and objective experiments based on the DTU dataset show that our method performs better result in completeness meanwhile maintaining a considerable overall metric. The investigation of applying the proposed method for reconstructing agricultural crop images was carried out, which is based on a set of self-collected images. The experiment shows that a suitable human visual perception for the images could be obtained.\",\"PeriodicalId\":110134,\"journal\":{\"name\":\"Advanced Information Technologies and Applications\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Information Technologies and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/csit.2023.130610\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Information Technologies and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2023.130610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-View Three-Dimensional Reconstruction based on a Two-Stage Multi-Level Depth Network for Agriculture Applications
To address the problems appearing in multi-view three-dimensional (3D) reconstruction, such as the improvement of the accuracy and completeness of the 3D reconstructed images, a two-stage multi-level depth network is proposed. In the stage 1 of the proposed network, several convolutional block attention modules (CBAMs) are applied in the lateral connections of the feature pyramid network (FPN). This is targeted to enhance the spatial and channel relativity of the different hierarchical feature maps so as to bring more semantic information. In the stage 2, the obtained multi-scale feature maps in the stage 1 are tackled by a set of cascaded processing procedures, such as adaptive propagation, single-trees transform, and matching cost computation. As a result, a depth map could be generated and then be further refined in the processing. Comparing with other state-of-the-art methods, the subjective and objective experiments based on the DTU dataset show that our method performs better result in completeness meanwhile maintaining a considerable overall metric. The investigation of applying the proposed method for reconstructing agricultural crop images was carried out, which is based on a set of self-collected images. The experiment shows that a suitable human visual perception for the images could be obtained.