{"title":"利用无监督深度估计重建中国古建筑的局部结构","authors":"Xiaoling Yao, Lihua Hu, Jifu Zhang","doi":"10.1186/s40494-024-01433-9","DOIUrl":null,"url":null,"abstract":"<p>Digitalization of ancient architectures is one of the effective means for the preservation of heritage structures, with 3D reconstruction based on computer vision being a key component of such digitalization techniques. However, Chinese ancient architectures are located in mountainous areas, and existing 3D reconstruction methods fall short in restoring the local structures of these architectures. This paper proposes a self-attention-guided unsupervised single image-based depth estimation method, providing innovative technical support for the reconstruction of local structures in Chinese ancient architectures. First, an attention module is constructed based on features extracted from architectural images learned by the encoder, and then embedded into the encoder-decoder to capture the interdependencies across local features. Second, a disparity map is generated using the loss constraint network, including reconstruction matching, smoothness of the disparity, and left-right disparity consistency. Third, an unsupervised architecture based on binocular image pairs is constructed to remove any potential adverse effects due to unknown scale or estimated pose errors. Finally, with the known baseline distance and camera focal length, the disparity map is converted into the depth map to perform the end-to-end depth estimation from a single image. Experiments on the our architecture dataset validates our method, and it performs well also well on KITTI.</p>","PeriodicalId":13109,"journal":{"name":"Heritage Science","volume":"75 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reconstructing the local structures of Chinese ancient architecture using unsupervised depth estimation\",\"authors\":\"Xiaoling Yao, Lihua Hu, Jifu Zhang\",\"doi\":\"10.1186/s40494-024-01433-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Digitalization of ancient architectures is one of the effective means for the preservation of heritage structures, with 3D reconstruction based on computer vision being a key component of such digitalization techniques. However, Chinese ancient architectures are located in mountainous areas, and existing 3D reconstruction methods fall short in restoring the local structures of these architectures. This paper proposes a self-attention-guided unsupervised single image-based depth estimation method, providing innovative technical support for the reconstruction of local structures in Chinese ancient architectures. First, an attention module is constructed based on features extracted from architectural images learned by the encoder, and then embedded into the encoder-decoder to capture the interdependencies across local features. Second, a disparity map is generated using the loss constraint network, including reconstruction matching, smoothness of the disparity, and left-right disparity consistency. Third, an unsupervised architecture based on binocular image pairs is constructed to remove any potential adverse effects due to unknown scale or estimated pose errors. Finally, with the known baseline distance and camera focal length, the disparity map is converted into the depth map to perform the end-to-end depth estimation from a single image. Experiments on the our architecture dataset validates our method, and it performs well also well on KITTI.</p>\",\"PeriodicalId\":13109,\"journal\":{\"name\":\"Heritage Science\",\"volume\":\"75 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Heritage Science\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1186/s40494-024-01433-9\",\"RegionNum\":1,\"RegionCategory\":\"艺术学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heritage Science","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s40494-024-01433-9","RegionNum":1,"RegionCategory":"艺术学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Reconstructing the local structures of Chinese ancient architecture using unsupervised depth estimation
Digitalization of ancient architectures is one of the effective means for the preservation of heritage structures, with 3D reconstruction based on computer vision being a key component of such digitalization techniques. However, Chinese ancient architectures are located in mountainous areas, and existing 3D reconstruction methods fall short in restoring the local structures of these architectures. This paper proposes a self-attention-guided unsupervised single image-based depth estimation method, providing innovative technical support for the reconstruction of local structures in Chinese ancient architectures. First, an attention module is constructed based on features extracted from architectural images learned by the encoder, and then embedded into the encoder-decoder to capture the interdependencies across local features. Second, a disparity map is generated using the loss constraint network, including reconstruction matching, smoothness of the disparity, and left-right disparity consistency. Third, an unsupervised architecture based on binocular image pairs is constructed to remove any potential adverse effects due to unknown scale or estimated pose errors. Finally, with the known baseline distance and camera focal length, the disparity map is converted into the depth map to perform the end-to-end depth estimation from a single image. Experiments on the our architecture dataset validates our method, and it performs well also well on KITTI.
期刊介绍:
Heritage Science is an open access journal publishing original peer-reviewed research covering:
Understanding of the manufacturing processes, provenances, and environmental contexts of material types, objects, and buildings, of cultural significance including their historical significance.
Understanding and prediction of physico-chemical and biological degradation processes of cultural artefacts, including climate change, and predictive heritage studies.
Development and application of analytical and imaging methods or equipments for non-invasive, non-destructive or portable analysis of artwork and objects of cultural significance to identify component materials, degradation products and deterioration markers.
Development and application of invasive and destructive methods for understanding the provenance of objects of cultural significance.
Development and critical assessment of treatment materials and methods for artwork and objects of cultural significance.
Development and application of statistical methods and algorithms for data analysis to further understanding of culturally significant objects.
Publication of reference and corpus datasets as supplementary information to the statistical and analytical studies above.
Description of novel technologies that can assist in the understanding of cultural heritage.