Novel VSLAM positioning through synthetic data and deep learning: Applications in virtual archaeology, ArQVIA
Alicia Colmenero-Fernández
Journal of Cultural Heritage, Volume 73, Pages 347-357, published 2025-05-01
DOI: 10.1016/j.culher.2025.04.004
https://www.sciencedirect.com/science/article/pii/S1296207425000615
Virtual archaeology enhances the visitor’s understanding of historical sites through 3D reconstructions of original locations. However, a major obstacle to interactive visualization is the VSLAM (Visual Simultaneous Localization and Mapping) problem, which revolves around accurately matching real-world positions with scaled virtual replicas. Current methods—such as marker recognition or image feature comparison against keyframes or point clouds—often involve high computational costs, complex setups, or software dependencies (embedded systems, GPUs, library compilations).
Our proposed method leverages Content-Based Image Retrieval (CBIR) to match RGB input images taken on-site with features learned from photogrammetric models. A single image is transmitted to a server-based system through an easy-to-use request-response interface guided by the gyroscope. After the match, a 360-degree panorama aligned with the nearest camera perspective is returned. This approach is demonstrated in the mobile app ArQVIA, which makes 3D visualizations accessible and interactive on any monocular device without the need for specialized hardware.
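The retrieval step described above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the feature extractor, the similarity measure, or the index layout, so everything here is an assumption. The sketch assumes each rendered view of the photogrammetric model has a precomputed descriptor vector, uses cosine similarity as the matching score, and uses hypothetical names (`ReferenceView`, `retrieve_nearest_view`, the panorama identifiers) and made-up descriptor values.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ReferenceView:
    """One rendered view of the photogrammetric model (hypothetical record)."""
    panorama_id: str   # identifier of the 360-degree panorama to return
    descriptor: tuple  # feature vector for this view, assumed precomputed


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_nearest_view(query_descriptor, index):
    """Return (best_view, score) for the reference view most similar to the query."""
    if not index:
        raise ValueError("reference index is empty")
    best = max(index, key=lambda v: cosine_similarity(query_descriptor, v.descriptor))
    return best, cosine_similarity(query_descriptor, best.descriptor)


# Toy index: three views with made-up 3-dimensional descriptors.
# A real system would store descriptors produced by a learned model.
INDEX = [
    ReferenceView("pano_entrance", (1.0, 0.0, 0.0)),
    ReferenceView("pano_courtyard", (0.0, 1.0, 0.0)),
    ReferenceView("pano_tower", (0.0, 0.0, 1.0)),
]

if __name__ == "__main__":
    # The query descriptor stands in for features extracted from an on-site photo.
    view, score = retrieve_nearest_view((0.1, 0.9, 0.2), INDEX)
    print(view.panorama_id, round(score, 3))
```

A linear scan is enough for a toy index. A deployment with many reference views would more plausibly use an approximate nearest-neighbour index on the server, which is also an assumption rather than something the abstract states.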
About the journal:
The Journal of Cultural Heritage publishes original papers which comprise previously unpublished data and present innovative methods concerning all aspects of science and technology of cultural heritage as well as interpretation and theoretical issues related to preservation.