{"title":"Transfer learning and performance enhancement techniques for deep semantic segmentation of built heritage point clouds","authors":"F. Matrone, M. Martini","doi":"10.4995/VAR.2021.15318","DOIUrl":null,"url":null,"abstract":"The growing availability of three-dimensional (3D) data, such as point clouds, coming from Light Detection and Ranging (LiDAR), Mobile Mapping Systems (MMSs) or Unmanned Aerial Vehicles (UAVs), provides the opportunity to rapidly generate 3D models to support the restoration, conservation, and safeguarding activities of cultural heritage (CH). The so-called scan-to-BIM process can, in fact, benefit from such data, and they can themselves be a source for further analyses or activities on the archaeological and built heritage. There are several ways to exploit this type of data, such as Historic Building Information Modelling (HBIM), mesh creation, rasterisation, classification, and semantic segmentation. The latter, referring to point clouds, is a trending topic not only in the CH domain but also in other fields like autonomous navigation, medicine or retail. Precisely in these sectors, the task of semantic segmentation has been mainly exploited and developed with artificial intelligence techniques. In particular, machine learning (ML) algorithms, and their deep learning (DL) subset, are increasingly applied and have established a solid state-of-the-art in the last half-decade. However, applications of DL techniques on heritage point clouds are still scarce; therefore, we propose to tackle this framework within the built heritage field. Starting from some previous tests with the Dynamic Graph Convolutional Neural Network (DGCNN), in this contribution close attention is paid to: i) the investigation of fine-tuned models, used as a transfer learning technique, ii) the combination of external classifiers, such as Random Forest (RF), with the artificial neural network, and iii) the evaluation of the data augmentation results for the domain-specific ArCH dataset. Finally, after taking into account the main advantages and criticalities, considerations are made on the possibility to profit by this methodology also for non-programming or domain experts.Highlights:Semantic segmentation of built heritage point clouds through deep neural networks can provide performances comparable to those of more consolidated state-of-the-art ML classifiers.Transfer learning approaches, as fine-tuning, can considerably reduce computational time also for CH domain-specific datasets, as well as improve metrics for some challenging categories (i.e. windows or mouldings).Data augmentation techniques do not significantly improve overall performances.","PeriodicalId":44206,"journal":{"name":"Virtual Archaeology Review","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2021-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Archaeology Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4995/VAR.2021.15318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ARCHAEOLOGY","Score":null,"Total":0}
引用次数: 3
Abstract
The growing availability of three-dimensional (3D) data, such as point clouds, coming from Light Detection and Ranging (LiDAR), Mobile Mapping Systems (MMSs) or Unmanned Aerial Vehicles (UAVs), provides the opportunity to rapidly generate 3D models to support the restoration, conservation, and safeguarding activities of cultural heritage (CH). The so-called scan-to-BIM process can, in fact, benefit from such data, and they can themselves be a source for further analyses or activities on the archaeological and built heritage. There are several ways to exploit this type of data, such as Historic Building Information Modelling (HBIM), mesh creation, rasterisation, classification, and semantic segmentation. The latter, referring to point clouds, is a trending topic not only in the CH domain but also in other fields like autonomous navigation, medicine or retail. Precisely in these sectors, the task of semantic segmentation has been mainly exploited and developed with artificial intelligence techniques. In particular, machine learning (ML) algorithms, and their deep learning (DL) subset, are increasingly applied and have established a solid state-of-the-art in the last half-decade. However, applications of DL techniques on heritage point clouds are still scarce; therefore, we propose to tackle this framework within the built heritage field. Starting from some previous tests with the Dynamic Graph Convolutional Neural Network (DGCNN), in this contribution close attention is paid to: i) the investigation of fine-tuned models, used as a transfer learning technique, ii) the combination of external classifiers, such as Random Forest (RF), with the artificial neural network, and iii) the evaluation of the data augmentation results for the domain-specific ArCH dataset. Finally, after taking into account the main advantages and criticalities, considerations are made on the possibility to profit by this methodology also for non-programming or domain experts.Highlights:Semantic segmentation of built heritage point clouds through deep neural networks can provide performances comparable to those of more consolidated state-of-the-art ML classifiers.Transfer learning approaches, as fine-tuning, can considerably reduce computational time also for CH domain-specific datasets, as well as improve metrics for some challenging categories (i.e. windows or mouldings).Data augmentation techniques do not significantly improve overall performances.
期刊介绍:
Virtual Archaeology Review (VAR) aims the publication of original papers, interdisciplinary reviews and essays on the new discipline of virtual archaeology, which is continuously evolving and currently on its way to achieve scientific consolidation. In fact, Virtual Archaeology deals with the digital representation of historical heritage objects, buildings and landscapes through 3D acquisition, digital recording and interactive and immersive tools for analysis, interpretation, dissemination and communication purposes by means of multidimensional geometric properties and visual computational modelling. VAR will publish full-length original papers which reflect both current research and practice throughout the world, in order to contribute to the advancement of the new field of virtual archaeology, ranging from new ways of digital recording and documentation, advanced reconstruction and 3D modelling up to cyber-archaeology, virtual exhibitions and serious gaming. Thus acceptable material may emerge from interesting applications as well as from original developments or research. OBJECTIVES: - OFFER researchers working in the field of virtual archaeology and cultural heritage an appropriate editorial frame to publish state-of-the-art research works, as well as theoretical and methodological contributions. - GATHER virtual archaeology progresses achieved as a new international scientific discipline. - ENCOURAGE the publication of the latest, state-of-the-art, significant research and meaningful applications in the field of virtual archaeology. - ENHANCE international connections in the field of virtual archaeology and cultural heritage.