基于深度学习技术的可移动遗产数字化多类语义分割

IF 1.6 0 ARCHAEOLOGY

Virtual Archaeology Review Pub Date : 2021-05-12 DOI:10.4995/VAR.2021.15329

G. Patrucco, F. Setragno

{"title":"基于深度学习技术的可移动遗产数字化多类语义分割","authors":"G. Patrucco, F. Setragno","doi":"10.4995/VAR.2021.15329","DOIUrl":null,"url":null,"abstract":"Digitisation processes of movable heritage are becoming increasingly popular to document the artworks stored in our museums. A growing number of strategies for the three-dimensional (3D) acquisition and modelling of these invaluable assets have been developed in the last few years. Their objective is to efficiently respond to this documentation need and contribute to deepening the knowledge of the masterpieces investigated constantly by researchers operating in many fieldworks. Nowadays, one of the most effective solutions is represented by the development of image-based techniques, usually connected to a Structure-from-Motion (SfM) photogrammetric approach. However, while images acquisition is relatively rapid, the processes connected to data processing are very time-consuming and require the operator’s substantial manual involvement. Developing deep learning-based strategies can be an effective solution to enhance the automatism level. In this research, which has been carried out in the framework of the digitisation of a wooden maquettes collection stored in the ‘Museo Egizio di Torino’, using a photogrammetric approach, an automatic masking strategy using deep learning techniques is proposed, to increase the level of automatism and therefore, optimise the photogrammetric pipeline. Starting from a manually annotated dataset, a neural network was trained to automatically perform a semantic classification to isolate the maquettes from the background. The proposed methodology allowed the researchers to obtain automatically segmented masks with a high degree of accuracy. The workflow is described (as regards acquisition strategies, dataset processing, and neural network training). In addition, the accuracy of the results is evaluated and discussed. Finally, the researchers proposed the possibility of performing a multiclass segmentation on the digital images to recognise different object categories in the images, as well as to define a semantic hierarchy to perform automatic classification of different elements in the acquired images.Highlights:In the framework of movable heritage digitisation processes, many procedures are very time-consuming, and they still require the operator’s substantial manual involvement.This research proposes using deep learning techniques to enhance the automatism level in the generation of exclusion masks, improving the optimisation of the photogrammetric procedures.Following this strategy, the possibility of performing a multiclass semantic segmentation (on the 2D images and, consequently, on the 3D point cloud) is also discussed, considering the accuracy of the obtainable results.","PeriodicalId":44206,"journal":{"name":"Virtual Archaeology Review","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2021-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Multiclass semantic segmentation for digitisation of movable heritage using deep learning techniques\",\"authors\":\"G. Patrucco, F. Setragno\",\"doi\":\"10.4995/VAR.2021.15329\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digitisation processes of movable heritage are becoming increasingly popular to document the artworks stored in our museums. A growing number of strategies for the three-dimensional (3D) acquisition and modelling of these invaluable assets have been developed in the last few years. Their objective is to efficiently respond to this documentation need and contribute to deepening the knowledge of the masterpieces investigated constantly by researchers operating in many fieldworks. Nowadays, one of the most effective solutions is represented by the development of image-based techniques, usually connected to a Structure-from-Motion (SfM) photogrammetric approach. However, while images acquisition is relatively rapid, the processes connected to data processing are very time-consuming and require the operator’s substantial manual involvement. Developing deep learning-based strategies can be an effective solution to enhance the automatism level. In this research, which has been carried out in the framework of the digitisation of a wooden maquettes collection stored in the ‘Museo Egizio di Torino’, using a photogrammetric approach, an automatic masking strategy using deep learning techniques is proposed, to increase the level of automatism and therefore, optimise the photogrammetric pipeline. Starting from a manually annotated dataset, a neural network was trained to automatically perform a semantic classification to isolate the maquettes from the background. The proposed methodology allowed the researchers to obtain automatically segmented masks with a high degree of accuracy. The workflow is described (as regards acquisition strategies, dataset processing, and neural network training). In addition, the accuracy of the results is evaluated and discussed. Finally, the researchers proposed the possibility of performing a multiclass segmentation on the digital images to recognise different object categories in the images, as well as to define a semantic hierarchy to perform automatic classification of different elements in the acquired images.Highlights:In the framework of movable heritage digitisation processes, many procedures are very time-consuming, and they still require the operator’s substantial manual involvement.This research proposes using deep learning techniques to enhance the automatism level in the generation of exclusion masks, improving the optimisation of the photogrammetric procedures.Following this strategy, the possibility of performing a multiclass semantic segmentation (on the 2D images and, consequently, on the 3D point cloud) is also discussed, considering the accuracy of the obtainable results.\",\"PeriodicalId\":44206,\"journal\":{\"name\":\"Virtual Archaeology Review\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2021-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virtual Archaeology Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4995/VAR.2021.15329\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ARCHAEOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Archaeology Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4995/VAR.2021.15329","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ARCHAEOLOGY","Score":null,"Total":0}

引用次数: 3

摘要

可移动文物的数码化程序越来越受欢迎，以记录博物馆内的艺术品。在过去的几年里，越来越多的三维(3D)获取和建模这些宝贵资产的策略已经发展起来。他们的目标是有效地回应这一文献需求，并有助于加深研究人员在许多实地工作中不断调查的杰作的知识。目前，最有效的解决方案之一是基于图像的技术的发展，通常与运动结构(SfM)摄影测量方法相关联。然而，虽然图像采集相对较快，但与数据处理相关的过程非常耗时，并且需要操作员大量的人工参与。开发基于深度学习的策略是提高自动化水平的有效途径。在这项研究中，该研究是在“都灵博物馆”中存储的木制模型的数字化框架内进行的，使用摄影测量方法，提出了一种使用深度学习技术的自动掩蔽策略，以提高自动化水平，从而优化摄影测量管道。从手动标注的数据集开始，训练神经网络自动执行语义分类，将模型从背景中分离出来。所提出的方法使研究人员能够以高精度获得自动分割的掩模。描述了工作流(关于获取策略、数据集处理和神经网络训练)。此外，还对结果的准确性进行了评价和讨论。最后，研究人员提出了对数字图像进行多类分割以识别图像中不同对象类别的可能性，以及定义语义层次以对获取的图像中的不同元素进行自动分类的可能性。亮点:在可移动遗产数字化流程的框架中，许多程序非常耗时，并且仍然需要操作员大量的人工参与。本研究提出使用深度学习技术来提高排除掩模生成的自动化水平，提高摄影测量程序的优化程度。在此策略下，考虑到可获得结果的准确性，还讨论了执行多类语义分割(在2D图像上，因此在3D点云上)的可能性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multiclass semantic segmentation for digitisation of movable heritage using deep learning techniques

Digitisation processes of movable heritage are becoming increasingly popular to document the artworks stored in our museums. A growing number of strategies for the three-dimensional (3D) acquisition and modelling of these invaluable assets have been developed in the last few years. Their objective is to efficiently respond to this documentation need and contribute to deepening the knowledge of the masterpieces investigated constantly by researchers operating in many fieldworks. Nowadays, one of the most effective solutions is represented by the development of image-based techniques, usually connected to a Structure-from-Motion (SfM) photogrammetric approach. However, while images acquisition is relatively rapid, the processes connected to data processing are very time-consuming and require the operator’s substantial manual involvement. Developing deep learning-based strategies can be an effective solution to enhance the automatism level. In this research, which has been carried out in the framework of the digitisation of a wooden maquettes collection stored in the ‘Museo Egizio di Torino’, using a photogrammetric approach, an automatic masking strategy using deep learning techniques is proposed, to increase the level of automatism and therefore, optimise the photogrammetric pipeline. Starting from a manually annotated dataset, a neural network was trained to automatically perform a semantic classification to isolate the maquettes from the background. The proposed methodology allowed the researchers to obtain automatically segmented masks with a high degree of accuracy. The workflow is described (as regards acquisition strategies, dataset processing, and neural network training). In addition, the accuracy of the results is evaluated and discussed. Finally, the researchers proposed the possibility of performing a multiclass segmentation on the digital images to recognise different object categories in the images, as well as to define a semantic hierarchy to perform automatic classification of different elements in the acquired images.Highlights:In the framework of movable heritage digitisation processes, many procedures are very time-consuming, and they still require the operator’s substantial manual involvement.This research proposes using deep learning techniques to enhance the automatism level in the generation of exclusion masks, improving the optimisation of the photogrammetric procedures.Following this strategy, the possibility of performing a multiclass semantic segmentation (on the 2D images and, consequently, on the 3D point cloud) is also discussed, considering the accuracy of the obtainable results.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Virtual Archaeology Review ARCHAEOLOGY-

CiteScore

5.20

自引率

21.70%

发文量

审稿时长

20 weeks

期刊介绍： Virtual Archaeology Review (VAR) aims the publication of original papers, interdisciplinary reviews and essays on the new discipline of virtual archaeology, which is continuously evolving and currently on its way to achieve scientific consolidation. In fact, Virtual Archaeology deals with the digital representation of historical heritage objects, buildings and landscapes through 3D acquisition, digital recording and interactive and immersive tools for analysis, interpretation, dissemination and communication purposes by means of multidimensional geometric properties and visual computational modelling. VAR will publish full-length original papers which reflect both current research and practice throughout the world, in order to contribute to the advancement of the new field of virtual archaeology, ranging from new ways of digital recording and documentation, advanced reconstruction and 3D modelling up to cyber-archaeology, virtual exhibitions and serious gaming. Thus acceptable material may emerge from interesting applications as well as from original developments or research. OBJECTIVES: - OFFER researchers working in the field of virtual archaeology and cultural heritage an appropriate editorial frame to publish state-of-the-art research works, as well as theoretical and methodological contributions. - GATHER virtual archaeology progresses achieved as a new international scientific discipline. - ENCOURAGE the publication of the latest, state-of-the-art, significant research and meaningful applications in the field of virtual archaeology. - ENHANCE international connections in the field of virtual archaeology and cultural heritage.