结合视频摘要和运动结构的非结构化视频自动三维重建

Q1 Computer Science

Frontiers in ICT Pub Date : 2018-11-06 DOI:10.3389/fict.2018.00029

A. Doulamis

{"title":"结合视频摘要和运动结构的非结构化视频自动三维重建","authors":"A. Doulamis","doi":"10.3389/fict.2018.00029","DOIUrl":null,"url":null,"abstract":"Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □","PeriodicalId":37157,"journal":{"name":"Frontiers in ICT","volume":"101 1","pages":"29"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic 3D Reconstruction From Unstructured Videos Combining Video Summarization and Structure From Motion\",\"authors\":\"A. Doulamis\",\"doi\":\"10.3389/fict.2018.00029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □\",\"PeriodicalId\":37157,\"journal\":{\"name\":\"Frontiers in ICT\",\"volume\":\"101 1\",\"pages\":\"29\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in ICT\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fict.2018.00029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in ICT","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fict.2018.00029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 2

摘要

社交媒体和大量多媒体数据的收集，如图像、视频和随附的文本，在当今社会是至关重要的。这是由人类与他人交流的能力所激发的。利用如此庞大的多媒体容量，一个有用的范例是利用简单用户拍摄的短视频，对遗址、历史文化城市/地区或感兴趣的物体进行3D重建和建模，主要用于个人或旅游目的。本研究的主要挑战是视频的非结构化性质，以及它们包含的许多信息与我们要求的3D模型对象无关，而是用于个人使用，例如人类在对象前面，天气条件等。在本文中，我们提出了一种自动方案，通过收集主要用于旅游目的的短时间视频池，对感兴趣的对象进行3D建模/重建。首先介绍了一种基于判别主成分分析(d-PCA)的视频摘要算法。这种创新方案的目标是提取帧，以便每个视频簇中的包含引用相同对象的内容的视频呈现最大的图像数据一致性，而跨簇的内容呈现最小的图像数据一致性。在文物上的实验结果表明，该方法可以有效地利用非结构化的图像内容信息对感兴趣的资产进行三维重建。□

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic 3D Reconstruction From Unstructured Videos Combining Video Summarization and Structure From Motion

Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in ICT Computer Science-Computer Networks and Communications

自引率

0.00%

发文量