{"title":"结合视频摘要和运动结构的非结构化视频自动三维重建","authors":"A. Doulamis","doi":"10.3389/fict.2018.00029","DOIUrl":null,"url":null,"abstract":"Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □","PeriodicalId":37157,"journal":{"name":"Frontiers in ICT","volume":"101 1","pages":"29"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic 3D Reconstruction From Unstructured Videos Combining Video Summarization and Structure From Motion\",\"authors\":\"A. Doulamis\",\"doi\":\"10.3389/fict.2018.00029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □\",\"PeriodicalId\":37157,\"journal\":{\"name\":\"Frontiers in ICT\",\"volume\":\"101 1\",\"pages\":\"29\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in ICT\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fict.2018.00029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in ICT","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fict.2018.00029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
Automatic 3D Reconstruction From Unstructured Videos Combining Video Summarization and Structure From Motion
Social media and collection of large volumes of multimedia data such as images, videos and the accompanying text is of prime importance in today’s society. This is stimulated by the power of the humans to communicate one with the others. A useful paradigm of exploitation of such a huge amount of multimedia volumes is the 3D reconstruction and modelling of sites, historical cultural cities/regions or objects of interest from the short videos captured by simple users mainly for personal or touristic purposes. The main challenge in this research is the unstructured nature of the videos and the fact that they contain many information which is not related with the object the 3D model we ask for but for personal usage such as humans in front of the objects, weather conditions, etc. In this article, we propose an automatic scheme for 3D modelling/reconstruction of objects of interest by collecting pools of short duration videos that have been captured mainly for touristic purposes. Initially a video summarization algorithm is introduced using a discriminant Principal Component Analysis (d-PCA). The goal of this innovative scheme is to extract the frames so that bunches within each video cluster that contains videos of content referring to the same object present the maximum coherency of image data while content across bunches the minimum one. Experimental results on cultural objects indicate the efficiency pf the proposed method to 3D reconstruct assets of interest using an unstructured image content information. □