从多媒体内容中自动提取元数据的系统方法

2016 IEEE Symposium Series on Computational Intelligence (SSCI) Pub Date : 2016-12-06 DOI:10.1109/SSCI.2016.7849983

Christos Varytimidis, Georgios Tsatiris, Konstantinos Rapantzikos, S. Kollias

{"title":"从多媒体内容中自动提取元数据的系统方法","authors":"Christos Varytimidis, Georgios Tsatiris, Konstantinos Rapantzikos, S. Kollias","doi":"10.1109/SSCI.2016.7849983","DOIUrl":null,"url":null,"abstract":"There is a need for automatic processing and extracting of meaningful metadata from multimedia information, especially in the audiovisual industry. This higher level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis and automatic annotation within a multimedia processing and publishing framework. This system is comprised of three modules: the first provides detection of faces and recognition of known persons; the second provides generic object detection, based on a deep convolutional neural network topology; the third provides automated location estimation and landmark recognition based on state-of-the-art technologies. The results are exported in meaningful metadata that can be utilized in various ways. The system has been developed and successfully tested in the framework of the EC Horizon 2020 Mecanex project, targeting advertising and production markets.","PeriodicalId":120288,"journal":{"name":"2016 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A systemic approach to automatic metadata extraction from multimedia content\",\"authors\":\"Christos Varytimidis, Georgios Tsatiris, Konstantinos Rapantzikos, S. Kollias\",\"doi\":\"10.1109/SSCI.2016.7849983\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There is a need for automatic processing and extracting of meaningful metadata from multimedia information, especially in the audiovisual industry. This higher level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis and automatic annotation within a multimedia processing and publishing framework. This system is comprised of three modules: the first provides detection of faces and recognition of known persons; the second provides generic object detection, based on a deep convolutional neural network topology; the third provides automated location estimation and landmark recognition based on state-of-the-art technologies. The results are exported in meaningful metadata that can be utilized in various ways. The system has been developed and successfully tested in the framework of the EC Horizon 2020 Mecanex project, targeting advertising and production markets.\",\"PeriodicalId\":120288,\"journal\":{\"name\":\"2016 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSCI.2016.7849983\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI.2016.7849983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

从多媒体信息中自动处理和提取有意义的元数据是一种迫切需要，尤其是在视听行业。这种高级信息用于各种实践，例如用外部链接、可点击对象和有用的相关信息来丰富多媒体内容。本文提出了一个在多媒体处理和发布框架下的高效多媒体内容分析和自动注释系统。该系统由三个模块组成:第一个模块提供人脸检测和已知人员的识别;第二种提供基于深度卷积神经网络拓扑的通用目标检测;第三个提供基于最先进技术的自动位置估计和地标识别。结果导出为有意义的元数据，可以以各种方式使用这些元数据。该系统已在EC Horizon 2020 Mecanex项目框架下开发并成功测试，目标是广告和生产市场。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A systemic approach to automatic metadata extraction from multimedia content

There is a need for automatic processing and extracting of meaningful metadata from multimedia information, especially in the audiovisual industry. This higher level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis and automatic annotation within a multimedia processing and publishing framework. This system is comprised of three modules: the first provides detection of faces and recognition of known persons; the second provides generic object detection, based on a deep convolutional neural network topology; the third provides automated location estimation and landmark recognition based on state-of-the-art technologies. The results are exported in meaningful metadata that can be utilized in various ways. The system has been developed and successfully tested in the framework of the EC Horizon 2020 Mecanex project, targeting advertising and production markets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

自引率

0.00%

发文量