{"title":"基于形状的视频序列自动标注与模糊索引","authors":"Vinay Chander, S. Tapaswi","doi":"10.1109/ICIS.2010.16","DOIUrl":null,"url":null,"abstract":"Multimedia content available on the web is increasing at a rapid rate. At this rate, indexing video content manually for search and retrieval is getting increasingly difficult due to the high cost and time that it demands. In this paper, we propose a scheme for automatic annotation and indexing of video content with a goal of supporting efficient retrieval. The framework performs the annotation and indexing by operating on the key representative frames of a video sequence. The proposed framework broadly involves three major steps. The first step involves pre-processing, segmenting a video frame into objects and obtaining the boundary coordinates. In the second step, shape descriptors are computed by applying discrete Fourier transform on the centroid distance function. In the third and the last step, template based matching and fuzzy indexing of the video is performed. Simulations were carried out for evaluating the performance of both pre-processing and annotation using the database-MPEG7 CE Shape-1 Part B. The results show that the framework could effectively perform annotation.","PeriodicalId":338038,"journal":{"name":"2010 IEEE/ACIS 9th International Conference on Computer and Information Science","volume":"121 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Shape Based Automatic Annotation and Fuzzy Indexing of Video Sequences\",\"authors\":\"Vinay Chander, S. Tapaswi\",\"doi\":\"10.1109/ICIS.2010.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multimedia content available on the web is increasing at a rapid rate. At this rate, indexing video content manually for search and retrieval is getting increasingly difficult due to the high cost and time that it demands. In this paper, we propose a scheme for automatic annotation and indexing of video content with a goal of supporting efficient retrieval. The framework performs the annotation and indexing by operating on the key representative frames of a video sequence. The proposed framework broadly involves three major steps. The first step involves pre-processing, segmenting a video frame into objects and obtaining the boundary coordinates. In the second step, shape descriptors are computed by applying discrete Fourier transform on the centroid distance function. In the third and the last step, template based matching and fuzzy indexing of the video is performed. Simulations were carried out for evaluating the performance of both pre-processing and annotation using the database-MPEG7 CE Shape-1 Part B. The results show that the framework could effectively perform annotation.\",\"PeriodicalId\":338038,\"journal\":{\"name\":\"2010 IEEE/ACIS 9th International Conference on Computer and Information Science\",\"volume\":\"121 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE/ACIS 9th International Conference on Computer and Information Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIS.2010.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE/ACIS 9th International Conference on Computer and Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2010.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
摘要
网络上可用的多媒体内容正在快速增长。在这种速度下,手动索引视频内容进行搜索和检索变得越来越困难,因为它需要高成本和时间。本文提出了一种视频内容的自动标注和索引方案,以支持高效的检索。该框架通过操作视频序列的关键代表帧来执行注释和索引。拟议的框架大致包括三个主要步骤。第一步包括预处理,将视频帧分割成对象并获得边界坐标。第二步,通过对质心距离函数进行离散傅里叶变换计算形状描述子。第三步,也是最后一步,对视频进行基于模板的匹配和模糊索引。利用mpeg7 CE Shape-1 Part b数据库对预处理和标注性能进行了仿真评估,结果表明该框架能够有效地进行标注。
Shape Based Automatic Annotation and Fuzzy Indexing of Video Sequences
Multimedia content available on the web is increasing at a rapid rate. At this rate, indexing video content manually for search and retrieval is getting increasingly difficult due to the high cost and time that it demands. In this paper, we propose a scheme for automatic annotation and indexing of video content with a goal of supporting efficient retrieval. The framework performs the annotation and indexing by operating on the key representative frames of a video sequence. The proposed framework broadly involves three major steps. The first step involves pre-processing, segmenting a video frame into objects and obtaining the boundary coordinates. In the second step, shape descriptors are computed by applying discrete Fourier transform on the centroid distance function. In the third and the last step, template based matching and fuzzy indexing of the video is performed. Simulations were carried out for evaluating the performance of both pre-processing and annotation using the database-MPEG7 CE Shape-1 Part B. The results show that the framework could effectively perform annotation.