2-DOM: A 2-Dimensional Object Model towards Web Image Annotation
S. Alcic, Stefan Conrad
2008 Third International Workshop on Semantic Media Adaptation and Personalization, December 15, 2008
DOI: 10.1109/SMAP.2008.23
Automatic image annotation remains an unreliable process due to the well-known semantic gap between the physical representation of images and their high-level semantics. To avoid confronting the semantic gap directly, several approaches restrict the image dataset to web images. Web images mostly appear on websites alongside other text content, which can deliver important information about the image semantics. Popular image search engines use the text surrounding an image to generate annotation keywords, and emphasized text such as headlines is likewise assumed to provide important descriptions. However, these search engines return false positives at high-ranking positions, which are the effect of incorrect text-to-image mappings. This paper addresses the problem of finding correct matches between text elements and images in HTML documents by extending the DOM tree of a web page to a 2-dimensional object model (2-DOM) tree. This model adapts to the two-dimensional layout of web documents and thus allows a better mapping of text articles to images. The evaluation results show that text articles are assigned to web images with a precision of over 90 percent. This intuitively leads to better textual information about the events depicted in the images, and thus yields better image retrieval quality in query-by-keyword scenarios.
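The core intuition behind the 2-DOM approach is that spatial proximity in the rendered page, not adjacency in the DOM source, determines which text belongs to an image. A minimal sketch of that idea follows, assuming rendered bounding boxes are already available (in practice they would come from a layout engine); the `Box` type, the center-distance heuristic, and all labels are illustrative assumptions, not the paper's actual 2-DOM algorithm.

```python
# Illustrative sketch: match each image to the spatially nearest text
# block using 2-D bounding boxes, rather than DOM source order.
from dataclasses import dataclass
import math

@dataclass
class Box:
    label: str
    x: float  # left edge of the rendered element, in pixels
    y: float  # top edge
    w: float  # width
    h: float  # height

    def center(self):
        return (self.x + self.w / 2, self.y + self.h / 2)

def nearest_text(image: Box, texts: list[Box]) -> Box:
    """Return the text block whose center is closest to the image center."""
    return min(texts, key=lambda t: math.dist(image.center(), t.center()))

# Hypothetical page layout: an article rendered next to its photo, and
# an unrelated sidebar that is far away on screen.
photo = Box("photo", 100, 200, 300, 200)
article = Box("article text", 420, 200, 400, 300)
sidebar = Box("sidebar text", 900, 50, 200, 600)

print(nearest_text(photo, [article, sidebar]).label)  # article text
```

A DOM-order heuristic could just as easily pick the sidebar if it happened to precede the photo in the HTML source; anchoring the decision in rendered 2-D coordinates avoids exactly this class of incorrect text-to-image mappings.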