Tracking and locating source content in a weblog using semantic annotation techniques

2013 International Conference on Recent Trends in Information Technology (ICRTIT) Pub Date : 2013-07-25 DOI:10.1109/ICRTIT.2013.6844238

R. Priyadarshini, L. Tamilselvan

{"title":"Tracking and locating source content in a weblog using semantic annotation techniques","authors":"R. Priyadarshini, L. Tamilselvan","doi":"10.1109/ICRTIT.2013.6844238","DOIUrl":null,"url":null,"abstract":"In recent years, the World Wide Web has grown tremendously and become more complex because of the growing number of users and the content being added in varied formats. Web 1.0 suffered from publishing limitations as it requires significant amount of software investments. Web 2.0 has changed this by providing easy to use web tools to enable people to generate content and publish it easily on the web. This resulted in an explosion of web contents and has made relevant information retrieval challenging. This has led to the evolution of semantic web, where the traditional web content is added with semantic repository. There are semantic web based tools being developed and researched to make the information retrieval more efficient. This paper describes a semantic weblog which has the feature of locating the exact source content from the reference URLs. This ideology will be used in the CMS (Content Management System). It's highly possible that the content in CMS will be redundant over the web as most of the time the content will be gathered from already existing websites. Back tracking the source of such content will become obsolete and also changes to the source are difficult to be tracked. The proposed document based CMS varies from these traditional CMS in architecture, storage and control flow. Source URLs and content markings are indexed and mirrored. The Semantically annotated content are located in the stored websites and matched with the original source websites. It also allows backtracking of data to the original source URL(s) which will also be referred for future use.","PeriodicalId":113531,"journal":{"name":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRTIT.2013.6844238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In recent years, the World Wide Web has grown tremendously and become more complex because of the growing number of users and the content being added in varied formats. Web 1.0 suffered from publishing limitations as it requires significant amount of software investments. Web 2.0 has changed this by providing easy to use web tools to enable people to generate content and publish it easily on the web. This resulted in an explosion of web contents and has made relevant information retrieval challenging. This has led to the evolution of semantic web, where the traditional web content is added with semantic repository. There are semantic web based tools being developed and researched to make the information retrieval more efficient. This paper describes a semantic weblog which has the feature of locating the exact source content from the reference URLs. This ideology will be used in the CMS (Content Management System). It's highly possible that the content in CMS will be redundant over the web as most of the time the content will be gathered from already existing websites. Back tracking the source of such content will become obsolete and also changes to the source are difficult to be tracked. The proposed document based CMS varies from these traditional CMS in architecture, storage and control flow. Source URLs and content markings are indexed and mirrored. The Semantically annotated content are located in the stored websites and matched with the original source websites. It also allows backtracking of data to the original source URL(s) which will also be referred for future use.

查看原文本刊更多论文

使用语义注释技术跟踪和定位博客中的源内容

近年来，由于用户数量的增加和内容以各种格式添加，万维网发展迅速，变得更加复杂。Web 1.0受到发布限制的困扰，因为它需要大量的软件投资。Web 2.0改变了这一点，它提供了易于使用的Web工具，使人们能够轻松地在Web上生成和发布内容。这导致了网络内容的爆炸式增长，并使相关信息检索具有挑战性。这导致了语义网的发展，传统的网络内容被添加到语义库中。人们正在开发和研究基于语义网的工具，以提高信息检索的效率。本文描述了一种语义博客，它具有从参考url中定位准确源内容的特点。这一思想将被用于CMS(内容管理系统)。CMS中的内容很可能在网络上是冗余的，因为大多数时候内容将从已经存在的网站收集。追溯这些内容的来源将会过时，而且来源的变化也很难被追踪。本文提出的基于文档的CMS在架构、存储和控制流程等方面与传统的CMS有所不同。源url和内容标记被索引和镜像。语义注释的内容位于存储的网站中，并与原始源网站相匹配。它还允许将数据回溯到原始源URL，该URL也将被引用以供将来使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 International Conference on Recent Trends in Information Technology (ICRTIT)

自引率

0.00%

发文量