滚动你自己的网络数据库:提供可搜索网络内容的创新方法

Chun-Hsiung Tseng, Yung-Hui Chen, Han-Ci Syu, C. Chuang, J. Wu, Yan-Ru Jiang
{"title":"滚动你自己的网络数据库:提供可搜索网络内容的创新方法","authors":"Chun-Hsiung Tseng, Yung-Hui Chen, Han-Ci Syu, C. Chuang, J. Wu, Yan-Ru Jiang","doi":"10.6159/IJSE.2014.(4-4).02","DOIUrl":null,"url":null,"abstract":"The paper is aimed at addressing two issues: first, despite of the importance of semantic information in HTML pages, it is often ignored by search engines due to various technology difficulties; second, the ambiguity problem sometimes makes results returned by search engines much less useful. OOSM, a schema model as well as a set of information processing tools, is proposed in the paper. OOSM develops the concept of coarse mapping, that is, users are allowed (but not restricted) to associate a grammar node to a sub section instead of a single node on a HTML page. AS a result, minor modifications of the annotated HTML page can be tolerated. We believe that OOSM is a right solution for the issues presented in this paper.","PeriodicalId":14209,"journal":{"name":"International Journal of Science and Engineering","volume":"4 1","pages":"5-9"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Roll Your Own Web Database: An Innovative Approach for Providing Searchable Web Content\",\"authors\":\"Chun-Hsiung Tseng, Yung-Hui Chen, Han-Ci Syu, C. Chuang, J. Wu, Yan-Ru Jiang\",\"doi\":\"10.6159/IJSE.2014.(4-4).02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper is aimed at addressing two issues: first, despite of the importance of semantic information in HTML pages, it is often ignored by search engines due to various technology difficulties; second, the ambiguity problem sometimes makes results returned by search engines much less useful. OOSM, a schema model as well as a set of information processing tools, is proposed in the paper. OOSM develops the concept of coarse mapping, that is, users are allowed (but not restricted) to associate a grammar node to a sub section instead of a single node on a HTML page. AS a result, minor modifications of the annotated HTML page can be tolerated. We believe that OOSM is a right solution for the issues presented in this paper.\",\"PeriodicalId\":14209,\"journal\":{\"name\":\"International Journal of Science and Engineering\",\"volume\":\"4 1\",\"pages\":\"5-9\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.6159/IJSE.2014.(4-4).02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6159/IJSE.2014.(4-4).02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文旨在解决两个问题:第一,尽管语义信息在HTML页面中很重要,但由于各种技术困难,它经常被搜索引擎所忽略;其次,歧义问题有时会使搜索引擎返回的结果变得不那么有用。本文提出了一种面向对象的模式模型和一套信息处理工具。OOSM开发了粗映射的概念,也就是说,允许(但不限制)用户将语法节点关联到子节,而不是HTML页面上的单个节点。因此,可以容忍对带注释的HTML页面进行微小的修改。我们相信OOSM是本文提出的问题的正确解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Roll Your Own Web Database: An Innovative Approach for Providing Searchable Web Content
The paper is aimed at addressing two issues: first, despite of the importance of semantic information in HTML pages, it is often ignored by search engines due to various technology difficulties; second, the ambiguity problem sometimes makes results returned by search engines much less useful. OOSM, a schema model as well as a set of information processing tools, is proposed in the paper. OOSM develops the concept of coarse mapping, that is, users are allowed (but not restricted) to associate a grammar node to a sub section instead of a single node on a HTML page. AS a result, minor modifications of the annotated HTML page can be tolerated. We believe that OOSM is a right solution for the issues presented in this paper.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信