Chun-Hsiung Tseng, Yung-Hui Chen, Han-Ci Syu, C. Chuang, J. Wu, Yan-Ru Jiang
{"title":"Roll Your Own Web Database: An Innovative Approach for Providing Searchable Web Content","authors":"Chun-Hsiung Tseng, Yung-Hui Chen, Han-Ci Syu, C. Chuang, J. Wu, Yan-Ru Jiang","doi":"10.6159/IJSE.2014.(4-4).02","DOIUrl":null,"url":null,"abstract":"The paper is aimed at addressing two issues: first, despite of the importance of semantic information in HTML pages, it is often ignored by search engines due to various technology difficulties; second, the ambiguity problem sometimes makes results returned by search engines much less useful. OOSM, a schema model as well as a set of information processing tools, is proposed in the paper. OOSM develops the concept of coarse mapping, that is, users are allowed (but not restricted) to associate a grammar node to a sub section instead of a single node on a HTML page. AS a result, minor modifications of the annotated HTML page can be tolerated. We believe that OOSM is a right solution for the issues presented in this paper.","PeriodicalId":14209,"journal":{"name":"International Journal of Science and Engineering","volume":"4 1","pages":"5-9"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6159/IJSE.2014.(4-4).02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The paper is aimed at addressing two issues: first, despite of the importance of semantic information in HTML pages, it is often ignored by search engines due to various technology difficulties; second, the ambiguity problem sometimes makes results returned by search engines much less useful. OOSM, a schema model as well as a set of information processing tools, is proposed in the paper. OOSM develops the concept of coarse mapping, that is, users are allowed (but not restricted) to associate a grammar node to a sub section instead of a single node on a HTML page. AS a result, minor modifications of the annotated HTML page can be tolerated. We believe that OOSM is a right solution for the issues presented in this paper.