SEPHWIR: Search Engine Parsing for Hidden Web Information Retrieval

Manpreet Singh Sehgal, Sachin Gupta, Twinkle Sehgal
{"title":"SEPHWIR: Search Engine Parsing for Hidden Web Information Retrieval","authors":"Manpreet Singh Sehgal, Sachin Gupta, Twinkle Sehgal","doi":"10.1109/SSTEPS57475.2022.00038","DOIUrl":null,"url":null,"abstract":"The world wide web consists of web pages and databases from which web pages can be generated on demand. It is presumed that the information stored in the databases is of better quality than the one published onto already created static webpages. The web search engine architectures are tuned to access static webpages and index and rank them in their results against the query. This results into the accessibility to the kind of information that is not of better quality. This paper talks about the approach to parse such results of the search engines and find the entry points to the databases to fetch the better-quality information.","PeriodicalId":289933,"journal":{"name":"2022 International Conference on Smart and Sustainable Technologies in Energy and Power Sectors (SSTEPS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Smart and Sustainable Technologies in Energy and Power Sectors (SSTEPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSTEPS57475.2022.00038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The world wide web consists of web pages and databases from which web pages can be generated on demand. It is presumed that the information stored in the databases is of better quality than the one published onto already created static webpages. The web search engine architectures are tuned to access static webpages and index and rank them in their results against the query. This results into the accessibility to the kind of information that is not of better quality. This paper talks about the approach to parse such results of the search engines and find the entry points to the databases to fetch the better-quality information.
隐藏Web信息检索的搜索引擎解析
万维网由网页和数据库组成,网页可以根据需要生成。假定存储在数据库中的信息比发布在已创建的静态网页上的信息质量更好。web搜索引擎架构被调整为访问静态网页和索引,并根据查询在结果中对它们进行排名。这就导致了对那些质量不高的信息的可访问性。本文讨论了如何对搜索引擎的搜索结果进行解析,并找到进入数据库的入口点,从而获得质量更好的信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信