SEPHWIR: Search Engine Parsing for Hidden Web Information Retrieval
Manpreet Singh Sehgal, Sachin Gupta, Twinkle Sehgal
2022 International Conference on Smart and Sustainable Technologies in Energy and Power Sectors (SSTEPS), 2022-11-01
DOI: 10.1109/SSTEPS57475.2022.00038 (https://doi.org/10.1109/SSTEPS57475.2022.00038)
Citation count: 0
Abstract
The World Wide Web consists of static web pages and of databases from which web pages can be generated on demand. The information stored in these databases is presumed to be of better quality than that published on pre-built static pages. Web search engine architectures, however, are tuned to access static pages, index them, and rank them against a query, so the information they make accessible tends to be of lower quality. This paper describes an approach that parses search engine results to find entry points to these databases and fetch the better-quality information.
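The abstract does not say how entry points are recognised, so the following is only a minimal sketch under one common assumption: that an entry point to a database is an HTML form with at least one free-text field on a page returned by the search engine. The names `FormFinder` and `find_entry_points`, the sample markup, and the `example.org` URL are hypothetical and do not come from the paper.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FormFinder(HTMLParser):
    """Collect <form> elements that contain at least one named free-text input."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.entry_points = []  # finished forms that look queryable
        self._current = None    # form currently being parsed, or None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                # Resolve a relative action against the page URL.
                "action": urljoin(self.base_url, attrs.get("action") or ""),
                "method": (attrs.get("method") or "get").lower(),
                "text_fields": [],
            }
        elif tag == "input" and self._current is not None:
            # An input without a type attribute behaves as a text field.
            input_type = (attrs.get("type") or "text").lower()
            if input_type in ("text", "search") and attrs.get("name"):
                self._current["text_fields"].append(attrs["name"])

    def handle_endtag(self, tag):
        if tag == "form" and self._current is not None:
            # Assumption: only forms with a free-text field can query a database.
            if self._current["text_fields"]:
                self.entry_points.append(self._current)
            self._current = None


def find_entry_points(html, base_url):
    """Return candidate database entry points found in one result page."""
    finder = FormFinder(base_url)
    finder.feed(html)
    finder.close()
    return finder.entry_points


# Hypothetical page content standing in for a fetched search result.
sample = """
<html><body>
<form action="/login" method="post"><input type="password" name="pw"></form>
<form action="/catalog/search" method="get">
  <input type="text" name="q"><input type="submit" value="Go">
</form>
</body></html>
"""

points = find_entry_points(sample, "https://example.org/index.html")
for point in points:
    print(point["method"], point["action"], point["text_fields"])
```

In this sketch the login form is skipped because it has no free-text field, while the catalogue search form is kept as a candidate. A real system would also need to fetch the result pages and decide which candidate forms actually front a database, which this heuristic does not attempt.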