Applying Web analysis in Web page filtering

Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004. Publication date: 2004-06-07. DOI: 10.1145/996350.996442

M. Chau


Citations: 13

Abstract

Vertical search engines offer Web users an alternative way to find information by providing customized search within particular domains. Developing such engines raises two issues: how to locate relevant documents on the Web, and how to filter irrelevant documents out of a set collected from the Web. This paper reports research on the second issue and proposes a machine learning-based approach that combines Web content analysis with Web structure analysis.
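The abstract does not name the features or the learning algorithm, so the following is only a minimal sketch of what combining content and structure evidence in one classifier might look like. Everything in it is an assumption made for illustration: the `Page` fields, the `DOMAIN_TERMS` lexicon, the three features, and the use of logistic regression as a stand-in learner are hypothetical and are not taken from the paper.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical page record; the field names are illustrative only."""
    text: str                                              # body text (content evidence)
    anchor_texts: list[str] = field(default_factory=list)  # anchor text of in-links (structure evidence)
    in_links: int = 0                                      # number of pages linking here (structure evidence)


# Assumed domain lexicon for an imaginary "digital library" vertical.
DOMAIN_TERMS = {"library", "catalog", "archive"}


def term_ratio(words, lexicon):
    """Fraction of words that appear in the lexicon (0.0 for empty input)."""
    return sum(w in lexicon for w in words) / len(words) if words else 0.0


def features(page):
    """Build one feature vector from content and structure evidence."""
    body = page.text.lower().split()
    anchors = " ".join(page.anchor_texts).lower().split()
    return [
        term_ratio(body, DOMAIN_TERMS),     # content: domain terms in the body
        term_ratio(anchors, DOMAIN_TERMS),  # structure: domain terms in in-link anchors
        math.log1p(page.in_links),          # structure: damped in-link count
    ]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def score(page, w, b):
    """Estimated probability that the page is relevant to the domain."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, features(page))) + b)


def train(pages, labels, epochs=300, lr=0.5):
    """Fit logistic regression with plain stochastic gradient descent."""
    w, b = [0.0, 0.0, 0.0], 0.0
    for _ in range(epochs):
        for page, y in zip(pages, labels):
            g = score(page, w, b) - y  # gradient of the log loss w.r.t. the logit
            w = [wi - lr * g * xi for wi, xi in zip(w, features(page))]
            b -= lr * g
    return w, b


def is_relevant(page, w, b, threshold=0.5):
    """Filter decision: keep the page only if its score reaches the threshold."""
    return score(page, w, b) >= threshold
```

In this sketch a collected document set would be filtered by training on a few labeled pages and then keeping only those pages for which `is_relevant` returns `True`. Putting both kinds of evidence into a single vector lets the learner weigh them against each other, which is one plausible reading of "combines" in the abstract.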

