Web Information Extraction

2005 IEEE International Conference on Information Acquisition Pub Date : 1900-01-01 DOI:10.1109/ICIA.2005.1635157

Man I. Lam, Zhiguo Gong


Citations: 21

Abstract

With the continuous development of Internet technologies, Web pages provide a huge amount of information resources. This has altered the traditional ways of preserving and searching for information. The volume of queries targeting Web pages has become huge and increasingly important. Nowadays, search engines are a very popular way to search for information on the Web. However, a search engine only presents a list of documents rather than the specific answer or piece of knowledge that addresses the user's specific question. Therefore, data extraction from the Web is becoming a hot topic. In this paper, we investigate the current developments in Web data extraction, its difficulties, and its objectives. In addition, we illustrate and analyze some examples and present our solution for information extraction from the Web.

