{"title":"Web信息提取","authors":"Man I. Lam, Zhiguo Gong","doi":"10.1109/ICIA.2005.1635157","DOIUrl":null,"url":null,"abstract":"Along with the continuous development of the Internet technologies, Web pages can provide a huge amount of information resource. It alters the traditional way of preserving and searching information. The queries target to the Web page becomes huge and more and more important. Now a day, search engine is a very popular method to search information on the Web. However, it only presents a list of documents other than the specific answers or piece of knowledge for the user's specific question. Therefore, the data extraction from the Web is becoming a hot topic. In this paper, we investigate the current development in the Web data extraction, the difficulties, and the objectives. In addition, we illustrate and analyze some examples and provide our solution for information extraction from the Web.","PeriodicalId":136611,"journal":{"name":"2005 IEEE International Conference on Information Acquisition","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Web information extraction\",\"authors\":\"Man I. Lam, Zhiguo Gong\",\"doi\":\"10.1109/ICIA.2005.1635157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Along with the continuous development of the Internet technologies, Web pages can provide a huge amount of information resource. It alters the traditional way of preserving and searching information. The queries target to the Web page becomes huge and more and more important. Now a day, search engine is a very popular method to search information on the Web. However, it only presents a list of documents other than the specific answers or piece of knowledge for the user's specific question. Therefore, the data extraction from the Web is becoming a hot topic. In this paper, we investigate the current development in the Web data extraction, the difficulties, and the objectives. In addition, we illustrate and analyze some examples and provide our solution for information extraction from the Web.\",\"PeriodicalId\":136611,\"journal\":{\"name\":\"2005 IEEE International Conference on Information Acquisition\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 IEEE International Conference on Information Acquisition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIA.2005.1635157\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE International Conference on Information Acquisition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIA.2005.1635157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Along with the continuous development of the Internet technologies, Web pages can provide a huge amount of information resource. It alters the traditional way of preserving and searching information. The queries target to the Web page becomes huge and more and more important. Now a day, search engine is a very popular method to search information on the Web. However, it only presents a list of documents other than the specific answers or piece of knowledge for the user's specific question. Therefore, the data extraction from the Web is becoming a hot topic. In this paper, we investigate the current development in the Web data extraction, the difficulties, and the objectives. In addition, we illustrate and analyze some examples and provide our solution for information extraction from the Web.