{"title":"Web文本挖掘中提取算法的研究与实现","authors":"Shiqun Yin, Yuhui Qiu, Jike Ge, Xiaohong Lan","doi":"10.1109/IITA.2007.31","DOIUrl":null,"url":null,"abstract":"Now people use the search engine - Google, Yahoo, Baidu etc. to lookup Web information mainly, but these search engines involve so wide range, with whose intelligence degree is pool. They are very difficult to mine data further. So, Web text mining aims to resolve this problem. This paper discusses an algorithm of how to follow the appointed Web site or Web page according to the user's request by using the technique of ASP (Active Serve Page), and research and realization how further obtains data of particular range in Internet by text extraction on Web mining. The method has more accuracy, better applicability and less manual interference. The information obtained are the majority of the documents in the field, it can be used to create knowledge database, and it can also be treated as a small scaled vertical search system.","PeriodicalId":191218,"journal":{"name":"Workshop on Intelligent Information Technology Application (IITA 2007)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Research and Realization of Extraction Algorithm on Web Text Mining\",\"authors\":\"Shiqun Yin, Yuhui Qiu, Jike Ge, Xiaohong Lan\",\"doi\":\"10.1109/IITA.2007.31\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Now people use the search engine - Google, Yahoo, Baidu etc. to lookup Web information mainly, but these search engines involve so wide range, with whose intelligence degree is pool. They are very difficult to mine data further. So, Web text mining aims to resolve this problem. This paper discusses an algorithm of how to follow the appointed Web site or Web page according to the user's request by using the technique of ASP (Active Serve Page), and research and realization how further obtains data of particular range in Internet by text extraction on Web mining. The method has more accuracy, better applicability and less manual interference. The information obtained are the majority of the documents in the field, it can be used to create knowledge database, and it can also be treated as a small scaled vertical search system.\",\"PeriodicalId\":191218,\"journal\":{\"name\":\"Workshop on Intelligent Information Technology Application (IITA 2007)\",\"volume\":\"143 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Workshop on Intelligent Information Technology Application (IITA 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IITA.2007.31\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Intelligent Information Technology Application (IITA 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IITA.2007.31","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research and Realization of Extraction Algorithm on Web Text Mining
Now people use the search engine - Google, Yahoo, Baidu etc. to lookup Web information mainly, but these search engines involve so wide range, with whose intelligence degree is pool. They are very difficult to mine data further. So, Web text mining aims to resolve this problem. This paper discusses an algorithm of how to follow the appointed Web site or Web page according to the user's request by using the technique of ASP (Active Serve Page), and research and realization how further obtains data of particular range in Internet by text extraction on Web mining. The method has more accuracy, better applicability and less manual interference. The information obtained are the majority of the documents in the field, it can be used to create knowledge database, and it can also be treated as a small scaled vertical search system.