{"title":"An efficient Internet crawling and filtering system for the nationwide tendering information retrieval","authors":"Toshio Matsuda, Kazushige Nakamura, Norihiko Sakamoto","doi":"10.1109/WI.2003.1241304","DOIUrl":null,"url":null,"abstract":"With the growth of Internet, the central government and local governments have begun to publish matters concerning the prospect of orders for public works, the announcement of tendering and the contracting information on their Web sites. However, it is time consuming and painful for bidders such as constructors and manufacturers to periodically search the above information that matches their needs. Recently, there are various search engines, e.g. Google and Yahoo!, but those general search engines are not effective for the purpose of retrieving the above information quickly enough because of their crawling interval and coverage. Then we developed a system to automate the process of gathering such information, filtering for users' needs and delivering as the tendering and contracting information database. We describe the concept of the system as well as the key techniques to realize it: (1) to efficiently retrieve only relevant Web pages, and (2) filtering to match users' needs.","PeriodicalId":403574,"journal":{"name":"Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2003.1241304","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the growth of Internet, the central government and local governments have begun to publish matters concerning the prospect of orders for public works, the announcement of tendering and the contracting information on their Web sites. However, it is time consuming and painful for bidders such as constructors and manufacturers to periodically search the above information that matches their needs. Recently, there are various search engines, e.g. Google and Yahoo!, but those general search engines are not effective for the purpose of retrieving the above information quickly enough because of their crawling interval and coverage. Then we developed a system to automate the process of gathering such information, filtering for users' needs and delivering as the tendering and contracting information database. We describe the concept of the system as well as the key techniques to realize it: (1) to efficiently retrieve only relevant Web pages, and (2) filtering to match users' needs.