A parallel search engine with BSP
V. Gil-Costa, Mauricio Marín
Third Latin American Web Congress (LA-WEB'2005), published 2005-10-31
DOI: 10.1109/LAWEB.2005.7 (https://doi.org/10.1109/LAWEB.2005.7)

As technology advances and the Internet grows, both the amount of information available online and the number of users searching it for specific data keep increasing. A search system that retrieves information efficiently and within a reasonable time is therefore desirable. This article describes two bucket-based strategies for improving the efficiency of a classic data structure, the inverted list, which is used to index textual databases for query processing. The server that processes the queries is designed on top of the bulk-synchronous parallel (BSP) model of computing. The main interest is how the organization of the index affects query performance on a cluster of PCs.
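
To make the terms in the abstract concrete, the following is a minimal sketch of inverted lists cut into buckets and queried in BSP-style supersteps. It is not the authors' implementation: the abstract does not say how the two bucket strategies work, so the round-robin bucket placement, the term-frequency scoring, the single-broker merge, and every function name here (`build_inverted_lists`, `distribute_buckets`, `bsp_query`) are assumptions made for illustration. Processors are simulated as in-memory dictionaries rather than cluster nodes.

```python
from collections import defaultdict


def build_inverted_lists(docs):
    """Map each term to a sorted list of (doc_id, term_frequency) pairs."""
    lists = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            lists[term][doc_id] = lists[term].get(doc_id, 0) + 1
    return {term: sorted(postings.items()) for term, postings in lists.items()}


def distribute_buckets(lists, num_procs, bucket_size):
    """Cut each inverted list into fixed-size buckets and deal them out.

    Assumption: buckets are placed round-robin over the processors; the
    abstract does not specify the actual placement strategies.
    """
    procs = [defaultdict(list) for _ in range(num_procs)]
    next_proc = 0
    for term in sorted(lists):
        postings = lists[term]
        for start in range(0, len(postings), bucket_size):
            procs[next_proc][term].append(postings[start:start + bucket_size])
            next_proc = (next_proc + 1) % num_procs
    return procs


def bsp_query(procs, terms):
    """Answer a query in two simulated supersteps.

    Superstep 1: each processor scans only its local buckets and emits a
                 message with partial scores (computation + communication).
    Barrier:     modeled by finishing the loop over all processors first.
    Superstep 2: a broker merges the partial scores and ranks documents.
    """
    messages = []
    for local in procs:
        partial = defaultdict(int)
        for term in terms:
            for bucket in local.get(term, []):
                for doc_id, tf in bucket:
                    partial[doc_id] += tf
        messages.append(dict(partial))

    scores = defaultdict(int)
    for partial in messages:
        for doc_id, score in partial.items():
            scores[doc_id] += score
    # Highest score first; ties broken by document id.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


docs = {
    1: "parallel search engine",
    2: "search engine index search",
    3: "bulk synchronous parallel",
}
lists = build_inverted_lists(docs)
procs = distribute_buckets(lists, num_procs=2, bucket_size=1)
ranking = bsp_query(procs, ["search", "parallel"])
```

Because the broker sums partial scores from every processor, the ranking in this sketch does not depend on how many processors or what bucket size is chosen. Those parameters only change how the scanning work is spread across processors, which is the kind of index-organization effect the abstract says the paper studies.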