C. Böhm, Bernhard Braunmüller, H. Kriegel, Matthias Schubert
{"title":"数字图书馆的高效相似度搜索","authors":"C. Böhm, Bernhard Braunmüller, H. Kriegel, Matthias Schubert","doi":"10.1109/ADL.2000.848382","DOIUrl":null,"url":null,"abstract":"Digital libraries are a core information technology. When the stored data is complex, e.g. high-resolution images or molecular protein structures, simple query types such as exact match query are hardly applicable. In such environments similarity queries, particularly range queries and k-nearest neighbor queries, are important query types. Numerous approaches have been proposed for the processing of similarity queries which mainly concentrate on highly dynamic data sets where insertion, update, and deletion operations occur. However, only little effort has been devoted to the case of rather static data sets-frequently, occurring in digital libraries. In this paper we introduce a novel technique for efficient similarity search on top of static or rarely changing data sets. In particularly we propose a special sorting order on the data objects which can be effectively exploited to substantially reduce the total query time of similarity queries. An extensive experimental evaluation with real-world data sets emphasizes the practical impact of our technique.","PeriodicalId":426762,"journal":{"name":"Proceedings IEEE Advances in Digital Libraries 2000","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Efficient similarity search in digital libraries\",\"authors\":\"C. Böhm, Bernhard Braunmüller, H. Kriegel, Matthias Schubert\",\"doi\":\"10.1109/ADL.2000.848382\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digital libraries are a core information technology. When the stored data is complex, e.g. high-resolution images or molecular protein structures, simple query types such as exact match query are hardly applicable. In such environments similarity queries, particularly range queries and k-nearest neighbor queries, are important query types. Numerous approaches have been proposed for the processing of similarity queries which mainly concentrate on highly dynamic data sets where insertion, update, and deletion operations occur. However, only little effort has been devoted to the case of rather static data sets-frequently, occurring in digital libraries. In this paper we introduce a novel technique for efficient similarity search on top of static or rarely changing data sets. In particularly we propose a special sorting order on the data objects which can be effectively exploited to substantially reduce the total query time of similarity queries. An extensive experimental evaluation with real-world data sets emphasizes the practical impact of our technique.\",\"PeriodicalId\":426762,\"journal\":{\"name\":\"Proceedings IEEE Advances in Digital Libraries 2000\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings IEEE Advances in Digital Libraries 2000\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ADL.2000.848382\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE Advances in Digital Libraries 2000","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ADL.2000.848382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Digital libraries are a core information technology. When the stored data is complex, e.g. high-resolution images or molecular protein structures, simple query types such as exact match query are hardly applicable. In such environments similarity queries, particularly range queries and k-nearest neighbor queries, are important query types. Numerous approaches have been proposed for the processing of similarity queries which mainly concentrate on highly dynamic data sets where insertion, update, and deletion operations occur. However, only little effort has been devoted to the case of rather static data sets-frequently, occurring in digital libraries. In this paper we introduce a novel technique for efficient similarity search on top of static or rarely changing data sets. In particularly we propose a special sorting order on the data objects which can be effectively exploited to substantially reduce the total query time of similarity queries. An extensive experimental evaluation with real-world data sets emphasizes the practical impact of our technique.