Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.最新文献

筛选
英文 中文
Knowledge Sifter: ontology-driven search over heterogeneous databases 知识筛选器:异构数据库的本体驱动搜索
L. Kerschberg, Mizan Chowdhury, A. Damiano, Hanjo Jeong, Scott Mitchell, Jingwei Si, Stephen Smith
{"title":"Knowledge Sifter: ontology-driven search over heterogeneous databases","authors":"L. Kerschberg, Mizan Chowdhury, A. Damiano, Hanjo Jeong, Scott Mitchell, Jingwei Si, Stephen Smith","doi":"10.1109/SSDBM.2004.46","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.46","url":null,"abstract":"Knowledge Sifter is a scaleable agent-based system that supports access to heterogeneous information sources such as the Web, open-source repositories, XML-databases and the emerging Semantic Web. User query specification is supported by a user agent that accesses multiple ontologies using an integrated conceptual model. A collection of cooperating agents supports interactive query specification, refinement, decomposition, and processing, as well as result ranking and presentation. The Knowledge Sifter architecture is general and modular so that ontologies and information sources can be easily incorporated. A proof-of-concept implementation depicts Knowledge Sifter using a domain ontology together with geospatial and semantic name services to enhance query formulation and to search image databases such as Lycos and TerraServer.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"244 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129995521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
East of Neuchatel: a universal model for the representation of statistical taxonomy systems 纳沙泰尔以东:统计分类系统表示的通用模型
M. Denk, K. Froeschl
{"title":"East of Neuchatel: a universal model for the representation of statistical taxonomy systems","authors":"M. Denk, K. Froeschl","doi":"10.1109/SSDBM.2004.31","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.31","url":null,"abstract":"The Neuchatel data model for statistical classification systems can be regarded as a standard for classification management in official statistics. However, despite its unquestioned benefits, the model remains rather informal. For computational use, the semantics of model structures have to be specified more accurately. Focusing on formal underpinnings of the model required to support automated processing of classification structures, this paper proposes an alternative approach. The Neuchatel concepts are simplified and generalized, leading to a universal model of hierarchical-taxonomic classification structures featuring precise semantics and supporting typical algebraic transformations related to classification systems.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133059111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Accessing and visualizing scientific spatiotemporal data 获取和可视化科学时空数据
D. Katz, Attila Bergou, G. Berriman, Gary L. Block, J. Collier, D. Curkendall, J. Good, L. Husman, J. Jacob, A. Laity, Peggy Li, C. Miller, T. Prince, H. Siegel, Roy Williams
{"title":"Accessing and visualizing scientific spatiotemporal data","authors":"D. Katz, Attila Bergou, G. Berriman, Gary L. Block, J. Collier, D. Curkendall, J. Good, L. Husman, J. Jacob, A. Laity, Peggy Li, C. Miller, T. Prince, H. Siegel, Roy Williams","doi":"10.1109/SSDBM.2004.11","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.11","url":null,"abstract":"This paper discusses work done by JPL's Parallel Applications Technologies Group in helping scientists access and visualize very large data sets through the use of multiple computing resources, such as parallel supercomputers, clusters, and grids. These tools do one or more of the following tasks: visualize local data sets for local users, visualize local data sets for remote users, and access and visualize remote data sets. The tools are used for various types of data, including remotely sensed image data, digital elevation models, astronomical surveys, etc. The paper attempts to pull some common elements out of these tools that may be useful for others who have to work with similarly large data sets.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130655434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An efficient method to find area clusters with constraints using grid index structure 一种利用网格索引结构查找约束区域簇的有效方法
Kwang-Su Yang, Ruixin Yang, Jiang Tang, M. Kafatos
{"title":"An efficient method to find area clusters with constraints using grid index structure","authors":"Kwang-Su Yang, Ruixin Yang, Jiang Tang, M. Kafatos","doi":"10.1109/SSDBM.2004.14","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.14","url":null,"abstract":"This work presents an efficient method to find area clusters satisfying certain constraints. The method uses a grid index structure to find them by examining five cell, 8-connected neighborhood. The hierarchical clustering technique is applied for preprocessing the grid with similar value distribution into the same clusters. This preprocessing helps reduce the computational time and eliminate the possible outlier grid cells. The constraints are average value range, minimum area size and minimum missing data ratio. The method is implemented for a remote sensing data, MOD08/spl I.bar/M3, which is MODIS (MODerate resolution Imaging Spectroradiometer) level 3 monthly atmospheric product.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134452997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MM-Cubing: computing Iceberg cubes by factorizing the lattice space MM-Cubing:通过分解晶格空间来计算冰山立方体
Zheng Shao, Jiawei Han, Dong Xin
{"title":"MM-Cubing: computing Iceberg cubes by factorizing the lattice space","authors":"Zheng Shao, Jiawei Han, Dong Xin","doi":"10.1109/SSDBM.2004.53","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.53","url":null,"abstract":"The data cube and iceberg cube computation problem has been studied by many researchers. There are three major approaches developed in this direction: (1) top-down computation, represented by MultiWay array aggregation (Zhao et. al., 1997) which utilizes shared computation and performs well on dense data sets; (2) bottom-up computation, represented by BUC (Beyer and Ramakrishnan, 1999), which takes advantage of Apriori Pruning and performs well on sparse data sets; and (3) integrated top-down and bottom-up computation, represented by Star-Cubing (Xin, et. al., 2003), which takes advantages of both and has high performance in most cases. However; the performance of Star-Cubing degrades in very sparse data sets due to the additional cost introduced by the tree structure. None of the three approaches achieves uniformly high performance on all kinds of data sets. In this paper; we present a new approach that compute Iceberg Cubes by factorizing the lattice space according to the frequency of values. This approach, different from all the previous dimension-based approaches where the importance of data distribution is not recognized, partitions the cube lattice into one dense subspace and several sparse subspaces. With this approach, a new method called MM-Cubing has been developed. MM-Cubing is highly adaptive to dense, sparse or skewed data sets. Our performance study shows that MM-Cubing is efficient and achieves high performance over all kinds of data distributions.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133919031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
A comparative study of spatial indexing techniques for multidimensional scientific datasets 多维科学数据集空间索引技术的比较研究
Beomseok Nam, A. Sussman
{"title":"A comparative study of spatial indexing techniques for multidimensional scientific datasets","authors":"Beomseok Nam, A. Sussman","doi":"10.1109/SSDBM.2004.1","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.1","url":null,"abstract":"Scientific applications that query into very large multidimensional datasets are becoming more common. These datasets are growing in size every day, and are becoming truly enormous, making it infeasible to index individual data elements. We have instead been experimenting with chunking the datasets to index them, grouping data elements into small chunks of a fixed, but dataset-specific, size to take advantage of spatial locality. While spatial indexing structures based on R-trees perform reasonably well for the rectangular bounding boxes of such chunked datasets, other indexing structures based on KDB-trees, such as Hybrid trees, have been shown to perform very well for point data. In this paper, we investigate how all these indexing structures perform for multidimensional scientific datasets, and compare their features and performance with that of SH-trees, an extension of Hybrid trees, for indexing multidimensional rectangles. Our experimental results show that the algorithms for building and searching SH-trees outperform those for R-trees, R*-trees, and X-trees for both real application and synthetic datasets and queries. We show that the SH-tree algorithms perform well for both low and high dimensional data, and that they scale well to high dimensions both for building and searching the trees.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"141 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131397677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
COBBLER: combining column and row enumeration for closed pattern discovery COBBLER:结合列和行枚举,用于封闭模式发现
Feng Pan, A. Tung, G. Cong, Xin Xu
{"title":"COBBLER: combining column and row enumeration for closed pattern discovery","authors":"Feng Pan, A. Tung, G. Cong, Xin Xu","doi":"10.1109/SSDBM.2004.21","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.21","url":null,"abstract":"The problem of mining frequent closed patterns has received considerable attention recently as it promises to have much less redundancy compared to discovering all frequent patterns. Existing algorithms can presently be separated into two groups, feature (column) enumeration and row enumeration. Feature enumeration algorithms like CHARM and CLOSET+ are efficient for datasets with small number of features and large number of rows since the number of feature combinations to be enumerated is small. Row enumeration algorithms like CARPENTER on the other hand are more suitable for datasets (eg. bioinformatics data) with large number of features and small number of rows. Both groups of algorithms, however, will encounter problem for datasets that have large number of rows and features. In this paper, we describe a new algorithm called COBBLER which can efficiently mine such datasets . COBBLER is designed to dynamically switch between feature enumeration and row enumeration depending on the data characteristic in the process of mining. As such, each portion of the dataset can be processed using the most suitable method, making the mining more efficient. Several experiments on real-life and synthetic datasets show that COBBLER is an order of magnitude better than previous closed pattern mining algorithms like CHARM, CLOSET+ and CARPENTER.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114217776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
Retrieval of isomorphic substructures in crystallographic databases 晶体学数据库中同构子结构的检索
H. Klein
{"title":"Retrieval of isomorphic substructures in crystallographic databases","authors":"H. Klein","doi":"10.1109/SSDBM.2004.59","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.59","url":null,"abstract":"Local bindings of atoms are often modeled by coordination polyhedra with vertices representing ligands, i.e. atoms with strong bonds to a central atom. Neighbouring polyhedra may be linked by vertices, edges, or faces depending on whether their central atoms share one, two, or more atoms as ligands. Substructures formed by linked polyhedra are of considerable interest for studying crystal structures. We introduce a finite graph representation for infinite polyhedral networks and show how to build an index for a given set of model structures such that the retrieval of isomorphic substructures is supported. A system has been implemented providing this functionality on an interactive graphical Web interface.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123583138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Grid-based metadata services 网格元数据服务
E. Deelman, Gurmeet Singh, M. Atkinson, A. Chervenak, Neil Philippe Chue Hong, C. Kesselman, Sonal Patil, L. Pearlman, Mei-Hui Su
{"title":"Grid-based metadata services","authors":"E. Deelman, Gurmeet Singh, M. Atkinson, A. Chervenak, Neil Philippe Chue Hong, C. Kesselman, Sonal Patil, L. Pearlman, Mei-Hui Su","doi":"10.1109/SSDBM.2004.39","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.39","url":null,"abstract":"Data sets being managed in grid environments today are growing at a rapid rate, expected to reach 100s of petabytes in the near future. Managing such large data sets poses challenges for efficient data access, data publication and data discovery. In this paper we focus on the data publication and discovery process through the use of descriptive metadata. This metadata describe the properties of individual data items and collections. We discuss issues of metadata services in service rich environments, such as the grid. We describe the requirements and the architecture for such services in the context of grid and the available grid services. We present a data model that can capture the complexity of the data publication and discovery process. Based on that model we identify a set of interfaces and operations that need to be provided to support metadata management. We present a particular implementation of a grid metadata service, basing it on existing grid services technologies. Finally we examine alternative implementations of that service.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128888612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 99
On the integration of autonomous data marts 关于自主数据集市的集成
L. Cabibbo, Riccardo Torlone
{"title":"On the integration of autonomous data marts","authors":"L. Cabibbo, Riccardo Torlone","doi":"10.1109/SSDBM.2004.57","DOIUrl":"https://doi.org/10.1109/SSDBM.2004.57","url":null,"abstract":"We address the problem of integrating a federation of dimensional data marts. This problem arises when, e.g., a large organization (or a federation thereof) needs to combine independently developed data warehouses. We show that this problem can be tackled in a systematic way because of two main reasons. First, data marts are structured in a rather uniform way, along dimensions and facts. Second, data quality in data marts is usually higher than in generic databases, since they are obtained by reconciling several data sources. Our scenario of reference is a federation (i.e., a logical integration) of various data marts, which we need to query in a unified way, that is, by means of drill-across operations. We propose a novel notion of dimension compatibility and characterize its general property. We then show the significance of dimension compatibility in performing drill-across queries over autonomous data marts. We also discuss general strategies for the integration of data marts.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130606464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信