L. J. Milask, Traci Guynup, Christopher Hammel, L. Kerschberg, George Michaels
{"title":"A forest canopy research database and analysis system","authors":"L. J. Milask, Traci Guynup, Christopher Hammel, L. Kerschberg, George Michaels","doi":"10.1109/SSDM.1997.621165","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621165","url":null,"abstract":"Described is a prototype database and analysis system developed to support the specific domain of forest canopy research. This effort utilized a multidiscipline team comprised of information (database systems), statistical analysis and forest canopy scientists. Both large scale (Oracle) and smaller scale (Visual FoxPro) databases were prototyped. A Web based query interface to the Oracle system was also demonstrated. The paper addresses the FoxPro database and S-Plus statistical analysis interface developed to address the data analysis, data integration, and data distribution requirements of the originating forest canopy research team. The prototype system employs Visual FoxPro (VFP) as the database engine. Visualization and analytical functions are demonstrated by the use of VFP forms and custom designed S-Plus procedures. Finally, a Web database server facility is also demonstrated. The authors conclude that value added support centers could be created to develop and disseminate small scale database and analysis systems to a specific scientific community.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115961694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Y. Ioannidis, M. Livny, A. Ailamaki, Arvind Ranganathan, Andrew Therber, Maria Yuin, Martha Anderson, John Norman
{"title":"Managing soil science experiments using Zoo (http://www.cs.wisc.edu/~ZOO)","authors":"Y. Ioannidis, M. Livny, A. Ailamaki, Arvind Ranganathan, Andrew Therber, Maria Yuin, Martha Anderson, John Norman","doi":"10.1109/SSDM.1997.621171","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621171","url":null,"abstract":"We have studied the needs of a wide range of experimental disciplines, developed solutions to some of the basic problems in experiment management and made significant progress towards implementing a simple desktop experiment management environment (DEME) called Zoo. Our work has proceeded in a tight loop between developing generic experiment management technology that is implemented in a generic tool, installing customized enhancements of the tool that constitute full systems [complete customized desktop experiment management systems (CDEMSs)] in laboratories of interest, and using the provided feedback to guide our research directions. In this paper, we first outline the overall architecture of Zoo and then discuss a particular experiment.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130976838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Summarizability in OLAP and statistical data bases","authors":"H. Lenz, A. Shoshani","doi":"10.1109/SSDM.1997.621175","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621175","url":null,"abstract":"The summarizability of OLAP (online analytical processing) and statistical databases is an extremely important property, because violating this condition can lead to erroneous conclusions and decisions. In this paper, we explore the conditions for summarizability. We introduce a framework for precisely specifying the context in which statistical objects are defined. We use a three-step process to define normalized statistical objects. Using this framework, we identify three necessary conditions for summarizability. We provide specific tests for each of the conditions that can be verified either from semantic knowledge or by checking the statistical database itself. We also provide the reasoning for our belief that these three summarizability conditions are sufficient as well.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114174364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Kafatos, X. Wang, H. Weir, Zuotao Li, P. Hertz, H. Wolf, Ruixin Yang, Duane King, D. Ziskin
{"title":"The Virtual Domain Application Data Center: serving interdisciplinary Earth scientists","authors":"M. Kafatos, X. Wang, H. Weir, Zuotao Li, P. Hertz, H. Wolf, Ruixin Yang, Duane King, D. Ziskin","doi":"10.1109/SSDM.1997.621195","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621195","url":null,"abstract":"The authors address the data access and analysis issues faced by interdisciplinary Earth scientists and graduate students as a prototypical domain community which will be accessing large data sets in Earth system science in the following decades. They present a working prototype developed at George Mason University to serve wide user needs termed Virtual Domain Application Data Center (VDADC). The VDADC prototype provides tools, data products and services tailored to users and can be extended to other domain communities. The VDADC operates in a distributed environment, the World Wide Web, and in close association with federated data centers. Moreover, the information technology implementation is driven by science scenarios and can apply to a variety of domain users, thus reducing network traffic to the data centers by implementing intelligent data searching or \"content-based browsing\" prior to data ordering, and thus more effectively addressing user needs.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123815248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Some thoughts about a metadata management system","authors":"J. Kent, M. Schuerhoff","doi":"10.1109/SSDM.1997.621184","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621184","url":null,"abstract":"The authors have tried to present some very general ideas about metadata. The aim is not yet to provide solutions, but to define the field for further research. It has become clear that the concept of metadata is very diverse, and that a general approach is needed in order to provide generally applicable solutions. They have learned some basic principles about metadata that will help them in further research.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126828376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"geoPOM: a heterogeneous geoscientific persistent object system","authors":"Silvia Nittel, R. Muntz, E. Mesrobian","doi":"10.1109/SSDM.1997.621194","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621194","url":null,"abstract":"Lately, a need for uniform access to and integration of data stored in specialized, non-standard repositories such as GIS or multimedia storage servers has become apparent. The authors provide an overview of a heterogeneous geoscientific persistent object manager (geoPOM) developed at the UCLA Data Mining Laboratory. GeoPOM provides users with the \"illusion\" of a single object-oriented spatial data store even though the data is actually stored in several different spatial data repositories, thus allowing users to define and handle spatial data in a uniform manner. The geoPOM data model is based on the ODMG-93 standard for object-oriented data models, and the Open Geodata Consortium's (OGC) standardization effort for temporal-spatial object types (OGIS).","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128139065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using the STEP standard and databases in science","authors":"Udo Nink","doi":"10.1109/SSDM.1997.621189","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621189","url":null,"abstract":"Database management systems (DBMS) have to provide certain facilities meeting the requirements of scientific and statistical database management. Reviewing problems and promises for current database technology, the STEP standard is assessed for selected aspects of data management, data access, data exchange, and data modeling. STEP-based solutions are proposed for concrete examples of SS-DBM especially in the context of the scientific data exchange standard FITS. The author introduces and discusses EXPRESS, the modeling language of STEP, and SDAI, the corresponding data access interface. The performance of navigational access provided by SDAI is considered a crucial aspect. Exploiting the code generation mechanism used to instantiate SDAI for a given programming language (he calls it a generated call interface), an adequate software architecture on top of an ODBMS as well as STEP-specific optimizations are proposed.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130667698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Brankovic, P. Horák, Mirka Miller, G. Wrightson
{"title":"Usability of compromise-free statistical databases","authors":"L. Brankovic, P. Horák, Mirka Miller, G. Wrightson","doi":"10.1109/SSDM.1997.621177","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621177","url":null,"abstract":"The usability of a statistical database is defined to be the ratio of the cardinality of the largest set of queries which can be answered without compromise to the total number of queries. In this paper, we present new results concerning the usability of secure statistical databases for general SUM, COUNT and MEAN queries, as well as for the corresponding range queries. We give the usability of these k-dimensional databases for all k≥1. The paper concludes with a discussion of the implications of our results.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131348231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data mining and knowledge discovery in databases: implications for scientific databases","authors":"U. Fayyad","doi":"10.1109/SSDM.1997.621141","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621141","url":null,"abstract":"Data mining and knowledge discovery in databases (KDD) promise to play an important role in the way people interact with databases, especially scientific databases where analysis and exploration operations are essential. The author defines the basic notions in data mining and KDD, defines the goals, presents motivation, and gives a high-level definition of the KDD process and how it relates to data mining. The author then focuses on data mining methods. Basic coverage of a sampling of methods is provided to illustrate the methods and how they are used. The author covers a case study of a successful application in science data analysis: the classification and cataloging of a major astronomy sky survey covering 2 billion objects in the northern sky. The system can outperform humans as well as classical computational analysis tools in astronomy on the task of recognizing faint stars and galaxies. The author also covers the problem of scaling a clustering problem to a large catalog database of billions of objects.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123748016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Remote access tool for Earth science data","authors":"E. Dobinson, R. Raskin","doi":"10.1109/SSDM.1997.621170","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621170","url":null,"abstract":"Presents an HTTP-based client/server application prototype that facilitates Internet access to Earth science data. The client consists of a Java applet GUI that allows the user to select spatial/temporal subsets of indexed datasets. The client also includes a MATLAB interface that allows the incoming data to be loaded directly into a MATLAB session. The server provides directory, catalog and data access services and performs the subsetting operations prior to data transmission. An example is presented where data from multiple sources and in multiple formats are combined into a single MATLAB plot. The prototype addresses the lack of common data models in the Earth sciences. It also addresses the need for access to corroborating data by Earth Observation Satellite (EOS) instrument team members for calibration and validation.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131064608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}