Cross Engine Database Joining

2010 Eighth ACIS International Conference on Software Engineering Research, Management and Applications Pub Date : 2010-05-24 DOI:10.1109/SERA.2010.13

Wesley Leonard, Paul B. Albee

{"title":"Cross Engine Database Joining","authors":"Wesley Leonard, Paul B. Albee","doi":"10.1109/SERA.2010.13","DOIUrl":null,"url":null,"abstract":"A standards-based, open-source middleware system was designed and implemented to facilitate the analysis of large and disparate datasets. This system makes it possible to access several different types of database servers simultaneously, browse remote data, combine datasets, and join tables from remote databases independent of vendor. The system uses an algorithm known as Dynamic Merge Cache to handle data caching, query generation, transformations, and joining with minimal operational interference to source databases. The system is able to combine any subset of configured databases and convert the information into XML. The resulting XML is made available to analysis tools through a web service. After the system connects to a remote database, a metadata catalog is created from the source database. The user is able to configure which tables and fields to export from the remote dataset. The user is also able to filter, transform, and combine data. The system was tested with a large fish contaminant database and a second database populated with simulated scientific data.","PeriodicalId":102108,"journal":{"name":"2010 Eighth ACIS International Conference on Software Engineering Research, Management and Applications","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Eighth ACIS International Conference on Software Engineering Research, Management and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERA.2010.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

A standards-based, open-source middleware system was designed and implemented to facilitate the analysis of large and disparate datasets. This system makes it possible to access several different types of database servers simultaneously, browse remote data, combine datasets, and join tables from remote databases independent of vendor. The system uses an algorithm known as Dynamic Merge Cache to handle data caching, query generation, transformations, and joining with minimal operational interference to source databases. The system is able to combine any subset of configured databases and convert the information into XML. The resulting XML is made available to analysis tools through a web service. After the system connects to a remote database, a metadata catalog is created from the source database. The user is able to configure which tables and fields to export from the remote dataset. The user is also able to filter, transform, and combine data. The system was tested with a large fish contaminant database and a second database populated with simulated scientific data.

查看原文本刊更多论文

跨引擎数据库连接

设计并实现了一个基于标准的开源中间件系统，以促进对大型异构数据集的分析。该系统使同时访问几种不同类型的数据库服务器、浏览远程数据、组合数据集和连接独立于供应商的远程数据库的表成为可能。该系统使用一种称为动态合并缓存的算法来处理数据缓存、查询生成、转换和连接，对源数据库的操作干扰最小。系统能够组合已配置数据库的任何子集，并将信息转换为XML。生成的XML可通过web服务提供给分析工具。系统连接到远程数据库后，将从源数据库创建元数据目录。用户能够配置要从远程数据集导出哪些表和字段。用户还可以过滤、转换和组合数据。该系统在一个大型鱼类污染物数据库和一个包含模拟科学数据的第二个数据库中进行了测试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 Eighth ACIS International Conference on Software Engineering Research, Management and Applications

自引率

0.00%

发文量