Analysis of Joins and Semi-joins in Centralized and Distributed Database Queries

Manik Sharma, G. Singh
DOI: 10.1109/ICCS.2012.15 (https://doi.org/10.1109/ICCS.2012.15)
Published in: 2012 International Conference on Computing Sciences
Publication date: 2012-09-14
Citations: 5

Abstract

A database is defined as a collection of files or tables, whereas a DBMS (Database Management System) is a collection of unified programs used to manage the overall activities of the database. The two dominant approaches to storing and managing a database are the centralized database management system, in which data is placed at a central location, and the distributed database management system, in which data is distributed over several locations. Independent of the approach used, one of the foremost issues is retrieving data that spans multiple tables, from the central repository in a centralized database and from a number of sites in a distributed database. Joins and semi-joins are primitive operations used to extract the required information from one, two, or multiple tables. This paper focuses on computing and analyzing the performance of joins and semi-joins in centralized as well as distributed database systems. The metrics considered in this analysis are Query Cost, Memory Used, CPU Cost, Input/Output Cost, Sort Operations, Data Transmission, Total Time, and Response Time. In short, the intention of this study is to compare and contrast the behavior of the join and semi-join approaches in centralized and distributed database systems.
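To make the join versus semi-join distinction concrete, the following is a minimal Python sketch, not the authors' experimental setup. The tables, column names, site assignment, and the `cells` transmission measure are all hypothetical and chosen only for illustration. It contrasts shipping a whole table to the joining site with first shipping only the join-key values and then returning the matching rows.

```python
# Hypothetical data: employees live at site A, departments at site B.
employees = [
    {"emp_id": 1, "name": "Asha", "dept_id": 10},
    {"emp_id": 2, "name": "Ravi", "dept_id": 20},
    {"emp_id": 3, "name": "Meena", "dept_id": 10},
]
departments = [
    {"dept_id": 10, "dept_name": "Sales"},
    {"dept_id": 20, "dept_name": "Research"},
    {"dept_id": 30, "dept_name": "Legal"},
    {"dept_id": 40, "dept_name": "Audit"},
]


def join(left, right, key):
    """Equi-join: merge every pair of rows that agree on `key`."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]


def semi_join(left, right, key):
    """Semi-join: rows of `left` with at least one match in `right`.
    Only columns of `left` appear in the result."""
    keys = {row[key] for row in right}
    return [row for row in left if row[key] in keys]


def cells(rows):
    """Crude stand-in for data transmission: number of values shipped."""
    return sum(len(row) for row in rows)


# Strategy 1 (plain join): ship all of departments from B to A, join at A.
shipped_plain = cells(departments)
result_plain = join(employees, departments, "dept_id")

# Strategy 2 (semi-join): ship the distinct join keys from A to B,
# reduce departments at B, then ship only the matching rows back to A.
join_keys = [{"dept_id": k} for k in sorted({e["dept_id"] for e in employees})]
reduced = semi_join(departments, join_keys, "dept_id")
shipped_semi = cells(join_keys) + cells(reduced)
result_semi = join(employees, reduced, "dept_id")

print(shipped_plain, shipped_semi, result_plain == result_semi)
```

Both strategies produce the same joined rows; in this toy data the semi-join strategy ships fewer values because half of the departments have no matching employee. That saving is not guaranteed: when most rows match, the extra round trip for the keys can make the semi-join the more expensive option.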