Data Distribution and Distributed Transaction Management

Wilfried Lemahieu, S. V. Broucke, B. Baesens
{"title":"Data Distribution and Distributed Transaction Management","authors":"Wilfried Lemahieu, S. V. Broucke, B. Baesens","doi":"10.1017/9781316888773.018","DOIUrl":null,"url":null,"abstract":"Chapter Objectives In this chapter, you will learn to: • grasp the basics of distributed systems and distributed databases; • discern key architectural implications of distributed databases; • understand the impact of fragmentation, allocation, and replication; • identify different types of transparency; • understand the steps in distributed query processing; • understand distributed transaction management and concurrency control; • grasp the impact of eventual consistency and BASE transactions. Opening Scenario As Sober envisions growing as part of its long-term strategy, it wants to have a careful understanding of the data implications involved. More specifically, the company wants to know if it would make sense to distribute its data across a network of offices and work with a distributed database. Sober wants to know the impact of data distribution on query processing and optimization, transaction management, and concurrency control. In this chapter, we focus on the specifics of distributed databases (i.e., systems in which the data and DBMS functionality are distributed over different nodes or locations on a network). First, we discuss the general properties of distributed systems and offer an overview of some architectural variants of distributed database systems. Then, we tackle the different ways of distributing data over nodes in a network, including the possibility of data replication. We also focus on the degree to which the data distribution can be made transparent to applications and users. Then, we discuss the complexity of query processing and query optimization in a distributed setting. A next section is dedicated to distributed transaction management and concurrency control, focusing on both tightly coupled and loosely coupled settings. The last section overviews the particularities of transaction management in Big Data and NoSQL databases, which are often distributed in a cluster set-up, presenting BASE transactions as an alternative to the traditional ACID transaction paradigms. Distributed Systems and Distributed Databases Ever since the early days of computing, which were dominated by monolithic mainframes, distributed systems have had their place in the ICT landscape. A distributed computing system consists of several processing units or nodes with a certain level of autonomy, which are interconnected by a network and which cooperatively perform complex tasks. These complex tasks can be divided into subtasks as performed by the individual nodes.","PeriodicalId":186558,"journal":{"name":"Principles of Database Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Principles of Database Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/9781316888773.018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Chapter Objectives In this chapter, you will learn to: • grasp the basics of distributed systems and distributed databases; • discern key architectural implications of distributed databases; • understand the impact of fragmentation, allocation, and replication; • identify different types of transparency; • understand the steps in distributed query processing; • understand distributed transaction management and concurrency control; • grasp the impact of eventual consistency and BASE transactions. Opening Scenario As Sober envisions growing as part of its long-term strategy, it wants to have a careful understanding of the data implications involved. More specifically, the company wants to know if it would make sense to distribute its data across a network of offices and work with a distributed database. Sober wants to know the impact of data distribution on query processing and optimization, transaction management, and concurrency control. In this chapter, we focus on the specifics of distributed databases (i.e., systems in which the data and DBMS functionality are distributed over different nodes or locations on a network). First, we discuss the general properties of distributed systems and offer an overview of some architectural variants of distributed database systems. Then, we tackle the different ways of distributing data over nodes in a network, including the possibility of data replication. We also focus on the degree to which the data distribution can be made transparent to applications and users. Then, we discuss the complexity of query processing and query optimization in a distributed setting. A next section is dedicated to distributed transaction management and concurrency control, focusing on both tightly coupled and loosely coupled settings. The last section overviews the particularities of transaction management in Big Data and NoSQL databases, which are often distributed in a cluster set-up, presenting BASE transactions as an alternative to the traditional ACID transaction paradigms. Distributed Systems and Distributed Databases Ever since the early days of computing, which were dominated by monolithic mainframes, distributed systems have had their place in the ICT landscape. A distributed computing system consists of several processing units or nodes with a certain level of autonomy, which are interconnected by a network and which cooperatively perform complex tasks. These complex tasks can be divided into subtasks as performed by the individual nodes.
数据分布和分布式事务管理
在本章中,您将学习:•掌握分布式系统和分布式数据库的基础知识;•辨别分布式数据库的关键架构含义;•了解碎片、分配和复制的影响;•确定不同类型的透明度;•了解分布式查询处理的步骤;•了解分布式事务管理和并发控制;•掌握最终一致性和BASE事务的影响。随着醒酒公司将增长作为其长期战略的一部分,它希望仔细了解所涉及的数据影响。更具体地说,该公司想知道在办公室网络中分发数据并使用分布式数据库是否有意义。Sober想知道数据分布对查询处理和优化、事务管理和并发控制的影响。在本章中,我们将重点关注分布式数据库(即数据和DBMS功能分布在网络上不同节点或位置的系统)的细节。首先,我们讨论分布式系统的一般属性,并概述分布式数据库系统的一些体系结构变体。然后,我们处理在网络节点上分布数据的不同方式,包括数据复制的可能性。我们还关注数据分布对应用程序和用户透明的程度。然后,我们讨论了分布式环境下查询处理和查询优化的复杂性。下一节将专门讨论分布式事务管理和并发控制,重点关注紧耦合和松耦合设置。最后一节概述了大数据和NoSQL数据库中事务管理的特殊性,这些数据库通常分布在集群设置中,将BASE事务作为传统ACID事务范例的替代方案。分布式系统和分布式数据库自从由大型主机主导的早期计算时代以来,分布式系统就在ICT领域占有一席之地。分布式计算系统由若干具有一定自治程度的处理单元或节点组成,这些处理单元或节点通过网络相互连接,协同执行复杂任务。这些复杂的任务可以分成子任务,由各个节点执行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信