Mapping problems within heterogeneous distributed data base systems

ACM-SE 18 Pub Date : 1980-03-24 DOI:10.1145/503838.503877

E. Lisboa

{"title":"Mapping problems within heterogeneous distributed data base systems","authors":"E. Lisboa","doi":"10.1145/503838.503877","DOIUrl":null,"url":null,"abstract":"Supporting heterogeneous database systems in a distributed database environment requires the translation between different forms of data representation and of data manipulation while transmitting data within the network. A generally accepted approach to this translation is to define a standard database system interface (data model and data manipulation language) and to build a global schema of the distributed data. If there is a need to move data from one network node to another, the request for data is first translated into the standard interface, and then converted into the destination database interface to be evaluated. The general problem of translations between different database interfaces is known as the \"mapping problem\". In this paper, we consider the mapping problem in a heterogeneous distributed database environment.First, two components of database mappings are considered: the data mapping that expresses the source data structure's elements in terms of the target data structure's elements, and the operator mapping that translates the source operators into the corresponding target operators using as reference the data mapping.Four different types of mapping are identified according to the relationship between the source and the target data strucutes. First, the trivial mapping where both the source and the target data structure express exactly the same data structure using the same data model. In this case, the data mapping, as well as the operator mapping, is totally expressed by an identity function. Second, the schema mapping where the source and target data structure differ, even though the underlying data model is the same. Third, the model mapping where the data structures expressed in both schemas are exactly the same, but are described using different data models. Finally, the total mapping occurs when both source and target data models and data structures differ. Each of these four mapping cases presents its own inherent difficulties that are analyzed through the paper.Then mappings are considered within Distributed Data Base Systems. We propose an architectural arrangement that localizes total and schema mappings between the user and the standard interfaces, and limits mappings between the standard and local interfaces to the trivial and model mapping cases. Several advantages result from this arrangement (which does not lessen the mapping flexibility between user and local interfaces). The mappings between the standard and local interfaces are certified feasible and in their simplest form, the design and evaluation of the standard schema is facilitated and the operator decomposition and scheduling are rendered independent of schema mappings.Lastly, the paper presents the architectural framework and the mapping capabilities of a heterogeneous distributed database system being implemented at the University of Southwestern Louisiana.","PeriodicalId":431590,"journal":{"name":"ACM-SE 18","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1980-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM-SE 18","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/503838.503877","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Supporting heterogeneous database systems in a distributed database environment requires the translation between different forms of data representation and of data manipulation while transmitting data within the network. A generally accepted approach to this translation is to define a standard database system interface (data model and data manipulation language) and to build a global schema of the distributed data. If there is a need to move data from one network node to another, the request for data is first translated into the standard interface, and then converted into the destination database interface to be evaluated. The general problem of translations between different database interfaces is known as the "mapping problem". In this paper, we consider the mapping problem in a heterogeneous distributed database environment.First, two components of database mappings are considered: the data mapping that expresses the source data structure's elements in terms of the target data structure's elements, and the operator mapping that translates the source operators into the corresponding target operators using as reference the data mapping.Four different types of mapping are identified according to the relationship between the source and the target data strucutes. First, the trivial mapping where both the source and the target data structure express exactly the same data structure using the same data model. In this case, the data mapping, as well as the operator mapping, is totally expressed by an identity function. Second, the schema mapping where the source and target data structure differ, even though the underlying data model is the same. Third, the model mapping where the data structures expressed in both schemas are exactly the same, but are described using different data models. Finally, the total mapping occurs when both source and target data models and data structures differ. Each of these four mapping cases presents its own inherent difficulties that are analyzed through the paper.Then mappings are considered within Distributed Data Base Systems. We propose an architectural arrangement that localizes total and schema mappings between the user and the standard interfaces, and limits mappings between the standard and local interfaces to the trivial and model mapping cases. Several advantages result from this arrangement (which does not lessen the mapping flexibility between user and local interfaces). The mappings between the standard and local interfaces are certified feasible and in their simplest form, the design and evaluation of the standard schema is facilitated and the operator decomposition and scheduling are rendered independent of schema mappings.Lastly, the paper presents the architectural framework and the mapping capabilities of a heterogeneous distributed database system being implemented at the University of Southwestern Louisiana.

查看原文本刊更多论文

异构分布式数据库系统中的映射问题

在分布式数据库环境中支持异构数据库系统需要在网络中传输数据时在不同形式的数据表示和数据操作之间进行转换。一种普遍接受的转换方法是定义标准数据库系统接口(数据模型和数据操作语言)，并构建分布式数据的全局模式。如果需要将数据从一个网络节点移动到另一个网络节点，则首先将数据请求转换为标准接口，然后再转换为要评估的目标数据库接口。在不同数据库接口之间进行转换的一般问题被称为“映射问题”。本文研究了异构分布式数据库环境下的映射问题。首先，考虑数据库映射的两个组成部分:用目标数据结构的元素表示源数据结构元素的数据映射，以及使用数据映射作为引用将源操作符转换为相应目标操作符的操作符映射。根据源数据结构和目标数据结构之间的关系，确定了四种不同类型的映射。首先是简单的映射，其中源数据结构和目标数据结构使用相同的数据模型表达完全相同的数据结构。在这种情况下，数据映射以及操作符映射完全由恒等函数表示。第二，即使底层数据模型相同，源数据结构和目标数据结构不同的模式映射。第三，模型映射，其中两个模式中表达的数据结构完全相同，但使用不同的数据模型进行描述。最后，当源和目标数据模型和数据结构不同时，就会发生总映射。这四个映射案例中的每一个都有其固有的困难，并通过本文进行了分析。然后在分布式数据库系统中考虑映射。我们提出了一种架构安排，将用户和标准接口之间的总映射和模式映射本地化，并将标准接口和本地接口之间的映射限制为琐碎和模型映射用例。这种安排带来了几个好处(它不会降低用户和本地接口之间映射的灵活性)。证明了标准接口和本地接口之间的映射是可行的，并以其最简单的形式简化了标准模式的设计和评估，使运算符分解和调度独立于模式映射。最后，本文介绍了在西南路易斯安那大学实现的异构分布式数据库系统的体系结构框架和映射功能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM-SE 18

自引率

0.00%

发文量