{"title":"使用基于代理的SysML工具箱建模和模拟联邦数据库,以便对联邦搜索进行早期验证","authors":"S. Melzer, Stefan Thiemann, R. Möller","doi":"10.1109/SysCon48628.2021.9447055","DOIUrl":null,"url":null,"abstract":"In research and cultural institutions as well as in companies, many data are collected and increasingly stored in databases. For example, during the project EDAK (Epigraphische Datenbank zum antiken Kleinasien) the Department for Ancient History at the Universität Hamburg created an epigraphic database of ancient Asia Minor. This database contains a collection of Greek and Latin inscriptions from modern-day Turkey. For analyzing data, a user usually sends a query to one database and receives an answer. However, it is not always sufficient to use one database only. For example, the fragment AO 29196 is located at the Louvre. The counterpart of this fragment is the fragment KUG 15 which is located in Germany. Indeed, both fragments were found without database federation, but in the future, for analyzing data from different databases it would be desirable to find related data in a FDBS. This example highlights the need to sometimes combine, analyze, and query data from different databases. Database federation offers a logical centralization of data without the need to change physical implementation of the databases. Thus, the complexity of query execution increases. Users send queries to the FDBS, and then the FDBS forwards queries to each (relevant) database node. The answer to queries vary. The response may be correct, incorrect, or incomplete. To overcome this problem, theoretical foundations have been developed. Although theories exist for implementing FDBSs, in practice, the implementation of a FDBS is a complex task. When using the currently-existing FDBS such as Denodo, a programmer or engineer still has to develop a congruent overview for querying internal as well as external databases. An FDBS is often expensive to develop. We predict that early validation of federated searches may identify problems which need to be solved before implementation. Therefore, we recommend carrying out a feasibility study before a FDBS is set up. In this paper we present how to model and simulate federated databases before implementation. The development of database federation for early validation of federated searches is challenging due to the distribution, heterogeneity, and autonomy. In order to support the development, established methods, tools, and languages for modeling, simulating, and validating systems are useful. Developers are supported in model-based development using the SysML, the modeling and simulation tool Cameo Systems Modeler and the broker-based SysML Toolbox. For modeling and simulation, we use the tool Cameo Systems Modeler and the broker-based SysML Toolbox. In addition, we evaluate our model-based approach by feasibility study by means of a prototypical implementation of federated databases. This paper is structured as follows. First, the databases are briefly described which are used for the feasibility study. Second, the broker-based SysML Toolbox, as an extension of the tool Cameo Systems Modeler, is introduced which contributes in creating communications networks. Third, broker federation is explained, followed by describing the characteristics of federated databases and the challenges providing federated searches. Subsequently the model-based approach for modeling and simulating an FDBS with low coding efforts is demonstrated. Afterwards an evaluation for federated searches is presented. Finally, a short conclusion is given.","PeriodicalId":384949,"journal":{"name":"2021 IEEE International Systems Conference (SysCon)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox\",\"authors\":\"S. Melzer, Stefan Thiemann, R. Möller\",\"doi\":\"10.1109/SysCon48628.2021.9447055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In research and cultural institutions as well as in companies, many data are collected and increasingly stored in databases. For example, during the project EDAK (Epigraphische Datenbank zum antiken Kleinasien) the Department for Ancient History at the Universität Hamburg created an epigraphic database of ancient Asia Minor. This database contains a collection of Greek and Latin inscriptions from modern-day Turkey. For analyzing data, a user usually sends a query to one database and receives an answer. However, it is not always sufficient to use one database only. For example, the fragment AO 29196 is located at the Louvre. The counterpart of this fragment is the fragment KUG 15 which is located in Germany. Indeed, both fragments were found without database federation, but in the future, for analyzing data from different databases it would be desirable to find related data in a FDBS. This example highlights the need to sometimes combine, analyze, and query data from different databases. Database federation offers a logical centralization of data without the need to change physical implementation of the databases. Thus, the complexity of query execution increases. Users send queries to the FDBS, and then the FDBS forwards queries to each (relevant) database node. The answer to queries vary. The response may be correct, incorrect, or incomplete. To overcome this problem, theoretical foundations have been developed. Although theories exist for implementing FDBSs, in practice, the implementation of a FDBS is a complex task. When using the currently-existing FDBS such as Denodo, a programmer or engineer still has to develop a congruent overview for querying internal as well as external databases. An FDBS is often expensive to develop. We predict that early validation of federated searches may identify problems which need to be solved before implementation. Therefore, we recommend carrying out a feasibility study before a FDBS is set up. In this paper we present how to model and simulate federated databases before implementation. The development of database federation for early validation of federated searches is challenging due to the distribution, heterogeneity, and autonomy. In order to support the development, established methods, tools, and languages for modeling, simulating, and validating systems are useful. Developers are supported in model-based development using the SysML, the modeling and simulation tool Cameo Systems Modeler and the broker-based SysML Toolbox. For modeling and simulation, we use the tool Cameo Systems Modeler and the broker-based SysML Toolbox. In addition, we evaluate our model-based approach by feasibility study by means of a prototypical implementation of federated databases. This paper is structured as follows. First, the databases are briefly described which are used for the feasibility study. Second, the broker-based SysML Toolbox, as an extension of the tool Cameo Systems Modeler, is introduced which contributes in creating communications networks. Third, broker federation is explained, followed by describing the characteristics of federated databases and the challenges providing federated searches. Subsequently the model-based approach for modeling and simulating an FDBS with low coding efforts is demonstrated. Afterwards an evaluation for federated searches is presented. Finally, a short conclusion is given.\",\"PeriodicalId\":384949,\"journal\":{\"name\":\"2021 IEEE International Systems Conference (SysCon)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Systems Conference (SysCon)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SysCon48628.2021.9447055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Systems Conference (SysCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SysCon48628.2021.9447055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
在研究和文化机构以及公司中,许多数据被收集并越来越多地存储在数据库中。例如,在EDAK(铭文数据库zum antiken Kleinasien)项目期间,Universität汉堡的古代史部门创建了一个古代小亚细亚铭文数据库。这个数据库包含了来自现代土耳其的希腊语和拉丁语铭文的集合。为了分析数据,用户通常向一个数据库发送查询并接收答案。但是,只使用一个数据库并不总是足够的。例如,位于卢浮宫的AO 29196碎片。与此片段相对应的是位于德国的片段KUG 15。实际上,这两个片段都是在没有数据库联合的情况下找到的,但是在将来,为了分析来自不同数据库的数据,希望在FDBS中找到相关的数据。这个例子强调了有时需要组合、分析和查询来自不同数据库的数据。数据库联合提供了数据的逻辑集中化,而不需要更改数据库的物理实现。因此,查询执行的复杂性增加了。用户向FDBS发送查询,然后FDBS将查询转发给每个(相关)数据库节点。对这些问题的回答各不相同。回答可能是正确的、不正确的或不完整的。为了克服这一问题,已经建立了理论基础。虽然有理论支持FDBS的实施,但在实践中,FDBS的实施是一项复杂的任务。当使用当前存在的FDBS(如Denodo)时,程序员或工程师仍然必须为查询内部和外部数据库开发一致的概述。开发FDBS通常是昂贵的。我们预测,联邦搜索的早期验证可能会识别出在实现之前需要解决的问题。因此,我们建议在设立FDBS前进行可行性研究。在本文中,我们介绍了如何在实现之前对联邦数据库进行建模和模拟。由于分布、异构性和自主性,开发用于联邦搜索早期验证的数据库联邦具有挑战性。为了支持开发,用于建模、模拟和验证系统的已建立的方法、工具和语言是有用的。开发人员可以使用SysML、建模和仿真工具Cameo Systems Modeler以及基于代理的SysML工具箱进行基于模型的开发。对于建模和仿真,我们使用camameo Systems Modeler工具和基于代理的SysML工具箱。此外,我们通过对联邦数据库的原型实现进行可行性研究来评估基于模型的方法。本文的结构如下。首先,简要介绍了用于可行性研究的数据库。其次,介绍了基于代理的SysML工具箱,作为Cameo Systems Modeler工具的扩展,它有助于创建通信网络。第三,解释代理联合,然后描述联邦数据库的特征和提供联邦搜索的挑战。在此基础上,提出了一种基于模型的低编码FDBS建模与仿真方法。然后给出了对联邦搜索的评价。最后,给出了一个简短的结论。
Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox
In research and cultural institutions as well as in companies, many data are collected and increasingly stored in databases. For example, during the project EDAK (Epigraphische Datenbank zum antiken Kleinasien) the Department for Ancient History at the Universität Hamburg created an epigraphic database of ancient Asia Minor. This database contains a collection of Greek and Latin inscriptions from modern-day Turkey. For analyzing data, a user usually sends a query to one database and receives an answer. However, it is not always sufficient to use one database only. For example, the fragment AO 29196 is located at the Louvre. The counterpart of this fragment is the fragment KUG 15 which is located in Germany. Indeed, both fragments were found without database federation, but in the future, for analyzing data from different databases it would be desirable to find related data in a FDBS. This example highlights the need to sometimes combine, analyze, and query data from different databases. Database federation offers a logical centralization of data without the need to change physical implementation of the databases. Thus, the complexity of query execution increases. Users send queries to the FDBS, and then the FDBS forwards queries to each (relevant) database node. The answer to queries vary. The response may be correct, incorrect, or incomplete. To overcome this problem, theoretical foundations have been developed. Although theories exist for implementing FDBSs, in practice, the implementation of a FDBS is a complex task. When using the currently-existing FDBS such as Denodo, a programmer or engineer still has to develop a congruent overview for querying internal as well as external databases. An FDBS is often expensive to develop. We predict that early validation of federated searches may identify problems which need to be solved before implementation. Therefore, we recommend carrying out a feasibility study before a FDBS is set up. In this paper we present how to model and simulate federated databases before implementation. The development of database federation for early validation of federated searches is challenging due to the distribution, heterogeneity, and autonomy. In order to support the development, established methods, tools, and languages for modeling, simulating, and validating systems are useful. Developers are supported in model-based development using the SysML, the modeling and simulation tool Cameo Systems Modeler and the broker-based SysML Toolbox. For modeling and simulation, we use the tool Cameo Systems Modeler and the broker-based SysML Toolbox. In addition, we evaluate our model-based approach by feasibility study by means of a prototypical implementation of federated databases. This paper is structured as follows. First, the databases are briefly described which are used for the feasibility study. Second, the broker-based SysML Toolbox, as an extension of the tool Cameo Systems Modeler, is introduced which contributes in creating communications networks. Third, broker federation is explained, followed by describing the characteristics of federated databases and the challenges providing federated searches. Subsequently the model-based approach for modeling and simulating an FDBS with low coding efforts is demonstrated. Afterwards an evaluation for federated searches is presented. Finally, a short conclusion is given.