Hatim Alsuwat, Emad Alsuwat, Tieming Geng, Chin-Tser Huang, C. Farkas
{"title":"Data Dependencies Preserving Shuffle in Relational Database","authors":"Hatim Alsuwat, Emad Alsuwat, Tieming Geng, Chin-Tser Huang, C. Farkas","doi":"10.1109/ICDIS.2019.00034","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem that database shuffling algorithms do not preserve data dependencies. We introduce an approach for preserving functional dependencies and data-driven associations during database shuffle. We use Boyce-Codd Normal Form (BCNF) decomposition for preserving functional dependencies. Given a relation R that is not in BCNF form, we recommend to decompose R into BCNF relations R1, ..., Rn. Each Ri (i = 1, ...,n) is shuffled then rejoined to create the shuffled relation. Our approach guarantees losslessness and preserves functional dependencies. Data-driven associations may also be lost during database shuffling. For this, we generate the transitive closure of attributes that are associated. We require that the associated attributed are shuffled together. We also present our theoretical and empirical results.","PeriodicalId":181673,"journal":{"name":"2019 2nd International Conference on Data Intelligence and Security (ICDIS)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 2nd International Conference on Data Intelligence and Security (ICDIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIS.2019.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper addresses the problem that database shuffling algorithms do not preserve data dependencies. We introduce an approach for preserving functional dependencies and data-driven associations during database shuffle. We use Boyce-Codd Normal Form (BCNF) decomposition for preserving functional dependencies. Given a relation R that is not in BCNF form, we recommend to decompose R into BCNF relations R1, ..., Rn. Each Ri (i = 1, ...,n) is shuffled then rejoined to create the shuffled relation. Our approach guarantees losslessness and preserves functional dependencies. Data-driven associations may also be lost during database shuffling. For this, we generate the transitive closure of attributes that are associated. We require that the associated attributed are shuffled together. We also present our theoretical and empirical results.
本文解决了数据库洗牌算法不能保持数据依赖关系的问题。我们介绍了一种在数据库洗牌过程中保留功能依赖关系和数据驱动关联的方法。我们使用Boyce-Codd范式(BCNF)分解来保持功能依赖。给定一个非BCNF形式的关系R,我们建议将R分解为BCNF关系R1,…Rn,。每个Ri (i = 1,…,n)被洗牌,然后重新连接以创建洗牌关系。我们的方法保证了无损性并保留了功能依赖性。数据驱动的关联也可能在数据库变换期间丢失。为此,我们生成关联属性的传递闭包。我们需要将相关的属性混在一起。我们还提出了我们的理论和实证结果。