{"title":"基于ParSIT的高效内存精确近似函数依赖提取","authors":"B. Tusor, A. Várkonyi-Kóczy","doi":"10.1109/INES49302.2020.9147187","DOIUrl":null,"url":null,"abstract":"In the last decade, Big Data has been presenting more and more challenges to various fields of computer science that center around data processing. Functional dependency extraction, the process of finding rules and relationships between attributes of datasets, is one such application. In this paper, a new dependency extraction method is presented for finding both exact and approximate functional dependencies, that is also memory efficient for large datasets. The proposed method is a parallelized improvement of Sequential Indexing Tables. It is evaluated through benchmark datasets and analysis is given about its time and spatial complexity.","PeriodicalId":175830,"journal":{"name":"2020 IEEE 24th International Conference on Intelligent Engineering Systems (INES)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Memory Efficient Exact and Approximate Functional Dependency Extraction with ParSIT\",\"authors\":\"B. Tusor, A. Várkonyi-Kóczy\",\"doi\":\"10.1109/INES49302.2020.9147187\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last decade, Big Data has been presenting more and more challenges to various fields of computer science that center around data processing. Functional dependency extraction, the process of finding rules and relationships between attributes of datasets, is one such application. In this paper, a new dependency extraction method is presented for finding both exact and approximate functional dependencies, that is also memory efficient for large datasets. The proposed method is a parallelized improvement of Sequential Indexing Tables. It is evaluated through benchmark datasets and analysis is given about its time and spatial complexity.\",\"PeriodicalId\":175830,\"journal\":{\"name\":\"2020 IEEE 24th International Conference on Intelligent Engineering Systems (INES)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 24th International Conference on Intelligent Engineering Systems (INES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INES49302.2020.9147187\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 24th International Conference on Intelligent Engineering Systems (INES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES49302.2020.9147187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Memory Efficient Exact and Approximate Functional Dependency Extraction with ParSIT
In the last decade, Big Data has been presenting more and more challenges to various fields of computer science that center around data processing. Functional dependency extraction, the process of finding rules and relationships between attributes of datasets, is one such application. In this paper, a new dependency extraction method is presented for finding both exact and approximate functional dependencies, that is also memory efficient for large datasets. The proposed method is a parallelized improvement of Sequential Indexing Tables. It is evaluated through benchmark datasets and analysis is given about its time and spatial complexity.