A Protection Layer over MapReduce Framework for Big Data Privacy

International Journal of Computer and Information Technology(2279-0764) Pub Date : 2022-06-11 DOI:10.24203/ijcit.v11i2.263

Hidayath Ali Baig

{"title":"A Protection Layer over MapReduce Framework for Big Data Privacy","authors":"Hidayath Ali Baig","doi":"10.24203/ijcit.v11i2.263","DOIUrl":null,"url":null,"abstract":"In many organizations, big data analytics has become a trend in gathering valuable data insights. The framework MapReduce, which is generally used for this purpose, has been accepted by most organizations for its exceptional characteristics. However, because of the availability of significant processing resources, dispersed privacy-sensitive details can be collected quickly, increasing the widespread privacy concerns. This article reviews some of the existing research articles on the MapReduce framework's privacy issues and proposes an additional layer of privacy protection over the adopted framework. The data is split into bits and processed in the clouds, and two other steps are taken. Hadoop splits the file into bits of a smaller scale. The task tracker then allocates these bits to several mappers. First, the data is split up into key-value pairs, and the intermediate data sets are generated. The efficiency of the suggested approach may then be effectively interpreted. Overall, the proposed method provides improved scalability. The following figures compare execution time with relation to file size and the number of partitions. As privacy protection technique is used, the loss of data content can be appropriately handled. It has been demonstrated that MRPL outperforms current methods in terms of CPU optimization, memory usage, and reduced information loss. Research reveals that the suggested strategy creates significant advantages for Big Data by enhancing privacy and protection. MRPL can considerably solve the privacy issues in Big Data.","PeriodicalId":359510,"journal":{"name":"International Journal of Computer and Information Technology(2279-0764)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer and Information Technology(2279-0764)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24203/ijcit.v11i2.263","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In many organizations, big data analytics has become a trend in gathering valuable data insights. The framework MapReduce, which is generally used for this purpose, has been accepted by most organizations for its exceptional characteristics. However, because of the availability of significant processing resources, dispersed privacy-sensitive details can be collected quickly, increasing the widespread privacy concerns. This article reviews some of the existing research articles on the MapReduce framework's privacy issues and proposes an additional layer of privacy protection over the adopted framework. The data is split into bits and processed in the clouds, and two other steps are taken. Hadoop splits the file into bits of a smaller scale. The task tracker then allocates these bits to several mappers. First, the data is split up into key-value pairs, and the intermediate data sets are generated. The efficiency of the suggested approach may then be effectively interpreted. Overall, the proposed method provides improved scalability. The following figures compare execution time with relation to file size and the number of partitions. As privacy protection technique is used, the loss of data content can be appropriately handled. It has been demonstrated that MRPL outperforms current methods in terms of CPU optimization, memory usage, and reduced information loss. Research reveals that the suggested strategy creates significant advantages for Big Data by enhancing privacy and protection. MRPL can considerably solve the privacy issues in Big Data.

查看原文本刊更多论文

基于MapReduce框架的大数据隐私保护层

在许多组织中，大数据分析已经成为收集有价值数据见解的趋势。通常用于此目的的框架MapReduce因其独特的特性而被大多数组织所接受。然而，由于大量处理资源的可用性，可以快速收集分散的隐私敏感细节，从而增加了广泛的隐私问题。本文回顾了一些关于MapReduce框架隐私问题的现有研究文章，并提出了在采用的框架之上增加一层隐私保护的建议。数据被分割成比特，在云中进行处理，然后再进行另外两个步骤。Hadoop将文件分成更小的位。然后，任务跟踪器将这些位分配给几个映射器。首先，将数据拆分为键值对，并生成中间数据集。这样就可以有效地解释所建议的方法的效率。总的来说，所提出的方法提供了改进的可伸缩性。下图比较了执行时间与文件大小和分区数量的关系。由于采用了隐私保护技术，数据内容的丢失可以得到适当的处理。已经证明，MRPL在CPU优化、内存使用和减少信息丢失方面优于当前的方法。研究表明，建议的策略通过加强隐私和保护，为大数据创造了显著的优势。MRPL可以很好地解决大数据中的隐私问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Computer and Information Technology(2279-0764)

自引率

0.00%

发文量