{"title":"云计算环境下基于Rabin CDC和MD5的安全重复数据删除机制","authors":"Himshai Kambo, Bharati Sinha","doi":"10.1109/RTEICT.2017.8256626","DOIUrl":null,"url":null,"abstract":"Chunking is a technique that splits a whole file or data into separate chunks. Chunking is applied to detect duplication in any remote processes such as data deduplication and data compression. Content defined chunking is a process of splitting the file into variable length chunks as per the cut points defined initially. Also these are more challenging as byte shifting is involved which results to be expected to increase the deduplication process. While processing out content defined Chunking onto cloud based Dedup App for the encrypted data stored in cloud results in improvement in efficiency. In this paper, we have proposed a technique which includes applying Rabin CDC for purpose of making variable size chunks and further processing that data chunks for encrypted data on cloud. This works on same data being uploaded by different users on same cloud storage (Hadoop Distributed File System).","PeriodicalId":342831,"journal":{"name":"2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Secure data deduplication mechanism based on Rabin CDC and MD5 in cloud computing environment\",\"authors\":\"Himshai Kambo, Bharati Sinha\",\"doi\":\"10.1109/RTEICT.2017.8256626\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Chunking is a technique that splits a whole file or data into separate chunks. Chunking is applied to detect duplication in any remote processes such as data deduplication and data compression. Content defined chunking is a process of splitting the file into variable length chunks as per the cut points defined initially. Also these are more challenging as byte shifting is involved which results to be expected to increase the deduplication process. While processing out content defined Chunking onto cloud based Dedup App for the encrypted data stored in cloud results in improvement in efficiency. In this paper, we have proposed a technique which includes applying Rabin CDC for purpose of making variable size chunks and further processing that data chunks for encrypted data on cloud. This works on same data being uploaded by different users on same cloud storage (Hadoop Distributed File System).\",\"PeriodicalId\":342831,\"journal\":{\"name\":\"2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RTEICT.2017.8256626\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTEICT.2017.8256626","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Secure data deduplication mechanism based on Rabin CDC and MD5 in cloud computing environment
Chunking is a technique that splits a whole file or data into separate chunks. Chunking is applied to detect duplication in any remote processes such as data deduplication and data compression. Content defined chunking is a process of splitting the file into variable length chunks as per the cut points defined initially. Also these are more challenging as byte shifting is involved which results to be expected to increase the deduplication process. While processing out content defined Chunking onto cloud based Dedup App for the encrypted data stored in cloud results in improvement in efficiency. In this paper, we have proposed a technique which includes applying Rabin CDC for purpose of making variable size chunks and further processing that data chunks for encrypted data on cloud. This works on same data being uploaded by different users on same cloud storage (Hadoop Distributed File System).