基于LZO压缩的HDFS大数据检索

2019 International Conference on Advances in Computing and Communication Engineering (ICACCE) Pub Date : 2019-04-01 DOI:10.1109/ICACCE46606.2019.9079993

T. Prasanth, K. Aarthi, M. Gunasekaran

{"title":"基于LZO压缩的HDFS大数据检索","authors":"T. Prasanth, K. Aarthi, M. Gunasekaran","doi":"10.1109/ICACCE46606.2019.9079993","DOIUrl":null,"url":null,"abstract":"Any type of organization depends on accurate data analytics to make better decisions. Users of these organizations request access from different resources like processes or executors. When processing this request of users, the data retrieval speed is low and also data is inaccurate for some conditions. To solve this issue, a system may be proposed having Hadoop Distributed File system (HDFS) with Lempel-Ziv-Oberhumer(LZO). The first step in the proposed technique is to retrieve and mine the data from respective database. The next step is to cluster the extracted data and optimize it using HDFS and LZO compression method. In the last step, if the compressed data is found similar to user requested data, the final data has to be visualized to the user. The proposed retrieving process in big data gives better performance and reduced execution time.","PeriodicalId":317123,"journal":{"name":"2019 International Conference on Advances in Computing and Communication Engineering (ICACCE)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Big Data Retrieval using HDFS with LZO Compression\",\"authors\":\"T. Prasanth, K. Aarthi, M. Gunasekaran\",\"doi\":\"10.1109/ICACCE46606.2019.9079993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Any type of organization depends on accurate data analytics to make better decisions. Users of these organizations request access from different resources like processes or executors. When processing this request of users, the data retrieval speed is low and also data is inaccurate for some conditions. To solve this issue, a system may be proposed having Hadoop Distributed File system (HDFS) with Lempel-Ziv-Oberhumer(LZO). The first step in the proposed technique is to retrieve and mine the data from respective database. The next step is to cluster the extracted data and optimize it using HDFS and LZO compression method. In the last step, if the compressed data is found similar to user requested data, the final data has to be visualized to the user. The proposed retrieving process in big data gives better performance and reduced execution time.\",\"PeriodicalId\":317123,\"journal\":{\"name\":\"2019 International Conference on Advances in Computing and Communication Engineering (ICACCE)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advances in Computing and Communication Engineering (ICACCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACCE46606.2019.9079993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advances in Computing and Communication Engineering (ICACCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACCE46606.2019.9079993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

任何类型的组织都依赖于准确的数据分析来做出更好的决策。这些组织的用户从不同的资源(如流程或执行者)请求访问。在处理用户的这一请求时，数据检索速度较慢，而且在某些情况下数据不准确。为了解决这个问题，一个系统可能会被提议使用具有LZO (Lempel-Ziv-Oberhumer)特性的HDFS (Hadoop Distributed File system)。该技术的第一步是从各自的数据库中检索和挖掘数据。下一步是将提取的数据进行聚类，并使用HDFS和LZO压缩方法进行优化。在最后一步中，如果发现压缩数据与用户请求的数据相似，则必须将最终数据可视化给用户。提出的大数据检索流程具有更好的性能和更短的执行时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Big Data Retrieval using HDFS with LZO Compression

Any type of organization depends on accurate data analytics to make better decisions. Users of these organizations request access from different resources like processes or executors. When processing this request of users, the data retrieval speed is low and also data is inaccurate for some conditions. To solve this issue, a system may be proposed having Hadoop Distributed File system (HDFS) with Lempel-Ziv-Oberhumer(LZO). The first step in the proposed technique is to retrieve and mine the data from respective database. The next step is to cluster the extracted data and optimize it using HDFS and LZO compression method. In the last step, if the compressed data is found similar to user requested data, the final data has to be visualized to the user. The proposed retrieving process in big data gives better performance and reduced execution time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 International Conference on Advances in Computing and Communication Engineering (ICACCE)

自引率

0.00%

发文量