M. Adnan, Muhammad Afzal, M. Aslam, Roohi Jan, A. MARTÍNEZ-ENRIQUEZ
{"title":"使用基于Hadoop架构的云计算最小化大数据问题","authors":"M. Adnan, Muhammad Afzal, M. Aslam, Roohi Jan, A. MARTÍNEZ-ENRIQUEZ","doi":"10.1109/HONET.2014.7029370","DOIUrl":null,"url":null,"abstract":"This paper emphasizes importance and solution of big data problems through cloud computing. Knowledge embedded in big data generated by sensors, personal computers and mobile devices is compelling many companies to spend millions of dollars to solve problems of information and knowledge extraction to make intelligent decisions in time for the growth of their businesses. Google BigQuery, Rackspace Big Data Cloud, Amazon Web Services are some platforms that are providing limited solutions and infrastructures to deal with big data problems. However, our study motivates IT companies to use open source Hadoop architecture to develop cloud systems for reliable distributed computing to process their large data sets efficiently and effectively. Our main guideline is to resolve the big data through a company's own infrastructure and integrating various other big data infrastructures into their clouds. Also that, Hadoop reduce/map technique can be implemented on the clusters within and across the private and public clouds.","PeriodicalId":297826,"journal":{"name":"2014 11th Annual High Capacity Optical Networks and Emerging/Enabling Technologies (Photonics for Energy)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Minimizing big data problems using cloud computing based on Hadoop architecture\",\"authors\":\"M. Adnan, Muhammad Afzal, M. Aslam, Roohi Jan, A. MARTÍNEZ-ENRIQUEZ\",\"doi\":\"10.1109/HONET.2014.7029370\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper emphasizes importance and solution of big data problems through cloud computing. Knowledge embedded in big data generated by sensors, personal computers and mobile devices is compelling many companies to spend millions of dollars to solve problems of information and knowledge extraction to make intelligent decisions in time for the growth of their businesses. Google BigQuery, Rackspace Big Data Cloud, Amazon Web Services are some platforms that are providing limited solutions and infrastructures to deal with big data problems. However, our study motivates IT companies to use open source Hadoop architecture to develop cloud systems for reliable distributed computing to process their large data sets efficiently and effectively. Our main guideline is to resolve the big data through a company's own infrastructure and integrating various other big data infrastructures into their clouds. Also that, Hadoop reduce/map technique can be implemented on the clusters within and across the private and public clouds.\",\"PeriodicalId\":297826,\"journal\":{\"name\":\"2014 11th Annual High Capacity Optical Networks and Emerging/Enabling Technologies (Photonics for Energy)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-02-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 11th Annual High Capacity Optical Networks and Emerging/Enabling Technologies (Photonics for Energy)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HONET.2014.7029370\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 11th Annual High Capacity Optical Networks and Emerging/Enabling Technologies (Photonics for Energy)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HONET.2014.7029370","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21
摘要
本文强调了通过云计算解决大数据问题的重要性和方法。传感器、个人电脑和移动设备产生的大数据中蕴含的知识,迫使许多公司花费数百万美元来解决信息和知识提取问题,以便及时做出明智的决策,促进业务增长。Google BigQuery、Rackspace大数据云、Amazon Web Services等平台提供了有限的解决方案和基础设施来处理大数据问题。然而,我们的研究促使IT公司使用开源Hadoop架构来开发可靠的分布式计算云系统,以高效有效地处理其大型数据集。我们的主要方针是通过公司自己的基础设施解决大数据问题,并将各种其他大数据基础设施集成到他们的云中。此外,Hadoop reduce/map技术可以在私有云和公共云之间的集群上实现。
Minimizing big data problems using cloud computing based on Hadoop architecture
This paper emphasizes importance and solution of big data problems through cloud computing. Knowledge embedded in big data generated by sensors, personal computers and mobile devices is compelling many companies to spend millions of dollars to solve problems of information and knowledge extraction to make intelligent decisions in time for the growth of their businesses. Google BigQuery, Rackspace Big Data Cloud, Amazon Web Services are some platforms that are providing limited solutions and infrastructures to deal with big data problems. However, our study motivates IT companies to use open source Hadoop architecture to develop cloud systems for reliable distributed computing to process their large data sets efficiently and effectively. Our main guideline is to resolve the big data through a company's own infrastructure and integrating various other big data infrastructures into their clouds. Also that, Hadoop reduce/map technique can be implemented on the clusters within and across the private and public clouds.