{"title":"大数据异构环境下数据放置策略研究","authors":"Anilkumar Ambore, Rani V. Udaya","doi":"10.1109/ICOEI.2019.8862676","DOIUrl":null,"url":null,"abstract":"In the present computerized world the extent of the information is expanding at an arbitrary speed. Simultaneously, need to process and examine huge amounts of information likewise expanded. In a few challenge commercial enterprise and logical programs, there's a want to technique petabytes of information in productive way day by day. Hadoop is most popularly used in data intensive applications. The current Hadoop usage accept that computing are homogeneous in nature. The capacity of tremendous amount of information in Hadoop is done using Hadoop Distributed File System (HDFS). HDFS employments block placement arrangement to part a really expansive record into pieces and place them over the cluster in a distributed way. In the present era of social networking, we cannot imagine having a cluster of homogeneous nodes only. Information region has not been taken under consideration for propelling theoretical outline assignments, since it is expected that the most mappings are data-local. To bargain with this Hadoop has usefulness to duplicate the information square where mappers are running. This makes a part of execution corruption particularly on heterogeneous cluster due to I/O delay or organize congestions In this survey we first brief introduction to the Big data, Hadoop and MapReduce. Later several data placement strategies in heterogeneous environment are studied.","PeriodicalId":212501,"journal":{"name":"2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Survey on Data Placement Strategy in Big Data Heterogeneous Environments\",\"authors\":\"Anilkumar Ambore, Rani V. Udaya\",\"doi\":\"10.1109/ICOEI.2019.8862676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the present computerized world the extent of the information is expanding at an arbitrary speed. Simultaneously, need to process and examine huge amounts of information likewise expanded. In a few challenge commercial enterprise and logical programs, there's a want to technique petabytes of information in productive way day by day. Hadoop is most popularly used in data intensive applications. The current Hadoop usage accept that computing are homogeneous in nature. The capacity of tremendous amount of information in Hadoop is done using Hadoop Distributed File System (HDFS). HDFS employments block placement arrangement to part a really expansive record into pieces and place them over the cluster in a distributed way. In the present era of social networking, we cannot imagine having a cluster of homogeneous nodes only. Information region has not been taken under consideration for propelling theoretical outline assignments, since it is expected that the most mappings are data-local. To bargain with this Hadoop has usefulness to duplicate the information square where mappers are running. This makes a part of execution corruption particularly on heterogeneous cluster due to I/O delay or organize congestions In this survey we first brief introduction to the Big data, Hadoop and MapReduce. Later several data placement strategies in heterogeneous environment are studied.\",\"PeriodicalId\":212501,\"journal\":{\"name\":\"2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOEI.2019.8862676\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOEI.2019.8862676","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Survey on Data Placement Strategy in Big Data Heterogeneous Environments
In the present computerized world the extent of the information is expanding at an arbitrary speed. Simultaneously, need to process and examine huge amounts of information likewise expanded. In a few challenge commercial enterprise and logical programs, there's a want to technique petabytes of information in productive way day by day. Hadoop is most popularly used in data intensive applications. The current Hadoop usage accept that computing are homogeneous in nature. The capacity of tremendous amount of information in Hadoop is done using Hadoop Distributed File System (HDFS). HDFS employments block placement arrangement to part a really expansive record into pieces and place them over the cluster in a distributed way. In the present era of social networking, we cannot imagine having a cluster of homogeneous nodes only. Information region has not been taken under consideration for propelling theoretical outline assignments, since it is expected that the most mappings are data-local. To bargain with this Hadoop has usefulness to duplicate the information square where mappers are running. This makes a part of execution corruption particularly on heterogeneous cluster due to I/O delay or organize congestions In this survey we first brief introduction to the Big data, Hadoop and MapReduce. Later several data placement strategies in heterogeneous environment are studied.