{"title":"数据并行编程模型综述","authors":"K. Hou, Jing Zhang, Junhuai Li","doi":"10.1109/ICCSE.2012.6295154","DOIUrl":null,"url":null,"abstract":"Data-parallel programming model (DPPM for short) specialized for data-intensive computing becomes considerable popular because it simplifies the development of distributed parallel programs. DPPMs are classified into two categories: 1) MapReduce, Dryad; and 2) Piccolo, Function Flow, etc. based on their maturity. We analyze and compare these typical models by deployment, application, data partition, communication, fault tolerance and so on. Finally, we pay more attention to discussing development of key technologies which are deployment of storage and computation, task partition and fault tolerance in DPPM.","PeriodicalId":264063,"journal":{"name":"2012 7th International Conference on Computer Science & Education (ICCSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Review of data-parallel programming model\",\"authors\":\"K. Hou, Jing Zhang, Junhuai Li\",\"doi\":\"10.1109/ICCSE.2012.6295154\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data-parallel programming model (DPPM for short) specialized for data-intensive computing becomes considerable popular because it simplifies the development of distributed parallel programs. DPPMs are classified into two categories: 1) MapReduce, Dryad; and 2) Piccolo, Function Flow, etc. based on their maturity. We analyze and compare these typical models by deployment, application, data partition, communication, fault tolerance and so on. Finally, we pay more attention to discussing development of key technologies which are deployment of storage and computation, task partition and fault tolerance in DPPM.\",\"PeriodicalId\":264063,\"journal\":{\"name\":\"2012 7th International Conference on Computer Science & Education (ICCSE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 7th International Conference on Computer Science & Education (ICCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSE.2012.6295154\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 7th International Conference on Computer Science & Education (ICCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSE.2012.6295154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
专门用于数据密集型计算的数据并行编程模型(DPPM)由于简化了分布式并行程序的开发而变得相当流行。dppm分为两类:1)MapReduce, Dryad;2) Piccolo, Function Flow等。从部署、应用、数据分区、通信、容错等方面对这些典型模型进行了分析和比较。最后,重点讨论了DPPM中存储与计算部署、任务划分和容错等关键技术的发展。
Data-parallel programming model (DPPM for short) specialized for data-intensive computing becomes considerable popular because it simplifies the development of distributed parallel programs. DPPMs are classified into two categories: 1) MapReduce, Dryad; and 2) Piccolo, Function Flow, etc. based on their maturity. We analyze and compare these typical models by deployment, application, data partition, communication, fault tolerance and so on. Finally, we pay more attention to discussing development of key technologies which are deployment of storage and computation, task partition and fault tolerance in DPPM.