用于科学分析的干燥linq

2009 Fifth IEEE International Conference on e-Science Pub Date : 2009-12-09 DOI:10.1109/e-Science.2009.53

Jaliya Ekanayake, Thilina Gunarathne, G. Fox, A. S. Balkir, Christophe Poulain, N. Araujo, R. Barga

{"title":"用于科学分析的干燥linq","authors":"Jaliya Ekanayake, Thilina Gunarathne, G. Fox, A. S. Balkir, Christophe Poulain, N. Araujo, R. Barga","doi":"10.1109/e-Science.2009.53","DOIUrl":null,"url":null,"abstract":"Applying high level parallel runtimes to data/compute intensive applications is becoming increasingly common. The simplicity of the MapReduce programming model and the availability of open source MapReduce runtimes such as Hadoop, are attracting more users to the MapReduce programming model. Recently, Microsoft has released DryadLINQ for academic use, allowing users to experience a new programming model and a runtime that is capable of performing large scale data/compute intensive analyses. In this paper, we present our experience in applying DryadLINQ for a series of scientific data analysis applications, identify their mapping to the DryadLINQ programming model, and compare their performances with Hadoop implementations of the same applications.","PeriodicalId":325840,"journal":{"name":"2009 Fifth IEEE International Conference on e-Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":"{\"title\":\"DryadLINQ for Scientific Analyses\",\"authors\":\"Jaliya Ekanayake, Thilina Gunarathne, G. Fox, A. S. Balkir, Christophe Poulain, N. Araujo, R. Barga\",\"doi\":\"10.1109/e-Science.2009.53\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Applying high level parallel runtimes to data/compute intensive applications is becoming increasingly common. The simplicity of the MapReduce programming model and the availability of open source MapReduce runtimes such as Hadoop, are attracting more users to the MapReduce programming model. Recently, Microsoft has released DryadLINQ for academic use, allowing users to experience a new programming model and a runtime that is capable of performing large scale data/compute intensive analyses. In this paper, we present our experience in applying DryadLINQ for a series of scientific data analysis applications, identify their mapping to the DryadLINQ programming model, and compare their performances with Hadoop implementations of the same applications.\",\"PeriodicalId\":325840,\"journal\":{\"name\":\"2009 Fifth IEEE International Conference on e-Science\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"44\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Fifth IEEE International Conference on e-Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/e-Science.2009.53\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Fifth IEEE International Conference on e-Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/e-Science.2009.53","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 44

摘要

将高级并行运行时应用于数据/计算密集型应用程序正变得越来越普遍。MapReduce编程模型的简单性和开源MapReduce运行时(如Hadoop)的可用性正在吸引更多的用户使用MapReduce编程模型。最近，微软发布了学术用途的DryadLINQ，允许用户体验一种新的编程模型和能够执行大规模数据/计算密集型分析的运行时。在本文中，我们介绍了我们在一系列科学数据分析应用中应用DryadLINQ的经验，确定了它们到DryadLINQ编程模型的映射，并将它们的性能与相同应用的Hadoop实现进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

DryadLINQ for Scientific Analyses

Applying high level parallel runtimes to data/compute intensive applications is becoming increasingly common. The simplicity of the MapReduce programming model and the availability of open source MapReduce runtimes such as Hadoop, are attracting more users to the MapReduce programming model. Recently, Microsoft has released DryadLINQ for academic use, allowing users to experience a new programming model and a runtime that is capable of performing large scale data/compute intensive analyses. In this paper, we present our experience in applying DryadLINQ for a series of scientific data analysis applications, identify their mapping to the DryadLINQ programming model, and compare their performances with Hadoop implementations of the same applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2009 Fifth IEEE International Conference on e-Science

自引率

0.00%

发文量