数据密集型广域应用的框架设计

Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556) Pub Date : 2000-05-01 DOI:10.1109/HCW.2000.843737

M. Beynon, T. Kurç, A. Sussman, J. Saltz

{"title":"数据密集型广域应用的框架设计","authors":"M. Beynon, T. Kurç, A. Sussman, J. Saltz","doi":"10.1109/HCW.2000.843737","DOIUrl":null,"url":null,"abstract":"Applications that use collections of very large, distributed datasets have become an increasingly important part of science and engineering. With high performance wide-area networks becoming more pervasive, there is interest in making collective use of distributed computational and data resources. Recent work has converged to the notion of the Grid, which attempts to uniformly present a heterogeneous collection of distributed resources. Current Grid research covers many areas from low level infrastructure issues to high level application concerns. However providing support for efficient exploration and processing of very large scientific datasets stored in distributed archival storage systems remains a challenging research issue. We have initiated an effort that focuses on developing efficient data-intensive applications in a Grid environment. We present a framework, called filter-stream programming, that represents the processing units of a data-intensive application as a set of filters, which are designed to be efficient in their use of memory and scratch space. We describe a prototype infrastructure that supports execution of applications wing the proposed framework. We present the implementation of two applications using the filter-stream programming framework, and discuss experimental results demonstrating the effects of heterogeneous resources on application performance.","PeriodicalId":351836,"journal":{"name":"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"55","resultStr":"{\"title\":\"Design of a framework for data-intensive wide-area applications\",\"authors\":\"M. Beynon, T. Kurç, A. Sussman, J. Saltz\",\"doi\":\"10.1109/HCW.2000.843737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Applications that use collections of very large, distributed datasets have become an increasingly important part of science and engineering. With high performance wide-area networks becoming more pervasive, there is interest in making collective use of distributed computational and data resources. Recent work has converged to the notion of the Grid, which attempts to uniformly present a heterogeneous collection of distributed resources. Current Grid research covers many areas from low level infrastructure issues to high level application concerns. However providing support for efficient exploration and processing of very large scientific datasets stored in distributed archival storage systems remains a challenging research issue. We have initiated an effort that focuses on developing efficient data-intensive applications in a Grid environment. We present a framework, called filter-stream programming, that represents the processing units of a data-intensive application as a set of filters, which are designed to be efficient in their use of memory and scratch space. We describe a prototype infrastructure that supports execution of applications wing the proposed framework. We present the implementation of two applications using the filter-stream programming framework, and discuss experimental results demonstrating the effects of heterogeneous resources on application performance.\",\"PeriodicalId\":351836,\"journal\":{\"name\":\"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"55\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HCW.2000.843737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HCW.2000.843737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 55

摘要

使用非常大的分布式数据集集合的应用程序已经成为科学和工程中越来越重要的一部分。随着高性能广域网变得越来越普遍，人们对集体使用分布式计算和数据资源很感兴趣。最近的工作已经融合到网格的概念，它试图统一地呈现分布式资源的异构集合。当前的网格研究涵盖了从底层基础设施问题到高层应用问题的许多领域。然而，如何对存储在分布式档案存储系统中的大型科学数据集进行有效的探索和处理仍然是一个具有挑战性的研究问题。我们已经开始致力于在网格环境中开发高效的数据密集型应用程序。我们提出了一个框架，称为过滤器流编程，它将数据密集型应用程序的处理单元表示为一组过滤器，这些过滤器旨在有效地使用内存和刮擦空间。我们描述了一个原型基础设施，该基础设施支持在提议的框架中执行应用程序。我们介绍了使用过滤器流编程框架的两个应用程序的实现，并讨论了证明异构资源对应用程序性能影响的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Design of a framework for data-intensive wide-area applications

Applications that use collections of very large, distributed datasets have become an increasingly important part of science and engineering. With high performance wide-area networks becoming more pervasive, there is interest in making collective use of distributed computational and data resources. Recent work has converged to the notion of the Grid, which attempts to uniformly present a heterogeneous collection of distributed resources. Current Grid research covers many areas from low level infrastructure issues to high level application concerns. However providing support for efficient exploration and processing of very large scientific datasets stored in distributed archival storage systems remains a challenging research issue. We have initiated an effort that focuses on developing efficient data-intensive applications in a Grid environment. We present a framework, called filter-stream programming, that represents the processing units of a data-intensive application as a set of filters, which are designed to be efficient in their use of memory and scratch space. We describe a prototype infrastructure that supports execution of applications wing the proposed framework. We present the implementation of two applications using the filter-stream programming framework, and discuss experimental results demonstrating the effects of heterogeneous resources on application performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)

自引率

0.00%

发文量