Using Hector to run MPI programs over networked workstations
S. Russ, Jonathan Robinson, M. Gleeson, B. Meyers, L. Rajagopalan, Chun-Heong Tan
Concurr. Pract. Exp., published 1999-04-10
DOI: https://doi.org/10.1002/(SICI)1096-9128(19990410)11:4%3C189::AID-CPE381%3E3.0.CO;2-H
Citation count: 6
Abstract
Networked workstations represent an increasingly popular distributed platform for running large parallel programs. They can present a low-cost alternative to purchasing supercomputer time or additional usable computational capability. Several capabilities are desirable in order to harness workstations, including support for a widely accepted parallel programming environment, task migration, intelligent resource allocation, fault tolerance, and totally transparent support of these features. The Hector system is designed to provide these capabilities to MPI programs. The structure of the system and experiences using the system on loaded workstations to run scientific codes are described. Copyright 1999 John Wiley & Sons, Ltd.