An experimental evaluation of a peer-model monitoring system for the support of a parallel processing environment

José Magalhães Cruz, João Falcão e Cunha
Computing Systems in Engineering, vol. 6, no. 4, pp. 331-343. Journal article, published 1995-08-01.
DOI: 10.1016/0956-0521(95)00043-7
https://www.sciencedirect.com/science/article/pii/0956052195000437

Abstract

Monitoring the machines (computing nodes) in a network, and the communication traffic between them, is essential for efficiently launching and executing coarse-grained parallel applications, or even classical (serial) applications, on machines in the network that are not heavily used. This paper presents MONSYS, an experimental software system capable of monitoring the machines in a network. MONSYS can support an Application Manager system that distributes parallel tasks (or classical programs) over the machines in a local area network with the objective of achieving load balancing. It can also be used as a network administration tool. MONSYS has a highly decentralized and fault-tolerant architecture based on the Peer-Model, which, together with its information diffusion algorithms, constitutes its prime novelty. A set of experiments investigating the performance and scalability of a MONSYS prototype is presented and discussed. The experiments show that MONSYS offers a reasonably accurate picture of the internal state of the monitored machines without burdening the network communication channels or the machines themselves. In fact, the quantitative results indicate that MONSYS can perform several times better than an equivalent system that uses a multicast communication scheme to exchange machine state information.
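
The abstract does not spell out how MONSYS diffuses state information, so the following is only an illustrative sketch of the general idea of peer-to-peer state diffusion, not the authors' algorithm. It assumes a hypothetical ring of peers, each forwarding its current view of machine loads to one neighbour per round; the names `Peer`, `diffuse_round`, and `least_loaded` are invented for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Peer:
    """One monitored machine holding a (possibly stale) view of all peers."""
    name: str
    load: float
    # Maps machine name -> (load, version) as last heard by this peer.
    view: dict = field(default_factory=dict)
    version: int = 0

    def sample(self, load: float) -> None:
        """Record a fresh local load measurement in this peer's own view."""
        self.load = load
        self.version += 1
        self.view[self.name] = (load, self.version)

    def merge(self, other_view: dict) -> None:
        """Keep, for every machine, the entry with the highest version."""
        for name, (load, ver) in other_view.items():
            if name not in self.view or self.view[name][1] < ver:
                self.view[name] = (load, ver)


def diffuse_round(peers: list) -> int:
    """Each peer sends its view to the next peer in a ring (assumed topology).

    Returns the number of messages sent in this round.
    """
    n = len(peers)
    # Snapshot first so that all messages in a round carry pre-round state.
    snapshots = [dict(p.view) for p in peers]
    for i, snap in enumerate(snapshots):
        peers[(i + 1) % n].merge(snap)
    return n


def least_loaded(peer: Peer) -> str:
    """Pick the machine with the lowest load according to this peer's view."""
    return min(peer.view, key=lambda name: peer.view[name][0])


if __name__ == "__main__":
    peers = [Peer("a", 0.9), Peer("b", 0.2), Peer("c", 0.5), Peer("d", 0.7)]
    for p in peers:
        p.sample(p.load)
    # With n peers in a ring, n - 1 rounds spread every entry to every peer.
    for _ in range(len(peers) - 1):
        diffuse_round(peers)
    print({p.name: least_loaded(p) for p in peers})
```

In this sketch any peer can answer a "which machine is least loaded?" query from its local view, which is the kind of information an Application Manager would need for load balancing. The ring topology and versioning scheme are assumptions chosen for brevity; the paper's actual diffusion algorithms may differ.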
