Int. J. High Speed Comput.最新文献

筛选
英文 中文
Simulation of Cycles in the IEH Graph IEH图中周期的模拟
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000168
Jen-Chih Lin
{"title":"Simulation of Cycles in the IEH Graph","authors":"Jen-Chih Lin","doi":"10.1142/S0129053399000168","DOIUrl":"https://doi.org/10.1142/S0129053399000168","url":null,"abstract":"The Incrementally Extensible Hypercube (IEH) is a novel interconnection network derived from the hypercube. Unlike the hypercube, the IEH graph is incrementally extensible, that is, it can be constructed for any number of nodes. In addition, it has optimal fault tolerance and its diameter is logarithmic in the number of nodes and the difference of the maximum and the minimum degree of a node in the graph is (i.e., the graph is almost regular). In this paper, we show that almost the entire IEH graph, except for those with N =2n-1 nodes for all , has a Hamiltonian cycle; if an IEH graph has N=2n-1 nodes then it has only a Hamiltonian path, not cycle. These results enable us to obtain the good embedding of rings and linear arrays into the IEH graph.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130262543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Parallel Neural Learning for Control Problems on a Bus-Based Architecture 基于总线结构的并行神经学习控制问题
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000120
T. Hong, Jyh-Jong Lee
{"title":"Parallel Neural Learning for Control Problems on a Bus-Based Architecture","authors":"T. Hong, Jyh-Jong Lee","doi":"10.1142/S0129053399000120","DOIUrl":"https://doi.org/10.1142/S0129053399000120","url":null,"abstract":"In [6], we distributed training instances over a single-channel broadcast communication model to speed up execution of the back-propagation learning algorithm for classification problems. In this paper, we extend this concept to control problems, where the output is not necessarily 0 or 1, but ranges over an interval. We first propose a modified back-propagation learning algorithm that incrementally decreases the error threshold by half in order to process training instances with large weight changes as quickly as possible. This modified back-propagation learning algorithm is then parallelized using the single-channel broadcast communication model to n processors, where n is the number of training instances. Finally, the parallel back-propagation learning algorithm is modified for execution on a bounded number of processors to cope with real-world conditions.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128494763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Study of Two Self Healing Protocols for ATM Networks ATM网络中两种自愈协议的比较研究
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000119
Vikas Bajaj, A. Sarje
{"title":"A Comparative Study of Two Self Healing Protocols for ATM Networks","authors":"Vikas Bajaj, A. Sarje","doi":"10.1142/S0129053399000119","DOIUrl":"https://doi.org/10.1142/S0129053399000119","url":null,"abstract":"ATM networks are high speed networks and the volume of information carried by them at any instant of time is enormous. Since this information is likely to be of high commercial value, the effect of a transmission link or node failure would be catastrophic. Therefore ATM networks need to be equipped with mechanisms by which the failure of a network entity can be detected and service restored by changing over to alternate entities. Restoration mechanisms based on the self healing concept have been successful in achieving restoration times well below the two seconds limit to prevent any interruption of service to the users. This paper studies two self healing protocols for ATM networks. One is the Backup Virtual Path Self Healing protocol, and the second is the Broadcast Self Healing protocol, which is an extension of the protocol proposed for STM networks. Simulation studies were carried out to compare their restoration speeds, spare resource requirements and maximum restoration ratio. The results suggest that the backup virtual path protocol has an edge over the broadcast protocol.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123335250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparisons of the Parallel Preconditioners on the Cray-T3E for Large Nonsymmetric Linear Systems 大型非对称线性系统Cray-T3E上并行预调节器的比较
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000144
Sangback Ma
{"title":"Comparisons of the Parallel Preconditioners on the Cray-T3E for Large Nonsymmetric Linear Systems","authors":"Sangback Ma","doi":"10.1142/S0129053399000144","DOIUrl":"https://doi.org/10.1142/S0129053399000144","url":null,"abstract":"In this paper we consider five types of parallel preconditioners for solving large sparse nonsymmetric linear systems on the CRAY-T3E. They are ILU(0) in the wavefront ordering, ILU(0) in the multi-coloring ordering, SSOR in the wavefront ordering, the SPAI(SParse Approximate Inverse) preconditioner, and finally Multi-color Block SOR preconditioner. The ILU(0) is known to be robust and the wavefront ordering naturally exploits the parallelism but has a limited speedup due to the nonuniform lengths of the wavefronts. Multi-coloring is an efficient way of introducing the parallelism of order(N), where N is the order of the matrix but the convergence rate often deteriorates. The SPAI type preconditioner is inherently parallel and is gaining popularity. Finally, for the 5-point Laplacian matrix SOR method is known to have a nondeteriorating rate of convergence when the multi-coloring order is adopted. Also, Block SOR is expected to incur less communication overheads in a message-passing machine. Hence, Multi-Color Block SOR method is expected to have a good performance. Experiments were conducted for the Finite Difference discretizations of two problems with various meshsizes varying up to 1024×1024. MPI library was used for interprocess communications. The results show that ILU(0) in the multi-coloring ordering gives the best performance.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"173 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120881089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Simulation Study of Combining Load Value and Address Predictors 负载值与地址预测器相结合的仿真研究
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000156
Toshinori Sato
{"title":"A Simulation Study of Combining Load Value and Address Predictors","authors":"Toshinori Sato","doi":"10.1142/S0129053399000156","DOIUrl":"https://doi.org/10.1142/S0129053399000156","url":null,"abstract":"In this paper, we evaluate a variety of combinations of a load value predictor and a load address predictor. We consider a dynamic hybrid predictor using a predictor selection counter, a static hybrid predictor utilizing execution profiles, and a cooperative predictor. The cooperative predictor is a load value predictor supported by a load address predictor when it is unable to predict a load value. The static hybrid and the cooperative predictors have a benefit that the hardware cost of the selection counter is removed. On the other hand, the dynamic hybrid and the cooperative predictors are free from tedious process of profiling. Based on cycle-by-cycle simulations, we have evaluated the variations and found that the cooperative predictor exploits instruction level parallelism most effectively.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126857198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Constant Propagation in a Hierarchical Intermediate Program Representation 分层中间程序表示中的常数传播
Int. J. High Speed Comput. Pub Date : 1999-06-01 DOI: 10.1142/S0129053399000089
M. Giordano, M. Furnari, Renata Napolitano, Antonio Spagnolo
{"title":"Constant Propagation in a Hierarchical Intermediate Program Representation","authors":"M. Giordano, M. Furnari, Renata Napolitano, Antonio Spagnolo","doi":"10.1142/S0129053399000089","DOIUrl":"https://doi.org/10.1142/S0129053399000089","url":null,"abstract":"A crucial problem in parallelizing compiler design is the choice of a suitable Intermediate Representation (IR) to make parallelism detection and extraction easy and possible. Dependence graphs have long been recognized as useful tools for this aim. Several versions were proposed [1,4,6,20], which encapsulate relations of data dependence, control dependence or both. They were also used in conventional optimizations, but always coupled with the traditional Control Flow Graph (CFG). However, it would be preferable to apply both conventional and parallelizing optimizations to the same IR. Recently, some attempts were made to use dependence graphs as a unifying framework on which all types of optimization [4] are applied. This is one of the aims of our work in this research area. We started working on the well known Hierarchical Task Graph (HTG) of Polychronopoulos and Girkar [6,7,8]. The HTG, by definition, is an acyclic and then a reducible graph. In fact it is built from a CFG arranged hierarchically around loops with all loop back edges removed. This CFG is then enriched with control and data dependence information (edges). Our proposal is an extension of the HTG, named Extended Hierarchical Task Graph (EHTG), that is a HTG where data dependence edges are annotated with a boolean branch path expression indicating the CFG paths through which the data dependences are established. We developed two algorithms of constant propagation (used for dead code elimination) running directly on our EHTG without using the traditional CFG. The two algorithms can be applied only to sequential programs whose parallelism is represented by the HTG structure. They are not suited to perform constant propagation on explicitly parallel programs [12]. Complexity and correctness of the algorithms are analyzed and we also prove that one of them has the same complexity and finds the same class of constants as the well known Wegman & Zadeck constant propagation algorithm [20], that uses a hybrid sparse representation.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126499426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Shared Memory Multiprocessor System for the Recognition of solid Objects 面向实体物体识别的共享内存多处理器系统
Int. J. High Speed Comput. Pub Date : 1999-06-01 DOI: 10.1142/S0129053399000077
M. Yaqub, Q. Shaikh, S. Ahmad
{"title":"A Shared Memory Multiprocessor System for the Recognition of solid Objects","authors":"M. Yaqub, Q. Shaikh, S. Ahmad","doi":"10.1142/S0129053399000077","DOIUrl":"https://doi.org/10.1142/S0129053399000077","url":null,"abstract":"Details of object recognition as applied to a shared memory multiprocessor system is presented, along with the software for image pre-processing; separation of the object from background; and the calculation of the invariant characteristic of the object using the Hu-invariant technique. Partitioning of the sequential algorithms for implementation in the multiprocessor hardware by the determination of precedence and data dependency, is also described.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133553491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diagonal-Implicitly Iterated Runge-Kutta Methods on Distributed Memory Machines 分布式存储机器上的对角隐式迭代龙格-库塔方法
Int. J. High Speed Comput. Pub Date : 1999-06-01 DOI: 10.1142/S0129053399000090
T. Rauber, G. Rünger
{"title":"Diagonal-Implicitly Iterated Runge-Kutta Methods on Distributed Memory Machines","authors":"T. Rauber, G. Rünger","doi":"10.1142/S0129053399000090","DOIUrl":"https://doi.org/10.1142/S0129053399000090","url":null,"abstract":"We consider diagonal-implicitly iterated Runge–Kutta methods which are one-step methods for stiff ordinary differential equations providing embedded solutions for stepsize control. In these methods, algorithmic parallelism is introduced at the expense of additional computations. In this paper, we concentrate on the algorithmic structure of these Runge–Kutta methods and consider several parallel variants of the method exploiting algorithmic and data parallelism in different ways. Our aim is to investigate whether these variants lead to good performance on current distributed memory machines such as the Intel Paragon and the IBM SP2. As test application we use ordinary differential equations with dense and sparse right-hand side functions.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126398890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Vector and Parallel Implementations for the FDTD Analysis of Millimeter Wave Planar Antennas 毫米波平面天线时域有限差分分析的矢量与并行实现
Int. J. High Speed Comput. Pub Date : 1999-06-01 DOI: 10.1142/S0129053399000107
H. Hoteit, R. Sauleau, B. Philippe, P. Coquet, J. Daniel
{"title":"Vector and Parallel Implementations for the FDTD Analysis of Millimeter Wave Planar Antennas","authors":"H. Hoteit, R. Sauleau, B. Philippe, P. Coquet, J. Daniel","doi":"10.1142/S0129053399000107","DOIUrl":"https://doi.org/10.1142/S0129053399000107","url":null,"abstract":"The 3D Finite-Difference Time-Domain (FDTD) method is a powerful numerical technique for directly solving Maxwell's equations. This paper describes its implementation on high speed computers. This technique is used here for the analysis of millimeter wave planar antennas. In our algorithm, Berenger's Perfectly Matched Layers (PML) are implemented as absorbing boundary conditions to mimic free space. Dielectric and metallic losses are taken into account in a recursive and dispersive formulation. We present the main techniques implemented to optimize the non-sequential program on vector computers. Besides, two parallel supercomputers of different architectures as well as a multi-user network of Sun workstations are used to investigate the parallel FDTD code. The performances obtained on vector/distributed memory massively parallel/hybrid computers show that the FDTD algorithm is ideally suited for the implementations on both vector and parallel computers. Comparisons with experimental results in the millimeter wave frequency band validate our codes.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126733675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Dynamic Load Assignment of Real-Time Tasks in Distributed Memory Multiprocessors 分布式内存多处理器实时任务的动态负载分配
Int. J. High Speed Comput. Pub Date : 1999-03-01 DOI: 10.1142/S0129053399000053
Yacine Atif
{"title":"Dynamic Load Assignment of Real-Time Tasks in Distributed Memory Multiprocessors","authors":"Yacine Atif","doi":"10.1142/S0129053399000053","DOIUrl":"https://doi.org/10.1142/S0129053399000053","url":null,"abstract":"In this paper, we consider a scalable distributed-memory architecture for which we propose a problem representation that assigns real-time tasks on the processing units of the architecture to maximize deadline compliance rate. Based on the selected problem representation, we derive an algorithm that dynamically schedules real-time tasks on the processors of the distributed architecture. The algorithm uses a formula to generate the adequate scheduling time so that deadline loss due to scheduling overhead is minimized while deadline compliance rate is being maximized. The technique we propose proved to be correct in the sense that the delivered solutions are not obsolete, i.e., the assigned tasks to working processors are guaranteed to meet their deadlines once executed. The correctness criterion is obtained based on our technique to control the scheduling time. To evaluate the performance of the algorithms that we propose, we provide a number of experiments through a simulation study. We also propose an implementation of our algorithms in the context of scheduling real-time transactions on an Intel-Paragon distributed-memory multiprocessor. The results of the conducted experiments show interesting performance trade-offs among the candidate algorithms.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121843513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信