Int. J. High Speed Comput.最新文献

筛选
英文 中文
Performance Analysis of k-Ary n-Cube Networks with Pipelined Circuit Switching 基于流水线电路交换的k-Ary n-Cube网络性能分析
Int. J. High Speed Comput. Pub Date : 2000-06-01 DOI: 10.1142/S0129053300000102
G. Min, H. Sarbazi-Azad, M. Ould-Khaoua
{"title":"Performance Analysis of k-Ary n-Cube Networks with Pipelined Circuit Switching","authors":"G. Min, H. Sarbazi-Azad, M. Ould-Khaoua","doi":"10.1142/S0129053300000102","DOIUrl":"https://doi.org/10.1142/S0129053300000102","url":null,"abstract":"Several existing studies have revealed that pipelined circuit switching (or PCS for short) can provide superior performance characteristics over wormhole switching. This paper proposes a new analytical model for PCS in high-radix high-dimensional k-ary n-cubes augmented with virtual channel support. The model uses Random Walk Theory to analyse the backtracking actions of the message header during the path set-up phase, and M/G/1 queueing systems to compute the mean waiting time that a message experiences at a source node before entering the network. Results from simulation experiments confirm that the proposed model exhibits a good degree of accuracy for various network sizes and under different operating conditions.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128212177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Overhead Reduction in the Network Communication for Web Computing Web计算中网络通信开销的降低
Int. J. High Speed Comput. Pub Date : 2000-06-01 DOI: 10.1142/S0129053300000114
Juil Oh, Ju-wook Jang, T. Han
{"title":"Overhead Reduction in the Network Communication for Web Computing","authors":"Juil Oh, Ju-wook Jang, T. Han","doi":"10.1142/S0129053300000114","DOIUrl":"https://doi.org/10.1142/S0129053300000114","url":null,"abstract":"One major source of overhead for conventional network communication is the involvement of operating system to allow for virtual sharing of the network interface card among applications and its processing of the UDP/TCP/IP headers on behalf of applications. The involvement of operating system results in overheads such as extra write/read operations to the kernel buffer through which each inbound/outbound messages should pass. Another overhead comes from scheduling which may hinder immdiate sending or receiving of messages to and from network interface buffer. Recently, U-Net over Fast Ethernet [9] addressed this problem and proposed a new scheme which circumvents the operating system in processing the network communication. However, since the approach has removed IP header and provides multiplexing in the ethernet layer, it is only applicable to an ethernet LAN. In this paper, we define a new protocol by which messages are allowed to move across routers and the virtual sharing of network interface card is supported with minimal overhead. In a word, we extended the U-Net over Fast Ethernet [9] beyond routers to WAN and hence opening a new way to exploit vast amount of computing resources all over Internet. With our protocol, cluster computing over any part of WAN is realizable as far as total delay caused by intervening routers is tolerable to the application. As a justification of our approach, we show a small part of WAN containing a router exhibits latency comparable to (differ by less than 1 ms) that of a shared ethernet. Another motivation for our approach is the observation that the aggregate bandwidth of two subnets (part of WAN) is greater than that of a single subnet (LAN). With same number of hosts, each connection will experience more available bandwidth with the part of WAN than the LAN. The experiment over WAN involving a router shows reduction of latency comparable to that of U-Net over Fast Ethernet.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121998874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallel Performance of Fast Wavelet Transforms 快速小波变换的并行性能
Int. J. High Speed Comput. Pub Date : 2000-03-01 DOI: 10.1142/S0129053300000059
O. M. Nielsen, M. Hegland
{"title":"Parallel Performance of Fast Wavelet Transforms","authors":"O. M. Nielsen, M. Hegland","doi":"10.1142/S0129053300000059","DOIUrl":"https://doi.org/10.1142/S0129053300000059","url":null,"abstract":"We present a parallel 2D wavelet transform algorithm with modest communication requirements. Data are transmitted between nearest neighbors only and the amount is independent of the problem size as well as the number of processors. An analysis of the theoretical performance shows that the algorithm is scalable approaching perfect speedup as the problem size is increased. This performance is realized in practice on the IBM SP2 as well as on the Fujitsu VPP300 where it will form part of the Scientific Software Library.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121908774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 41
Parallel Spectral Division Using the Matrix Sign Function for the Generalized Eigenproblem 广义特征问题的矩阵符号函数并行谱划分
Int. J. High Speed Comput. Pub Date : 2000-03-01 DOI: 10.1142/S0129053300000084
S. Huss-Lederman, E. S. Quintana‐Ortí, Xiaobai Sun, Yuan-Jye J. Wu
{"title":"Parallel Spectral Division Using the Matrix Sign Function for the Generalized Eigenproblem","authors":"S. Huss-Lederman, E. S. Quintana‐Ortí, Xiaobai Sun, Yuan-Jye J. Wu","doi":"10.1142/S0129053300000084","DOIUrl":"https://doi.org/10.1142/S0129053300000084","url":null,"abstract":"In this paper we demonstrate the parallelism of the spectral division using the matrix sign function for the generalized nonsymmetric eigenproblem. We employ the so–called generalized Newton iterative scheme in order to compute the sign function of the matrix pair. A recent study showed a considerable reduction (by 75%) in the computational cost of this iteration, making this approach competitive when compared to the traditional QZ algorithm. The experimental results on an IBM SP3 multicomputer report the parallel performance (efficiency around 60–80%) and scalability of this approach.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133472923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
The Vector Multiprocessor 矢量多处理器
Int. J. High Speed Comput. Pub Date : 2000-03-01 DOI: 10.1142/S0129053300000060
P. Swarztrauber
{"title":"The Vector Multiprocessor","authors":"P. Swarztrauber","doi":"10.1142/S0129053300000060","DOIUrl":"https://doi.org/10.1142/S0129053300000060","url":null,"abstract":"The Vector Multiprocessor brings to the multiprocessor what vectorization brought to the single processor. In addition to the usual complement of logic and arithmetic units, each processor contains a programmable communication unit with registers that communicate directly with comparable registers in neighboring processors via an n-dimensional interconnection network. Interprocessor communication tasks are performed to and from these registers in the same way that computational tasks are performed on a vector uniprocessor. Communication is shown to be optimal for a large class of communication tasks. Elements are transmitted, in parallel, to their destination processors at an average rate of one per communication cycle. This result, called O(1) access, is used to develop a balanced communication system where local and global access are comparable. It is also used to support the \"vector parallel paradigm\" where all arrays are uniformly distributed and the user interface \"looks\" like a vector uniprocessor interface. Both coarse- and fine-grain performance models are provided, which demonstrate the unexpected result that communication is asymptotically negligible compared to computational time. Finally, three performance models are presented for the spherical harmonic transform, which is the most communication-intensive part of climate model dynamics.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"613 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123340108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Path Selection for Real-Time Communication in Wormhole Networks 虫洞网络中实时通信的路径选择
Int. J. High Speed Comput. Pub Date : 1999-12-01 DOI: 10.1142/S012905339900017X
Kyungwan Nam, Sunggu Lee, Jong Kim
{"title":"Path Selection for Real-Time Communication in Wormhole Networks","authors":"Kyungwan Nam, Sunggu Lee, Jong Kim","doi":"10.1142/S012905339900017X","DOIUrl":"https://doi.org/10.1142/S012905339900017X","url":null,"abstract":"For real-time communication, we must be able to guarantee timely delivery of messages. In a previous paper, Kim et al. presented a real-time communication method for networks which uses a determini...","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"233 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116861231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Balanced Generalized Hypercubes: Complexity and Cost/Performance Analysis 平衡广义超立方体:复杂性和成本/性能分析
Int. J. High Speed Comput. Pub Date : 1999-12-01 DOI: 10.1142/S0129053399000193
L. Lin
{"title":"Balanced Generalized Hypercubes: Complexity and Cost/Performance Analysis","authors":"L. Lin","doi":"10.1142/S0129053399000193","DOIUrl":"https://doi.org/10.1142/S0129053399000193","url":null,"abstract":"The BGHC is a generalized hypercube that has exactly w nodes along each of the d dimensions for a total of wd nodes. A BGHC is said to be maximal if the w nodes along each dimension form a complete directed graph. A BGHC is said to be minimal if the w nodes along each dimension form a unidirectional ring. Lower bound complexities are derived for three intensive communication patterns assuming the balanced generalized hypercube (BGHC) topology. A maximal N node BGHC with a node degree equal to αlog2N, where α≥2, can process certain intensive communication patterns α(α-1) times faster than an N node binary hypercube (which has a node degree equal to log2N). On the other hand, a minimal N node BGHC with a node degree equal to , where β≥2, is 2β times slower at processing certain intensive communication patterns than an N node binary hypercube. For certain communication patterns, increasing one unit cost gains a normalized speedup to the binary hypercube by wlog2w times for the maximal BGHC. For the minimal BGHC, reducing one unit cost gains times speedup normalized to the binary hypercube.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126896259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallel Implementation of Alternate Quadrant Interlocking Factorisation Method on Star Topology 星型拓扑上交替象限联锁分解方法的并行实现
Int. J. High Speed Comput. Pub Date : 1999-12-01 DOI: 10.1142/S0129053399000181
B. Prameela, L. Patnaik
{"title":"Parallel Implementation of Alternate Quadrant Interlocking Factorisation Method on Star Topology","authors":"B. Prameela, L. Patnaik","doi":"10.1142/S0129053399000181","DOIUrl":"https://doi.org/10.1142/S0129053399000181","url":null,"abstract":"This paper discusses the parallel implementation of the solution of a set of linear equations using the Alternative Quadrant Interlocking Factorisation Methods (AQIF), on a star topology. Both the AQIF and LU decomposition methods are mapped onto star topology on an IBM SP2 system, with MPI as the internode communicator. Performance parameters such as speedup, efficiency have been obtained through experimental and theoretical means. The studies demonstrate (i) a mismatch of 15% between the theoretical and experimental results, (ii) scalability of the AQIF algorithm, and (iii) faster executing AQIF algorithm.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124413415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balanced Generalized Hypercubes: Optimal Communication Algorithms 平衡广义超立方体:最优通信算法
Int. J. High Speed Comput. Pub Date : 1999-12-01 DOI: 10.1142/S012905339900020X
L. Lin
{"title":"Balanced Generalized Hypercubes: Optimal Communication Algorithms","authors":"L. Lin","doi":"10.1142/S012905339900020X","DOIUrl":"https://doi.org/10.1142/S012905339900020X","url":null,"abstract":"In this paper three communication algorithms are proposed for two types of generalized hypercube multiprocessor. The algorithms are intended to solve three intensive communication problems: complete broadcast, single-node scatter and total exchange. The algorithms achieve both the time and transmission complexity bounds for the three problems on the balanced generalized hypercube (BGHC). The BGHC is a wd-node network with w nodes along each of the d dimensions. These communication algorithms are performed based on a balanced spanning tree, called a compatible tree, which can be used to solve any of the tree problems. Several theoretical results related to the compatible tree and then the sufficient and necessary condition for concurrent transmissions are presented. The concurrent condition ensures the maximum use of network bandwidth so that the optimal bounds are achieved. It is shown that the proposed scheduling algorithms achieve the optimal bounds for any w and d.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125020408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Lempel-Ziv Algorithm for Sequential Data Compression 一种改进的序列数据压缩Lempel-Ziv算法
Int. J. High Speed Comput. Pub Date : 1999-09-01 DOI: 10.1142/S0129053399000132
S. K. Jena
{"title":"An Improved Lempel-Ziv Algorithm for Sequential Data Compression","authors":"S. K. Jena","doi":"10.1142/S0129053399000132","DOIUrl":"https://doi.org/10.1142/S0129053399000132","url":null,"abstract":"The importance of data compression has been increasing of late due to the increasing demand put on the networks by applications consuming high amount of network bandwidth. Thus compressing data to at least a particular level becomes mandatory for applications to avoid congestion in the network. Recently there has been an interest in increasing the capacity of storage systems, which employ lossless data compression. In this paper, we have proposed two improved algorithms for data compression. These algorithms are tested and compared with best known existing algorithms of data compression. It has been observed that compression rate of our proposed algorithms reaches up to 99% for huge files like swap files.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126568646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信