{"title":"Virtual Permanent Connection: network computing over switch-based high-speed networks","authors":"Mengjou Lin, D. Du","doi":"10.1109/ICPADS.1994.590410","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590410","url":null,"abstract":"Recent progress in switch-based high-speed local area networks (LANs) makes distributed network computing promising. Three evolving switch-based high-speed networks are the High Performance Parallel Interface (HIPPI), Fiber Channel (FC), and Asynchronous Transfer Mode (ATM) standards. We study how high performance computing can be carried out over such networks. High performance computing can be characterized as follows: it includes multiple modules, each executed on a processor; its communication data flow forms a special application topology, and such topologies are usually regular; and it requires frequent communication between adjacent modules in the application topology. In order to reduce the amount of time required for a processor to set up a connection during the execution of an application, we propose a new communication protocol called the Virtual Permanent Connection (VPC). For a given application topology, a set of connections is set up and permanently maintained during the execution of the application. Communication between processors is via this group of connections. We study how a set of VPCs is chosen based on a given application topology (this process is called application topology embedding).","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127776053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A theory of fault-tolerant routing in wormhole networks","authors":"J. Duato","doi":"10.1109/ICPADS.1994.590404","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590404","url":null,"abstract":"Fault-tolerant systems aim at providing continuous operations in the presence of faults. Multicomputers rely on an interconnection network between processors to support the message-passing mechanism. Therefore, the reliability of the interconnection network is very important for the reliability of the whole system. This paper analyzes the effective redundancy available in a wormhole network by combining connectivity and deadlock freedom. Redundancy is defined at the channel level. We propose a sufficient condition for channel redundancy, also computing the set of redundant channels. The redundancy level of the network is also defined, proposing a theorem that supplies its value. This theory is developed on top of our necessary and sufficient condition for deadlock-free adaptive routing. Finally, a fault-tolerant routing algorithm for n-dimensional meshes is proposed.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128029438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lock-free concurrent tree structures for multiprocessor systems","authors":"Jyh-Jong Tsay, Hsin-Chi Li","doi":"10.1109/ICPADS.1994.590370","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590370","url":null,"abstract":"This paper presents a window-based approach to designing lock-free concurrent implementations for a class of top-down tree structures that support operations whose executions can be modeled as a process of moving a window along a rooted simple path. Our approach can be implemented on multiprocessor systems that support the load-linked, store-conditional and check-valid synchronization primitives, which are supported in MIPS-II and DEC Alpha architectures. Our approach achieves a high degree of concurrency, requires low coordination overhead, and is wait-free and fault tolerant. Simulation shows that our approach is efficient.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114530592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new and efficient FFT algorithm for distributed memory systems","authors":"N. Anupindi, M. An, J. Cooley, Q. Yang","doi":"10.1109/ICPADS.1994.590059","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590059","url":null,"abstract":"This paper presents a new and optimal parallel implementation of the multidimensional fast Fourier transform algorithm on distributed memory multiprocessors. Its optimality is obtained by minimizing the number of message passings necessary, at the cost of an increase in message length. This distinctive feature of the new algorithm effectively utilizes the important architectural property of most of today's distributed memory multiprocessors: wormhole routing for interprocessor communications. By using the algebra of stride permutations and tensor products as a mathematical tool, we are able to derive and formulate an efficient data partition and communication scheme that reduces communication cost from O(N^2) required for the best known FFT to O(N) on an N^2-processor machine. Our data partition scheme is natural and efficient for solving discretized boundary value problems such as partial differential equations and finite element analysis. To evaluate the actual performance of our new algorithm in comparison with other existing parallel FFT algorithms, we have carried out implementation experiments on Intel's Touchstone Delta machine.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113956983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Probabilistic timed protocol verification for the extended state transition model","authors":"Chung-Ming Huang, Shiun-Wei Lee, Jenq-Muh Hsu","doi":"10.1109/ICPADS.1994.590352","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590352","url":null,"abstract":"We propose a Timed Communicating State Machine (TCSM), which belongs to the extended state transition model, to formally specify protocols that incorporate timed properties as part of their specifications. Based on the TCSM model we also propose (1) a timed global state reachability analysis that takes time bounds and predicates into consideration, and (2) a probabilistic timed verification scheme that is based on the occurrence rates of communicating entities' transitions and the occurrence probabilities of channel entities' transitions. In this way, probability-based partial timed verification can be achieved for extended-state-transition-specified timed protocols.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116459564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Achieving dependability in mission-critical operating systems through adaptability and large-scale functional integration","authors":"H. Wedde, J. Lind, A. Eiss","doi":"10.1109/ICPADS.1994.590420","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590420","url":null,"abstract":"As part of the DRAGON SLAYER project, the adaptive and reliable distributed file system MELODY has emerged for supporting hard real-time applications in unpredictable environments. In MELODY, the time criticality of tasks and their sensitivity with respect to the latest file information are explicitly used for novel, flexible task scheduling algorithms and file replication management policies, featuring dynamic file replication and relocation of file copies as well as offering file versions of varying recency. The achieved adaptability far outweighs the additional overhead in comparison to simpler models, and enhances both reliability and real-time responsiveness for file access. Both the development of the model and the experimental analysis at the various stages were done in an incremental manner. This was necessary in order to cope with the complexity of the problems resulting from conflicting requirements and dynamic trade-offs (e.g. real-time responsiveness vs. reliability). As the next incremental model extension, and as the major contribution of this paper, a series of integration policies are developed for task and resource scheduling, after redefining the role and order of task and resource scheduling: the periodic and dynamic models (and combinations thereof) are used to invoke the task scheduler for a fixed interval of time. All of these policies compare very convincingly against the \"classical\" model where the task scheduler only schedules tasks after their resources have been allocated. The results are discussed.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116344807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient algorithms for data distribution on distributed memory multicomputers","authors":"Peizong Lee","doi":"10.1109/ICPADS.1994.590385","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590385","url":null,"abstract":"Data distribution has been one of the most important research topics in parallelizing compilers for distributed memory parallel computers. In this paper, we show that data re-distribution is necessary for executing a sequence of Do-loops if the communication cost of performing this sequence of Do-loops is larger than a threshold value. Based on this observation, we propose efficient algorithms which can determine effective data distribution schema for executing a sequence of Do-loops with a general structure. Our result contributes towards automatic compilation of sequential programs into message-passing programs that run on distributed memory parallel computers.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127620667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Designing general-purpose fault-tolerant distributed systems-a layered approach","authors":"A. Nayak, W. Jone, Sunil R. Das","doi":"10.1109/ICPADS.1994.590327","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590327","url":null,"abstract":"General-purpose distributed systems comprised of computing nodes with different characteristics and connected by high-speed communication networks are very popular these days. The development of a dependable distributed system, however, necessitates the use of various techniques including fault tolerance to avert occurrences of failures or system malfunction. The ad hoc techniques of adding redundancy to improve reliability are not always suitable in these circumstances because of excessive design cost. Redundancies have to be allocated at various hardware and software levels in order to optimize their utilization in the system. This paper considers the design of general-purpose fault-tolerant distributed systems based on a layered approach. The benefits of the layered approach in the process of allocation of redundancy and fault tolerance at various system levels are presented and analyzed in the paper.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"569 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130379349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance analysis of distributed client-server message queuing","authors":"E. Zimran, M. Rosen","doi":"10.1109/ICPADS.1994.590372","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590372","url":null,"abstract":"Queuing is an efficient programming paradigm for centralized and distributed systems. It enables multiple programs to share and control the flow of data and work objects, and to establish transparent and asynchronous communication among non-coherent software products, platforms or applications. This paper presents the performance analysis and specifications of a reliable, distributed and high performance queuing facility, which is capable of delivering thousands of enqueue and dequeue operations per second on local or distributed systems.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122346769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallel algorithms for verification and sensitivity analysis of minimum spanning trees","authors":"W. Liang, Xiaojun Shen, Qing Hu","doi":"10.1109/ICPADS.1994.590315","DOIUrl":"https://doi.org/10.1109/ICPADS.1994.590315","url":null,"abstract":"To verify whether a spanning tree T(V,E) of graph G(V,E') is a minimum spanning tree, two parallel algorithms are presented. The first algorithm requires O(log n) time and O(max{m/log n, n^{3/2}/log n}) processors, where |E'|=m and |V|=n. The second algorithm requires O(log n) time and O(m) processors or O(log n log log n) time and O(max{m/log n, n}) processors. The first algorithm is optimal when G is dense, compared with its O(m) time sequential version. The second algorithm has better performance when G is sparse. Using the above results, we also present an efficient algorithm for sensitivity analysis of minimum spanning trees which requires O(log n) time and O(max{m, n^2/log n}) processors. All proposed algorithms in this paper are based on the parallel computational model called CREW PRAM.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123930608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}