{"title":"Implementation of a portable parallelizing compiler with loop partition","authors":"M.-C. Hsiao, S. Tseng, Chao-Tung Yang, C.-S. Chen","doi":"10.1109/ICPADS.1994.590318","DOIUrl":null,"url":null,"abstract":"We have implemented a portable FORTRAN parallelizing compiler with loop partition on our experimental target system, Acer Altos 10000, running OSF/1 operating system. We have defined a minimal set of thread-related functions and data types, called B Threads, that is required to support the execution of this parallelizing compiler. Our compiler is highly modularized so that the porting to other platforms will be very easy, and it can partition parallel loops into multithreaded codes based on several loop partition algorithms. We have also proposed a general model of parallel compilers, which is an extension from previous model and is useful in constructing a parallelizing compiler for a particular language. The experimental results show that the best speedups are 3.75, 3.46, and 3.81 for matrix multiplication, adjoint convolution, and increasing workload sample, respectively, when the number of processors is four. It has been shown that this approach works and the experimental results are satisfied.","PeriodicalId":154429,"journal":{"name":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 1994 International Conference on Parallel and Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPADS.1994.590318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
We have implemented a portable FORTRAN parallelizing compiler with loop partition on our experimental target system, Acer Altos 10000, running OSF/1 operating system. We have defined a minimal set of thread-related functions and data types, called B Threads, that is required to support the execution of this parallelizing compiler. Our compiler is highly modularized so that the porting to other platforms will be very easy, and it can partition parallel loops into multithreaded codes based on several loop partition algorithms. We have also proposed a general model of parallel compilers, which is an extension from previous model and is useful in constructing a parallelizing compiler for a particular language. The experimental results show that the best speedups are 3.75, 3.46, and 3.81 for matrix multiplication, adjoint convolution, and increasing workload sample, respectively, when the number of processors is four. It has been shown that this approach works and the experimental results are satisfied.