{"title":"A new O(n log n) scheduling heuristic for parallel decomposition of sparse matrices","authors":"R. Telichevesky, P. Agrawal, J. Trotter","doi":"10.1109/ICCD.1991.139985","DOIUrl":null,"url":null,"abstract":"The problem of sparse matrix decomposition using distributed memory multiprocessors is addressed. The data partitioning scheme is simple and is based on equalizing the load among the processors. A new O(n log n) task scheduling heuristic with provably deadlock-free properties is presented. The key idea is the ordering of nodes in a task graph that represents the matrix decomposition steps in a levelized manner, based on a new measure, delta the remaining completion time. The method tends to minimize the idle time of processors by revising the overall decomposition schedule by permitting the execution of tasks within these idle periods. For large sparse matrices, the analysis and simulation results show that a multiprocessor with even a small number of processors will exceed the performance of a supercomputer like the Cray X-MP.<<ETX>>","PeriodicalId":239827,"journal":{"name":"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.1991.139985","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The problem of sparse matrix decomposition using distributed memory multiprocessors is addressed. The data partitioning scheme is simple and is based on equalizing the load among the processors. A new O(n log n) task scheduling heuristic with provably deadlock-free properties is presented. The key idea is the ordering of nodes in a task graph that represents the matrix decomposition steps in a levelized manner, based on a new measure, delta the remaining completion time. The method tends to minimize the idle time of processors by revising the overall decomposition schedule by permitting the execution of tasks within these idle periods. For large sparse matrices, the analysis and simulation results show that a multiprocessor with even a small number of processors will exceed the performance of a supercomputer like the Cray X-MP.<>