{"title":"一种新的O(n log n)调度启发式稀疏矩阵并行分解算法","authors":"R. Telichevesky, P. Agrawal, J. Trotter","doi":"10.1109/ICCD.1991.139985","DOIUrl":null,"url":null,"abstract":"The problem of sparse matrix decomposition using distributed memory multiprocessors is addressed. The data partitioning scheme is simple and is based on equalizing the load among the processors. A new O(n log n) task scheduling heuristic with provably deadlock-free properties is presented. The key idea is the ordering of nodes in a task graph that represents the matrix decomposition steps in a levelized manner, based on a new measure, delta the remaining completion time. The method tends to minimize the idle time of processors by revising the overall decomposition schedule by permitting the execution of tasks within these idle periods. For large sparse matrices, the analysis and simulation results show that a multiprocessor with even a small number of processors will exceed the performance of a supercomputer like the Cray X-MP.<<ETX>>","PeriodicalId":239827,"journal":{"name":"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A new O(n log n) scheduling heuristic for parallel decomposition of sparse matrices\",\"authors\":\"R. Telichevesky, P. Agrawal, J. Trotter\",\"doi\":\"10.1109/ICCD.1991.139985\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The problem of sparse matrix decomposition using distributed memory multiprocessors is addressed. The data partitioning scheme is simple and is based on equalizing the load among the processors. A new O(n log n) task scheduling heuristic with provably deadlock-free properties is presented. The key idea is the ordering of nodes in a task graph that represents the matrix decomposition steps in a levelized manner, based on a new measure, delta the remaining completion time. The method tends to minimize the idle time of processors by revising the overall decomposition schedule by permitting the execution of tasks within these idle periods. For large sparse matrices, the analysis and simulation results show that a multiprocessor with even a small number of processors will exceed the performance of a supercomputer like the Cray X-MP.<<ETX>>\",\"PeriodicalId\":239827,\"journal\":{\"name\":\"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors\",\"volume\":\"111 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1991-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCD.1991.139985\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.1991.139985","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new O(n log n) scheduling heuristic for parallel decomposition of sparse matrices
The problem of sparse matrix decomposition using distributed memory multiprocessors is addressed. The data partitioning scheme is simple and is based on equalizing the load among the processors. A new O(n log n) task scheduling heuristic with provably deadlock-free properties is presented. The key idea is the ordering of nodes in a task graph that represents the matrix decomposition steps in a levelized manner, based on a new measure, delta the remaining completion time. The method tends to minimize the idle time of processors by revising the overall decomposition schedule by permitting the execution of tasks within these idle periods. For large sparse matrices, the analysis and simulation results show that a multiprocessor with even a small number of processors will exceed the performance of a supercomputer like the Cray X-MP.<>