{"title":"Balanced Dense Polynomial Multiplication on Multi-Cores","authors":"M. M. Maza, Yuzhen Xie","doi":"10.1145/1823931.1823942","DOIUrl":null,"url":null,"abstract":"In symbolic computation, polynomial multiplication is a fundamental operation akin to matrix multiplication in numerical computation. We present efficient implementation strategies for FFT-based dense polynomial multiplication targeting multi-cores. We show that {\\it balanced input data} can maximize parallel speedup and minimize cache complexity for bivariate multiplication. However, unbalanced input data, which are common in symbolic computation, are challenging. We provide efficient techniques, what we call {\\it contraction} and {\\it extension}, to reduce multivariate (and univariate) multiplication to {\\it balanced bivariate multiplication}. Our implementation in {\\tt Cilk++} demonstrates good speed upon multi-cores.","PeriodicalId":312929,"journal":{"name":"2009 International Conference on Parallel and Distributed Computing, Applications and Technologies","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Parallel and Distributed Computing, Applications and Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1823931.1823942","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 29
Abstract
In symbolic computation, polynomial multiplication is a fundamental operation akin to matrix multiplication in numerical computation. We present efficient implementation strategies for FFT-based dense polynomial multiplication targeting multi-cores. We show that {\it balanced input data} can maximize parallel speedup and minimize cache complexity for bivariate multiplication. However, unbalanced input data, which are common in symbolic computation, are challenging. We provide efficient techniques, what we call {\it contraction} and {\it extension}, to reduce multivariate (and univariate) multiplication to {\it balanced bivariate multiplication}. Our implementation in {\tt Cilk++} demonstrates good speed upon multi-cores.