Proceedings of the 2015 International Workshop on Parallel Symbolic Computation: Latest Articles

Cache oblivious sparse polynomial factoring using the funnel heap

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790283

Fatima K. Abu Salem, Khalil El-Harake, Karl Gemayel

Abstract: In [2] we demonstrated that using a max priority queue to overlap the sums of products arising in the Hensel lifting phase of the polytope factoring method reduces expression swell and achieves asymptotic reductions in that phase. In this paper, we propose implementing the priority queue as a Funnel Heap when polynomials are in sparse distributed representation. Funnel Heap is a cache-oblivious priority queue with optimal cache complexity, and we additionally tailor several of its features to the polynomial arithmetic required. Funnel Heap is able to identify equal-order monomials "for free" while it reorganises itself over sufficiently many updates. We adopt a batched mode for chaining equal-order monomials that is overlapped with Funnel Heap's mechanism for emptying its in-core components. We also develop a customised performance analysis that captures the overhead due to chaining in terms of the fraction of reduction and replication observed in the queue, and find that batched chaining is sensitive to the number of distinct monomials residing in the queue, as opposed to the number of replicas chained. For sufficiently large input size with respect to the cache-line length, batched chaining that is "search free" leads to an implementation of Hensel lifting that exhibits optimal cache complexity in the number of replicas found in the queue. Additionally, we obtain an order-of-magnitude reduction in space, as well as a reduction in the logarithmic factor in work and cache complexity, when comparing our adaptation against [2]. The resulting Hensel lifting process is also cache-oblivious. Our benchmarks of the polytope method using Funnel Heap with chaining demonstrate dramatic improvements over the regular binary heap as well as MAGMA, where the latter fails to process sufficiently high-degree but sparse polynomial factorisations.
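The idea of merging sums of products through a max priority queue, and combining equal-order monomials as they surface, can be illustrated with a minimal sketch. This is not the paper's method: it assumes univariate polynomials, uses Python's ordinary binary heap (`heapq`) rather than a Funnel Heap, and the function name `sparse_mul` and the `(exponent, coefficient)` list representation are hypothetical choices made for illustration only.

```python
import heapq

def sparse_mul(f, g):
    """Multiply two sparse univariate polynomials.

    Each polynomial is a list of (exponent, coefficient) pairs sorted by
    strictly decreasing exponent (an assumed representation).
    """
    if not f or not g:
        return []
    # One heap entry per term of f; (neg_exp, i, j) stands for f[i] * g[j].
    # heapq is a min-heap, so exponents are negated to pop the largest first.
    heap = [(-(f[i][0] + g[0][0]), i, 0) for i in range(len(f))]
    heapq.heapify(heap)
    result = []
    while heap:
        neg_exp, i, j = heapq.heappop(heap)
        coeff = f[i][1] * g[j][1]
        # Queue the next product from the same row of the product table.
        if j + 1 < len(g):
            heapq.heappush(heap, (-(f[i][0] + g[j + 1][0]), i, j + 1))
        # Combine every queued product that has the same exponent.
        while heap and heap[0][0] == neg_exp:
            _, i2, j2 = heapq.heappop(heap)
            coeff += f[i2][1] * g[j2][1]
            if j2 + 1 < len(g):
                heapq.heappush(heap, (-(f[i2][0] + g[j2 + 1][0]), i2, j2 + 1))
        # Terms that cancel are never written to the output.
        if coeff != 0:
            result.append((-neg_exp, coeff))
    return result
```

For example, `sparse_mul([(1, 1), (0, 1)], [(1, 1), (0, -1)])` multiplies x + 1 by x - 1; the two degree-1 products meet at the top of the heap and cancel before any intermediate term is stored, which is the kind of expression-swell reduction the abstract refers to.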

Citations: 1

A compact parallel implementation of F4

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790293

M. Monagan, Roman Pearce

Citations: 5

Direct solution of the (11,9,8)-MinRank problem by the block Wiedemann algorithm in Magma with a Tesla GPU

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2791392

A. Steel

Citations: 5

Parallel algebraic linear algebra dedicated interface

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790286

T. Gautier, Jean-Louis Roch, Ziad Sultan, Bastien Vialla

Citations: 0

A hybrid symbolic-numeric approach to exceptional sets of generically zero-dimensional systems

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790288

J. Hauenstein, Alan C. Liddell

Citations: 2

A parallel implementation for polynomial multiplication modulo a prime

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790291

M. Law, M. Monagan

Citations: 5

Parallel sparse multivariate polynomial division

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790285

M. Gastineau, J. Laskar

Citations: 10

High performance implementation of the inverse TFT

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790292

Lingchuan Meng, Jeremy R. Johnson

Citations: 5

Optimizing and parallelizing the modular GCD algorithm

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790287

Matthew Gibson, M. Monagan

Citations: 1

GPU-acceleration of optimal permutation-puzzle solving

Proceedings of the 2015 International Workshop on Parallel Symbolic Computation Pub Date : 2015-07-10 DOI: 10.1145/2790282.2790289

Hayakawa Hiroki, Ishida Naoaki, M. Hirokazu

Citations: 3