2011, Journal of Symbolic Computation
In 1974, Johnson showed how to multiply and divide sparse polynomials using a binary heap. This paper introduces a new algorithm that uses a heap to divide with the same complexity as multiplication. It is a fraction-free method that also reduces the number of integer operations for divisions of polynomials with integer coefficients over the rationals. Heap-based algorithms use very little memory and do not generate garbage. They can run in the CPU cache and achieve high performance. We compare our C implementation of sparse polynomial multiplication and division with integer coefficients to the routines of existing computer algebra systems.
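To make the heap-merging idea concrete, here is a minimal sketch (not the authors' Maple or C code) of Johnson-style heap-based multiplication, assuming univariate polynomials with machine-integer coefficients, terms sorted by decreasing exponent, and no overflow handling. The heap holds one pointer pair per term of the first factor, so the working memory stays proportional to the smaller input while product terms come out already sorted.

#include <stdio.h>
#include <stdlib.h>

typedef struct { long coeff; int exp; } term;   /* one sparse term                  */
typedef struct { int i, j; } pair;              /* pending product f[i]*g[j]        */

/* max-heap of index pairs ordered by the exponent of f[i]*g[j] */
typedef struct { pair *e; int n; const term *f, *g; } heap;

static int hexp(const heap *h, int k) { return h->f[h->e[k].i].exp + h->g[h->e[k].j].exp; }

static void heap_push(heap *h, pair p) {
    int k = h->n++;
    h->e[k] = p;
    while (k > 0 && hexp(h, k) > hexp(h, (k - 1) / 2)) {      /* sift up */
        pair t = h->e[k]; h->e[k] = h->e[(k - 1) / 2]; h->e[(k - 1) / 2] = t;
        k = (k - 1) / 2;
    }
}

static pair heap_pop(heap *h) {
    pair top = h->e[0];
    h->e[0] = h->e[--h->n];
    for (int k = 0;;) {                                       /* sift down */
        int l = 2 * k + 1, r = l + 1, m = k;
        if (l < h->n && hexp(h, l) > hexp(h, m)) m = l;
        if (r < h->n && hexp(h, r) > hexp(h, m)) m = r;
        if (m == k) break;
        pair t = h->e[k]; h->e[k] = h->e[m]; h->e[m] = t;
        k = m;
    }
    return top;
}

/* Multiply f (nf terms) by g (ng terms); writes at most nf*ng terms to out.
   Only nf pairs are ever in the heap at once. */
int poly_mul(const term *f, int nf, const term *g, int ng, term *out) {
    heap h = { malloc(nf * sizeof(pair)), 0, f, g };
    int nout = 0;
    for (int i = 0; i < nf; i++) heap_push(&h, (pair){ i, 0 });   /* seed with f[i]*g[0] */
    while (h.n > 0) {
        int e = hexp(&h, 0);
        long c = 0;
        while (h.n > 0 && hexp(&h, 0) == e) {       /* merge all products of equal degree */
            pair p = heap_pop(&h);
            c += f[p.i].coeff * g[p.j].coeff;
            if (p.j + 1 < ng) heap_push(&h, (pair){ p.i, p.j + 1 });
        }
        if (c != 0) out[nout++] = (term){ c, e };
    }
    free(h.e);
    return nout;
}

int main(void) {
    term f[] = { {1, 2}, {2, 1}, {3, 0} };   /* x^2 + 2x + 3 */
    term g[] = { {1, 1}, {-1, 0} };          /* x - 1        */
    term out[6];
    int n = poly_mul(f, 3, g, 2, out);
    for (int k = 0; k < n; k++) printf("%+ldx^%d ", out[k].coeff, out[k].exp);
    printf("\n");                            /* expect: +1x^3 +1x^2 +1x^1 -3x^0 */
    return 0;
}

The division algorithm analysed in the paper extends this kind of merging: the dividend and the partial products of the divisor with the growing quotient are combined in the same heap-driven pass.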
We present a new algorithm for pseudo-division of sparse multivariate polynomials with integer coefficients. It uses a heap of pointers to simultaneously merge the dividend and partial products, sorting the terms efficiently and delaying all coefficient arithmetic to produce good complexity. The algorithm uses very little memory and we expect it to run in the processor cache. We give benchmarks comparing our implementation to existing computer algebra systems.
We report on new code for sparse multivariate polynomial multiplication and division that we have recently integrated into Maple as part of our MITACS project at Simon Fraser University. Our goal was to try to beat Magma, which is widely viewed in the computer algebra community as having state-of-the-art polynomial algebra. Here we give benchmarks comparing our implementation for multiplication and division with the Magma, Maple, Singular, Trip and Pari computer algebra systems. Our algorithms use a binary heap to multiply and divide using very little working memory. Details of our work may be found in [7] and [8].
2007
A common way of implementing multivariate polynomial multiplication and division is to represent polynomials as linked lists of terms sorted in a term ordering and to use repeated merging. This results in poor performance on large sparse polynomials.
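As a point of reference, the following is a small C sketch of this repeated-merging strategy (univariate, machine-integer coefficients; the function names are ours). Every partial product is merged into one sorted linked list, so later merges rescan an ever longer accumulator, which is what degrades performance on large sparse inputs.

#include <stdio.h>
#include <stdlib.h>

typedef struct node { long coeff; int exp; struct node *next; } node;  /* sorted by decreasing exp */

static node *cons(long c, int e, node *next) {
    node *t = malloc(sizeof *t);
    t->coeff = c; t->exp = e; t->next = next;
    return t;
}

/* destructively merge the term (c, e) into the sorted list *h */
static void merge_term(node **h, long c, int e) {
    while (*h && (*h)->exp > e) h = &(*h)->next;
    if (*h && (*h)->exp == e) { (*h)->coeff += c; return; }
    *h = cons(c, e, *h);
}

/* h = f * g by merging every f[i]*g[j] into h one term at a time */
node *poly_mul_merge(node *f, node *g) {
    node *h = NULL;
    for (node *p = f; p; p = p->next)
        for (node *q = g; q; q = q->next)
            merge_term(&h, p->coeff * q->coeff, p->exp + q->exp);
    return h;
}

int main(void) {
    node *f = cons(1, 2, cons(2, 1, cons(3, 0, NULL)));   /* x^2 + 2x + 3 */
    node *g = cons(1, 1, cons(-1, 0, NULL));              /* x - 1        */
    for (node *t = poly_mul_merge(f, g); t; t = t->next)
        if (t->coeff) printf("%+ldx^%d ", t->coeff, t->exp);
    printf("\n");                                         /* x^3 + x^2 + x - 3 */
    return 0;
}

By contrast, the heap-based approach sketched earlier keeps the result terms sorted as they are produced and never rescans them.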
2010
We present a Las Vegas algorithm for interpolating a sparse multivariate polynomial over a finite field, represented with a black box. Our algorithm modifies the 1988 algorithm of Ben-Or and Tiwari, which interpolates polynomials over rings of characteristic zero, to work in characteristic p by making additional probes.
ACM Communications in Computer Algebra, 2011
We demonstrate new routines for sparse multivariate polynomial multiplication and division over the integers that we have integrated into Maple 14 through the expand and divide commands. These routines are currently the fastest available, and the multiplication routine is parallelized with superlinear speedup. The performance of Maple is significantly improved. We describe our polynomial data structure and compare it with Maple's. Then we present benchmarks comparing Maple 14 with Maple 13, Magma, Mathematica, Singular, Pari, and Trip.
2009
We present a high performance algorithm for multiplying sparse distributed polynomials using a multicore processor. Each core uses a heap of pointers to multiply parts of the polynomials using its local cache. Intermediate results are written to buffers in shared cache and the cores take turns combining them to form the result. A cooperative approach is used to balance the load and improve scalability, and the extra cache from each core produces a superlinear speedup in practice. We present benchmarks comparing our parallel routine to a sequential version and to the routines of other computer algebra systems.
Polynomial division is one of the most common numerical operations in filters and similar circuits, alongside multiplication, addition and subtraction. Because such components are used frequently in mobile and other communication applications, a fast polynomial divider would improve the overall speed of many of these applications. This project designs, develops and implements an efficient polynomial divider algorithm together with its circuit, and then verifies its output performance using Verilog simulation. A literature survey of the division algorithms currently used by ALUs to divide large numbers yielded Booth's algorithm and the restoring and non-restoring algorithms. Verilog simulations of these algorithms were used to measure efficiency in terms of timing characteristics, required chip area and power dissipation. Performance of the existing algorithms was first analyzed from the simulated outputs, and a similar analysis was then performed on the updated polynomial divider circuit.
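For reference, here is a brief sketch in C (rather than Verilog) of the classical restoring division algorithm named in the survey: one quotient bit is produced per iteration, and the divisor is added back ("restored") whenever a trial subtraction goes negative.

#include <stdint.h>
#include <stdio.h>

/* restoring division of 32-bit unsigned integers: n = q*d + r, 0 <= r < d, d > 0 */
void restoring_div(uint32_t n, uint32_t d, uint32_t *q, uint32_t *r) {
    uint64_t rem = 0;
    *q = 0;
    for (int i = 31; i >= 0; i--) {
        rem = (rem << 1) | ((n >> i) & 1);    /* bring down the next dividend bit */
        rem -= d;                             /* trial subtraction                */
        if ((int64_t)rem < 0) {
            rem += d;                         /* restore on a negative result     */
        } else {
            *q |= (uint32_t)1 << i;           /* this quotient bit is 1           */
        }
    }
    *r = (uint32_t)rem;
}

int main(void) {
    uint32_t q, r;
    restoring_div(1000003u, 97u, &q, &r);
    printf("q = %u, r = %u\n", (unsigned)q, (unsigned)r);   /* expect q = 10309, r = 30 */
    return 0;
}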
ACM SIGSAM Bulletin, 2003
How should one design and implement a program for the multiplication of sparse polynomials? This is a simple question, necessarily addressed by the builders of any computer algebra system (CAS). To examine a few options we start with a single easily-stated computation which we believe represents a useful benchmark of "medium difficulty" for CAS designs. We describe a number of design options and their effects on performance. We also examine the performance of a variety of commercial and freely-distributed systems. Important considerations include the cost of high-precision (exact) integer arithmetic and the effective use of cache memory.
2005
Division is one of the basic operations of arithmetic algorithms, but the cost associated with its hardware implementation exceeds reasonable limits for most dedicated architectures. This paper provides a systematic algorithm that (a) transforms the constant-coefficient division into a constant-coefficient multiplication, selectable under some given constraints, and (b) optimizes the resulting multiplier by analyzing the quantization noise inherent in the finite-wordlength implementation process. Consequently, this algorithm achieves reduced-area, high-speed constant-coefficient dividers that maintain accuracy over the range of represented numbers. The theorems and results presented confirm that the proposed algorithm computes the most suitable fraction under a given set of noise constraints.
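A minimal sketch of the underlying idea: division by a fixed constant is replaced by multiplication with a precomputed fixed-point reciprocal. The 32 fractional bits and the 16-bit dividend range below are our assumptions, chosen so that the identity is exact; the paper's contribution is selecting and optimising such a multiplier wordlength under quantization-noise constraints.

#include <stdint.h>
#include <stdio.h>

int main(void) {
    const uint32_t d = 641;                               /* fixed divisor                */
    const uint64_t m = ((1ULL << 32) + d - 1) / d;        /* ceil(2^32 / d), the reciprocal */

    /* exhaustively verify the multiply-and-shift against true division */
    for (uint32_t x = 0; x <= 0xFFFF; x++) {              /* all 16-bit dividends          */
        uint32_t q = (uint32_t)((x * m) >> 32);           /* multiply, then shift          */
        if (q != x / d) { printf("mismatch at %u\n", (unsigned)x); return 1; }
    }
    printf("multiply-by-reciprocal matches x/%u for all 16-bit x\n", (unsigned)d);
    return 0;
}

In hardware this turns a divider into one constant-coefficient multiplier and a shift, which is exactly the kind of circuit the paper then optimizes for area and accuracy.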
Computers & Mathematics with Applications, 1997
A new algorithm for splitting polynomials is presented. This algorithm requires O(d log ε⁻¹)^{1+δ} floating point operations, with O(log ε⁻¹)^{1+δ} bits of precision. As far as complexity is concerned, this is the fastest algorithm known to the authors for this problem. Important applications of the method are polynomial factorization and polynomial root-finding.
Mathematics of Computation, 1994
Recurrence relations for the coefficients of the nth division polynomial for elliptic curves are presented. These provide an algorithm for computing the general division polynomial without using polynomial multiplications; a bound is also given for the coefficients, their general shape is revealed, and a means is provided for computing the coefficients as explicit functions of n.
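For context, these are the classical recurrences for the division polynomials ψ_n of a curve y² = x³ + ax + b (standard background, not the coefficient recurrences of the paper), which compute ψ_n but require polynomial multiplications:

\[
\begin{aligned}
\psi_0 &= 0, \qquad \psi_1 = 1, \qquad \psi_2 = 2y,\\
\psi_3 &= 3x^4 + 6ax^2 + 12bx - a^2,\\
\psi_4 &= 4y\left(x^6 + 5ax^4 + 20bx^3 - 5a^2x^2 - 4abx - 8b^2 - a^3\right),\\
\psi_{2m+1} &= \psi_{m+2}\,\psi_m^3 - \psi_{m-1}\,\psi_{m+1}^3 \qquad (m \ge 2),\\
2y\,\psi_{2m} &= \psi_m\left(\psi_{m+2}\,\psi_{m-1}^2 - \psi_{m-2}\,\psi_{m+1}^2\right) \qquad (m \ge 3).
\end{aligned}
\]

The paper instead gives recurrences on the coefficients of ψ_n directly, so that the general division polynomial can be obtained without carrying out these polynomial products.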
IEEE Transactions on Computers, 2000
In this paper, the use of an efficient sparse minor expansion method to directly compute the subresultants needed for the greatest common divisor (GCD) of two polynomials is described. The sparse minor expansion method (applied either to Sylvester's or Bezout's matrix) naturally computes the coefficients of the subresultants in the order corresponding to a polynomial remainder sequence (PRS), avoiding wasteful recomputation as much as possible. It is suggested that this is an efficient method to compute the resultant and GCD of sparse polynomials.
Proceedings of the 1991 international symposium on Symbolic and algebraic computation - ISSAC '91, 1991
We present two algorithms for interpolating sparse rational functions. The first interpolates in the sense of a sparse partial fraction representation of rational functions. The second computes the entier (polynomial part) and the remainder of a rational function. The first algorithm works without an a priori known bound on the degree of the rational function; the second is in the parallel class NC provided that the degree is known. The presented algorithms complement the sparse interpolation results of [GKS 90].
Very Fast Integer Divide, Greatest Common Divisor (GCD), Integer Multiply with Pseudo Code, 2022
In this paper we discuss binary implementations of integer division, the greatest common divisor (GCD), and integer multiplication. Fast binary integer division, a very fast binary GCD, and parallel O(1) addition and subtraction can be implemented in the arithmetic logic unit (ALU) of a CPU to give modern CPUs very fast multiprecision arithmetic. If the four operations of addition, subtraction, multiplication and division are performed in hardware in the ALU of a CPU, then all existing software for that instruction set architecture is accelerated transparently, without modification, which is a win-win situation. These four operations are also the operations of Galois fields, so such hardware speeds up arithmetic and cryptography. An ALU could also be implemented temporarily on an FPGA with suitably high-speed clocks on a PCI Express add-on card, and that ALU would perhaps be faster than the native CPU's ALU. The add-on hardware accelerator could talk to the CPU directly through the PCI Express bus with a protocol that transfers bytes of multiprecision arithmetic data between them: an initial byte to identify the arithmetic operation, then four or more bytes to give the length of the data, perhaps similar to the Microsoft WAVE format for digitized sound streams. The ALUs of digital signal processors could also be replaced to obtain very fast arithmetic. CPUs with accelerated ALUs could make real-time AI training and pattern recognition possible. Binary integer division is usually done by successively subtracting the divisor from the dividend (the schoolbook method) until what remains is less than the divisor; this is the remainder, and the quotient is the number of subtractions performed. A much faster method, assuming for simplicity that the dividend n and the divisor d are positive integers, subtracts the maximum allowable amount from n at each step. Let k be the bit length of n and p the bit length of d, and set m = k − p; shifting d left by m bits aligns its most significant bit with that of n and gives d·2^m. If n < d·2^m, shift right by one bit (reduce m by one) so that n ≥ d·2^m, then replace n by n − d·2^m. The quotient q is the running sum of the powers of two used: through successive shifts m_i and subtractions of d·2^{m_i} from n_i we reach a remainder r = n_i with n_i < d, then stop and output q = Σ_i 2^{m_i} and r.
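A minimal C sketch of this shift-and-subtract method for single machine words (the paper targets multiprecision integers, but the structure is the same; the function name shift_sub_div is ours):

#include <stdint.h>
#include <stdio.h>

/* bit length of x: number of bits needed to represent x, 0 for x == 0 */
static int bitlen(uint64_t x) {
    int k = 0;
    while (x) { k++; x >>= 1; }
    return k;
}

/* Compute q, r with n = q*d + r and 0 <= r < d, assuming d > 0. */
void shift_sub_div(uint64_t n, uint64_t d, uint64_t *q, uint64_t *r) {
    *q = 0;
    while (n >= d) {
        int m = bitlen(n) - bitlen(d);        /* largest candidate shift        */
        if ((d << m) > n) m--;                /* shifted divisor too big? back off one bit */
        n -= d << m;                          /* subtract d * 2^m               */
        *q += (uint64_t)1 << m;               /* accumulate 2^m into the quotient */
    }
    *r = n;
}

int main(void) {
    uint64_t q, r;
    shift_sub_div(1000003, 97, &q, &r);
    printf("q = %llu, r = %llu\n", (unsigned long long)q, (unsigned long long)r);
    /* check: 1000003 = 97*10309 + 30 */
    return 0;
}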
IEEE Symposium on Foundations of Computer Science, 1988
Algorithms are developed that adopt a novel implicit representation for multivariate polynomials and rational functions with rational coefficients, that of black boxes for their evaluation. We show that within this representation the polynomial greatest common divisor and factorization problems, as well as the problem of extracting the numerator and denominator of a rational function, can all be solved in random polynomial-time. Since we can convert black boxes efficiently to sparse format, problems with sparse solutions, e.g., sparse polynomial factorization and sparse multivariate rational function interpolation, are also in random polynomial time. Moreover, the black box representation is one of the most space efficient implicit representations that we know. Therefore, the output programs can be easily distributed over a network of processors for further manipulation, such as sparse interpolation.
ANZIAM Journal, 2019
Computing the greatest common divisor (GCD) of two polynomials in floating point arithmetic is computationally challenging, and even standard library software might return GCD = 1 when the polynomials have a nontrivial GCD. Here we review Euclid's algorithm and test a variant on a class of random polynomials. We find that our variant of Euclid's method often produces an acceptable result. However, close monitoring of the norm of the vector of coefficients of the intermediate polynomials is required. References: R. M. Corless, P. M. Gianni, B. M. Trager, and S. M. Watt. The singular value decomposition for polynomial systems. In Proceedings of the 1995 International Symposium on Symbolic and Algebraic Computation, ISSAC '95, pages 195–207. ACM, 1995. doi:10.1145/220346.220371. H. J. Stetter. Numerical polynomial algebra. SIAM, 2004. doi:10.1137/1.9780898717976. Z. Zeng. The numerical greatest common divisor of univariate polynomials. In Randomization, relaxat...
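As a concrete illustration (a sketch only, not the paper's code; the cutoff EPS below is an assumed tolerance, not the paper's criterion), Euclid's algorithm on double-precision coefficients with the kind of norm monitoring the abstract calls for might look like this in C:

#include <math.h>
#include <stdio.h>

#define MAXDEG 64
#define EPS 1e-10

typedef struct { int deg; double c[MAXDEG + 1]; } poly;   /* c[i] holds the coefficient of x^i */

static double cnorm(const poly *p) {                      /* 2-norm of the coefficient vector  */
    double s = 0.0;
    for (int i = 0; i <= p->deg; i++) s += p->c[i] * p->c[i];
    return sqrt(s);
}

/* remainder of a divided by b, by classical long division on double coefficients */
static poly poly_rem(poly a, const poly *b) {
    while (a.deg >= b->deg && cnorm(&a) > EPS) {
        double q = a.c[a.deg] / b->c[b->deg];
        int s = a.deg - b->deg;
        for (int i = 0; i <= b->deg; i++) a.c[i + s] -= q * b->c[i];
        a.c[a.deg] = 0.0;
        while (a.deg > 0 && fabs(a.c[a.deg]) < EPS) a.deg--;   /* strip tiny leading coefficients */
    }
    return a;
}

/* Euclid's algorithm, printing the coefficient norm of each remainder so that
   numerical trouble (norms blowing up or collapsing) can be watched */
static poly poly_gcd(poly a, poly b) {
    while (b.deg > 0 || fabs(b.c[0]) > EPS) {
        printf("intermediate: degree %d, coefficient norm %.3e\n", b.deg, cnorm(&b));
        poly r = poly_rem(a, &b);
        a = b;
        b = r;
    }
    return a;
}

int main(void) {
    poly f = { 2, { 2.0, -3.0, 1.0 } };   /* (x-1)(x-2) = x^2 - 3x + 2 */
    poly g = { 2, { 3.0, -4.0, 1.0 } };   /* (x-1)(x-3) = x^2 - 4x + 3 */
    poly d = poly_gcd(f, g);
    for (int i = d.deg; i >= 0; i--) printf("%+g*x^%d ", d.c[i], i);
    printf("\n");                          /* expect a scalar multiple of x - 1 */
    return 0;
}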
Proceedings of the 2007 international symposium on Symbolic and algebraic computation - ISSAC '07, 2007
We present a first sparse modular algorithm for computing a greatest common divisor of two polynomials f1, f2 ∈ L[x] where L is an algebraic function field in k ≥ 0 parameters with r ≥ 0 field extensions. Our algorithm extends the dense algorithm of Monagan and van Hoeij from 2004 to support multiple field extensions and to be efficient when the gcd is sparse. Our algorithm is an output-sensitive Las Vegas algorithm. We have implemented our algorithm in Maple. We provide timings demonstrating the efficiency of our algorithm compared with that of Monagan and van Hoeij and with a primitive fraction-free Euclidean algorithm, for both dense and sparse gcd problems.
Lecture Notes in Computer Science, 2014
The Basic Polynomial Algebra Subprograms (BPAS) provides arithmetic operations (multiplication, division, root isolation, etc.) for univariate and multivariate polynomials over common types of coefficients (prime fields, complex rational numbers, rational functions, etc.). The code is mainly written in CilkPlus [10] targeting multicore processors. The current distribution focuses on dense polynomials and the sparse case is work in progress. A strong emphasis is put on adaptive algorithms as the library aims at supporting a wide variety of situations in terms of problem sizes and available computing resources.