Efficient Polynomial Modular Reduction

On Computation of Polynomial Modular Reduction
Huapeng Wu June 9, 2000
Abstract In this paper, we consider the problem of efcient computation of polynomial modular reduction: , where is a monic polynomial of degree and is a polynomial of degree not greater than , both and are dened over a commutative ring with identity. For given and the degree of , we present an algorithm to compute this problem in addition operations in and the same number of multiplication operations in , where is the Hamming weight of . Applications of the proposed algorithm to nite eld arithmetic are also discussed.
Key Word:
Polynomial arithmetic, modular operation, nite eld arithmetic, complexity.
1.
INTRODUCTION
The recent advances in public key cryptography, especially elliptic curve cryptography, have rekindled the research in polynomial arithmetic, which is required in many nite eld operations. One example is nite eld multiplication. Let gree . Then
H.
be an irreducible polynomial over GF of de-
forms a standard basis in GF over GF . Every element in
Wu is with the Centre for Applied Cryptographic Research, Department of Combinatorics and Optimization, University of Waterloo, Waterloo, Canada N2L 3G1. E-mail:h3wu@cacr.math.uwaterloo.ca .
GF can be represented with a polynomial over GF of degree not greater than . Then a multiplication operation in GF can be realized in two steps: First we perform polynomial multiplication and obtain a product polynomial of degree not greater than ; Then the degree of this product polynomial is reduced to the proper degree range by applying to it polynomial modular operation. We consider here the problem of efcient computation of polynomial modular reduction:
, where is a monic polynomial of degree and
is a polynomial of degree not greater
than , the coefcients of both
and
are dened over a commutative ring with

identity. One way to obtain polynomial modular reduction is to use the well known polynomial division algorithm ([2], pp 402).1 It has a complexity of operations in . Let denote the Hamming weight of
.
Then a complexity upper bound of bit operations
is also known for the second step of a standard basis nite eld multiplication as explained in the last paragraph, if
[1, 3]. To our knowledge, however, there is no explicit algorithm available
in the literature regarding to how efciently computing polynomial modular reduction tting the above complexity bound. In this paper, for given
present an algorithm to compute this problem in addition operations in and the same number of multiplication operations in , where is the Hamming weight of of this algorithm to nite eld arithmetic are also discussed. The organization of this paper is as follows: We present the algorithm in 2.1 and illustrate
.
of degree and the degree upper bound of
we
Applications
how it works with an example in 2.2. The complexity issue is discussed in 2.3. Modications of the algorithm to suit for polynomials over nite eld of characteristic two are made in Section 3. The complexities of nite eld multiplication and squaring operation are respectively discussed in
3.1 and 3.2.

In this algorithm, polynomials are required to be dened over a eld. However, if the divisor polynomial is a monic polynomial, then this method also applies to polynomials over a commutative ring with identity.
1
2.
Efcient Algorithm for Polynomial Modular Reduction
2.1. Algorithm
Let
be a monic polynomial of Hamming weight over a commutative ring with identity
and be given by
where
Let the polynomial
whose degree is to be reduced have its degree bounded by modular operation is given by

. Then polynomial
(1)
If
is a monomial, then
is simply a sum of those terms of
whose degree is not
higher than . If
has a more complex form, then we have the following algorithm to
compute the polynomial modular reduction operation (1).
Algorithm 1 Reduction modulo a polynomial Input: . Output: , where
Part 1. Precomputation Input: and the upper bound of Output: Prepared coefcient lists . 1. Initialization of coefcient lists:
To , Step
2. Compute the prepared coefcient lists: For To
For
Append the pair
as one element to the coefcient list .
Part 2. Main Program Input: The coefcients of Output: The coefcients of For
(ii).
and the prepared coefcient lists : .
To , Step
(i). Compute the product of the two terms of a pair for all the pairs in ;
the sum of all the elements in ;

Output as results: First, for each term of

are
. , a coefcient list (CL) is introduced,

of
which initially has the coefcient the polynomial

. Note that in the precomputation part, the coefcients
unknown and they are treated as variables. We extend the terms
, as follows:

For each term to
on the right-hand side of the above expression, we append the pair
. Then after precomputation each CL contains one element

and possibly a few other
elements of the form of a pair
, and the list is now referred to as a prepared CL. Part two and computation is performed on the CLs. In the
of Algorithm 1 is to assign value to variable
following we rst show the correctness of the algorithm with an example. Then complexity of the algorithm is analyzed.
2.2. An Example
Let the monic polynomial
be given as
, and be the integer ring . Let
the polynomial whose degree is to be reduced by modular operation is
(2)

In following we proceed with Algorithm 1 when it takes inputs of 4
and
given above.
1. Precomputation: preparing coefcient lists. (i). The coefcient lists are initialized:
(3)
(ii). The coefcient lists (3) can be updated as
4d 4c 4b 4a

4d 4c 4b 4a

This step can be explained as follows: For the terms whose degree is equal to or higher than that of lower than the bound
,
and equal to or
, we extend them using the expressions

,
or:
(4a) (4b) (4c) (4d)
By using (4a-4d), the coefcient lists (3) can be updated based on the following argument. For the rst expression (4a), as an example, after (4a) has been applied to
to reduce its degree, the coefcient of the term should be added to these of the terms
and , or added to the coefcient lists and . Since the term initially has
the coefcient

, after using (4a) for
the coefcient of should be updated as

, and the coefcient of
should be updated as
Consequently, , respectively.
we have the CLs for and updated as
and
The rest undates can be done based on the expressions (4b-4d) with similar argument. (iii). The precomputed or the prepared coefcient list can be given as

2. Main program Input:
(5)
The prepared coefcient lists and the coefcients of

:

Output: Coefcients of Compute:
The output is
. So we have
Clearly, the cost of Main program of Algorithm 1 (part 2) is
addition operations in
and
constant multiplication operations (The multiply is a constant) in .
It can be seen that Algorithm 1 it does not require
to be completely xed. In fact, the
values of the coefcients can be variable as long as the degree and the distribution of Hamming weight of
are xed. In the above example, the precomputation step can still be performed
even if we do not know the values of coefcients that can be nonzero besides
and
, as long as we know that In that case,
and
are the only
multiplication operations (instead
of constant multiplication operations) are required.
2.3. Complexity
It can be seen that the complexity of Algorithm 1 (Part 2) depends on the size of coefcient lists. During the precomputation, each coefcient list is rst initialized to have one term . Then, in the second step in precomputation, the coefcient lists are expanded by terms. These terms are in the form of a pair. So the total number of terms in the prepared CLs is
The complexity of Algorithm 1 (Part 2) is decided by the steps (i) and (ii), where the sum of all the terms in a CL is obtained. Note that in the process of summing up, the product is used if a term is in the form of a pair. Since there are non-empty coefcient lists and they contain terms in total (of which terms are pairs), we conclude that
addition operations in and the same number of multiplication operations in are required for Algorithm 1 (Part 2). Since there are total elements of which are pairs, the amount for the required

memory for storing the prepared CLs is one element in . Note that only

units, with each unit storing
CLs are used in Part 2 of Algorithm 1. The required

7
memory amount can be reduced by storing only those CLs that are used in the Main program. Then the necessary memory should have

units. To further
save memory we may store in CLs the indices of the coefcients instead of their values.
3.
Application in Finite Field Arithmetic
3.1. Polynomial basis multiplication

Algorithm Let the nite eld GF be dened by the irreducible polynomial is a basis in GF over GF . Let eld multiplication
and can , thus
be two eld elements in GF .
Then a nite
be realized in the following two steps:
1.

where
2.
(6)
Complexity Clear, Algorithm 1 can be directly used to compute the expression (6) with
It has a complexity of addition operations in GF and constant multiplication operations in GF , where is the Hamming weight of the monic irreducible polynomial
.
If
is chosen to have a low Hamming weight, i.e., binomial and trinomial,
then only operations in the ground eld are required for reduction modulo a polynomial.
Reduction modulo a polynomial over GF is either
In the case that
since a constant in GF
or , the constant multiplication operations can be saved. Thus the step of reduction
modulo the irreducible polynomial requires only addition operations in GF. A version of Algorithm 1 over GF is given below.
Algorithm 2 Polynomial modular reduction over GF Input: . Output: , where . Part 1. Precomputation and the degree upper bound of Input: Output: Prepared coefcient lists . 1. Initialization of coefcient lists:
2. Compute the prepared CLs: For
To , Step
For
To
Append
to .
Part 2. Main Program Input: The coefcients of Output: The coefcients of For
and the prepared coefcient lists .
To , Step
the sum of all the terms in ;

Output as results:
. .
It can be seen that Algorithm 2 can be used for computing the expression (6) with
3.2. Polynomial basis Squaring in GF

Algorithm Assume that
is an irreducible polynomial over GF . Let
be a eld element in GF and be represented in the polynomial basis expanded from a root of
.
Then squaring operation

in GF is given by

where
In this case of application of Algorithm 1, since we have additional information on the polynomial that is to be reduced ( from that in Algorithm 2. Algorithm 3 Polynomial basis squaring in GF Input: . Output: , where Part 1. Precomputation Input: . Output: Prepared coefcient lists 1. Initialization of coefcient lists:
for odd), the precomputation part is done in a slightly different way
otherwise. To , Step To
2. Compute the prepared CLs: For
For
Append
to .

Let If
is odd.
if
To , Step
(7) otherwise.
Then
For
10
For
To to .
Append
Part 2. Main Program Input: The coefcients of , and the prepared coefcient lists Output: The coefcients of : . For
To
Step
the sum of all the terms in ;

Output as results:
Complexity Let denote the number of terms appended to the precomputation part in Algorithm 3. Then we have
CLs in Step 2 of the
if
otherwise,
Let the number of bit additions required in Part 2 in Algorithm 3 be denoted by . Clearly,
. However, the CL with being odd was initially empty and thus in part two of the . Then is the number of the different odd values of and . If is irreducible over GF ,

algorithm one bit operation can be saved in summing up the elements in if it becomes non-empty after step 2 of part 1. Let for
then at least one of and Therefore, it follows
is an odd number. We thus have
if
(8)
otherwise, 11
where
is dened in (7) in Algorithm 3.

to
Consider
be an irreducible trinomial
We try to use the bound (8) to decide
the complexity for this case. From (7), we have

Then from (8) and note
,
if
is odd,
otherwise.
it gives the bound below:
if if
is odd, is even.
References
[1] V. B. Afanasyev, C. Gehrmann and B. Smeets, Fast message authentication using efcient polynomial evaluation, Fast Software Encryption Workshop (E. Biham, Ed.), Lecture Notes in Computer Science, Springer-Verlag, New York, 1267 (1997) pp. 190-204. [2] D. E. Knuth, The Art of Computer Programming: Seminumerical Algorithms, AddisonWesley Publishing Company, Reading, MA (1981). [3] H. Wu, Efcient Computations in Finite Fields with Cryptographic Signicance, Ph.D Thesis, University of Waterloo, Waterloo,Ontario,Canada (1998).
12
Key Words:
Polynomial arithmetic, modular operation, nite eld arithmetic, complexity.
Postal: Huapeng Wu The Centre for Applied Cryptographic Research Dept of Combinatorics and Optimization University of Waterloo Waterloo Ontario Canada N2L 3G1 Voice: 519-888-4567 x3600 Fax: 519-725-5441 Email: h3wu@cacr.math.uwaterloo.ca
Afliation of Author: The Centre for Applied Cryptographic Research Dept of Combinatorics and Optimization University of Waterloo Footnotes:
1. In this algorithm, polynomials are required to be dened over a eld. However, if the divisor polynomial is a monic polynomial, then this method also applies to polynomials over a commutative ring with identity.
Melissa Sullivan Designs, Codes and Cryptography - Editorial Ofce Kluwer Academic Publishers 101 Philip Drive Norwell, MA 02061, U.S.A.

Efficient Polynomial Modular Reduction

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Efficient Polynomial Modular Reduction

Uploaded by

Copyright:

Available Formats

On Computation of Polynomial Modular Reduction

Huapeng Wu June 9, 2000

Polynomial arithmetic, modular operation, nite eld arithmetic, complexity.

be an irreducible polynomial over GF of de-

forms a standard basis in GF over GF . Every element in

is a polynomial of degree not greater

than , the coefcients of both

are dened over a commutative ring with

Then a complexity upper bound of bit operations

[1, 3]. To our knowledge, however, there is no explicit algorithm available

of degree and the degree upper bound of

3.1 and 3.2.

Efcient Algorithm for Polynomial Modular Reduction

be a monic polynomial of Hamming weight over a commutative ring with identity

Let the polynomial

is simply a sum of those terms of

whose degree is not

has a more complex form, then we have the following algorithm to

compute the polynomial modular reduction operation (1).

Algorithm 1 Reduction modulo a polynomial Input: . Output: , where

2. Compute the prepared coefcient lists: For To

Append the pair

as one element to the coefcient list .

and the prepared coefcient lists : .

the sum of all the elements in ;

Output as results: First, for each term of

. , a coefcient list (CL) is introduced,

which initially has the coefcient the polynomial

. Note that in the precomputation part, the coefcients

unknown and they are treated as variables. We extend the terms

For each term to

on the right-hand side of the above expression, we append the pair

. Then after precomputation each CL contains one element

and possibly a few other

elements of the form of a pair

of Algorithm 1 is to assign value to variable

, and be the integer ring . Let

the polynomial whose degree is to be reduced by modular operation is

In following we proceed with Algorithm 1 when it takes inputs of 4

(ii). The coefcient lists (3) can be updated as

, we extend them using the expressions

(4a) (4b) (4c) (4d)

, after using (4a) for

the coefcient of should be updated as

, and the coefcient of

we have the CLs for and updated as

The prepared coefcient lists and the coefcients of

Output: Coefcients of Compute:

Clearly, the cost of Main program of Algorithm 1 (part 2) is

constant multiplication operations (The multiply is a constant) in .

It can be seen that Algorithm 1 it does not require

to be completely xed. In fact, the

, as long as we know that In that case,

are the only

multiplication operations (instead

of constant multiplication operations) are required.

units, with each unit storing

CLs are used in Part 2 of Algorithm 1. The required

Application in Finite Field Arithmetic

3.1. Polynomial basis multiplication

be two eld elements in GF .

be realized in the following two steps: