9 When Polynomials Save the Day: Polynomial Based Codes

Definition 178 Total degree of a polynomial, Def 9.1.1

✓

For a monomial \(\mathbf{X}^{\mathbf{d}} = X_1^{d_1} \cdot X_2^{d_2} \cdots X_m^{d_m}\) its total degree is \(d_1 + d_2 + \cdots + d_m\). The total degree of a polynomial \(P(\mathbf{X}) = \sum _{\mathbf{d}} c_{\mathbf{d}} \mathbf{X}^{\mathbf{d}}\) over \(\mathbb {F}_q\) (i.e. every \(c_{\mathbf{d}} \in \mathbb {F}_q\)) is the maximum, over \(\mathbf{d}\) such that \(c_{\mathbf{d}} \neq 0\), of the total degree of \(\mathbf{X}^{\mathbf{d}}\). We denote the total degree of \(P\) by \(\deg (P)\).

Definition 179 Individual degree of a polynomial, Def 9.1.2

✓

For a polynomial \(p \in \mathbb {F}_q[X_1,\ldots ,X_m]\) and \(i \in [m]\), \(\deg _{X_i}(p)\) denotes the degree of \(p\) in the variable \(X_i\), i.e. the maximum \(d_i\) appearing in a monomial of \(p\) with nonzero coefficient.

Lemma 180 Frobenius-type identity for finite fields

✓

For every finite field \(\mathbb {F}_q\) and every \(a \in \mathbb {F}_q\), \(a^q - a = 0\).

Definition 181 Degree of a function, Def 9.1.1–9.1.2 continued

It is convenient to switch back and forth between multivariate functions and multivariate polynomials. For \(f : \mathbb {F}_q^m \to \mathbb {F}_q\), let \(\deg (f)\) be the minimal total degree of a polynomial \(P \in \mathbb {F}_q[X_1,\ldots ,X_m]\) (where \(\mathbb {F}_q[X_1,\ldots ,X_m]\) denotes the set of all \(m\)-variate polynomials with coefficients from \(\mathbb {F}_q\)) such that \(f(\alpha ) = P(\alpha )\) for every \(\alpha \in \mathbb {F}_q^m\). Since \(a^q - a = 0\) for every \(a \in \mathbb {F}_q\) (Lemma 180), it follows that a minimal-degree polynomial representing \(f\) does not contain, in any single variable, a monomial of degree more than \(q-1\). Similarly \(\deg _{X_i}(f)\) denotes the degree of (the minimal polynomial corresponding to) \(f\) in variable \(X_i\). In this notation, for every \(f : \mathbb {F}_q^m \to \mathbb {F}_q\) we have \(\deg _{X_i}(f) \leq q-1\) for every \(i \in [m]\).

Definition 182 Reed-Muller Codes, Def 9.1.3

Reed-Muller codes are given by three parameters: a prime power \(q\) and positive integers \(m\) and \(r\) and consist of the evaluations of \(m\)-variate polynomials of degree at most \(r\) over all of the domain \(\mathbb {F}_q^m\). The Reed-Muller code with parameters \(q,m,r\), denoted \(\mathrm{RM}(q,m,r)\), is the set of evaluations of all \(m\)-variate polynomials in \(\mathbb {F}_q[X_1,\ldots ,X_m]\) of total degree at most \(r\) and individual degree at most \(q-1\) over all points in \(\mathbb {F}_q^m\). Formally

\[ \mathrm{RM}(q,m,r) \overset {\mathrm{def}}{=} \left\{ f : \mathbb {F}_q^m \to \mathbb {F}_q \mid \deg (f) \leq r \right\} . \]

The Reed-Muller code with parameters \((q,m,r)\) clearly has alphabet \(\mathbb {F}_q\) and block length \(n = q^m\). It can be verified that \(\mathrm{RM}(q,m,r)\) is a linear code (Exercise 9.1).

Lemma 183 Degree mantra

✓

A nonzero univariate polynomial \(f(X)\) of degree \(t\) over a field \(\mathbb {F}_q\) has at most \(t\) distinct roots in \(\mathbb {F}_q\). (This restates Prop. 5.1.5 of the book. It is deliberately restated locally rather than cited across chapters, since the corresponding Ch5 label is not part of the canonical cross-chapter table used to build this blueprint.)

Lemma 184 Union bound

✓

For events \(\mathcal{E}_1, \mathcal{E}_2\) in a common probability space, \(\Pr [\mathcal{E}_1 \cup \mathcal{E}_2] \leq \Pr [\mathcal{E}_1] + \Pr [\mathcal{E}_2]\).

Proposition 185 Dimension, low-degree case, Prop 9.2.1

If \(r {\lt} q\) then the dimension of the Reed-Muller code \(\mathrm{RM}(q,m,r)\) equals \(\binom {m+r}{r}\).

Proof ▶

Since \(r \le q-1\), restricting the total degree to at most \(r\) automatically forces every variable to have degree at most \(q-1\), so no individual-degree constraint is active. Hence the dimension equals the size of the set

\[ D = \left\{ (d_1,\ldots ,d_m) \in \mathbb {Z}^m \mid d_i \geq 0 \text{ for all } i \in [m], \sum _{i=1}^m d_i \leq r \right\} , \]

since for every \((d_1,\ldots ,d_m) \in D\) the monomial \(X_1^{d_1} \cdots X_m^{d_m}\) is a monomial of degree at most \(r\), and these are all such monomials (a basis for the space of \(m\)-variate polynomials of total degree at most \(r\)).

It remains to show \(|D| = \binom {m+r}{r}\), a standard stars-and-bars count: introduce a slack variable \(d_0 = r - \sum _{i=1}^m d_i \geq 0\), giving a bijection between \(D\) and the set of \((m+1)\)-tuples of non-negative integers \((d_0,d_1,\ldots ,d_m)\) with \(\sum _{i=0}^m d_i = r\). The number of such tuples is the number of ways to place \(r\) indistinguishable balls into \(m+1\) distinguishable bins, which is \(\binom {r + m}{m} = \binom {m+r}{r}\).

Lemma 186 Polynomial Distance Lemma (low-degree case), Lemma 9.2.2

Let \(f \in \mathbb {F}_q[X_1,\ldots ,X_m]\) be a non-zero polynomial with \(\deg (f) \leq r\). Then the fraction of zeroes of \(f\) is at most \(\frac{r}{q}\), i.e.,

\[ \frac{|\{ \mathbf{a} \in \mathbb {F}_q^m \mid f(\mathbf{a}) = 0\} |}{q^m} \leq \frac{r}{q}. \]

Proof ▶

The statement is equivalent to saying that the probability that \(f(\mathbf{a}) = 0\) is at most \(\frac{\deg (f)}{q}\) when \(\mathbf{a} = (a_1,\ldots ,a_m)\) is chosen uniformly at random from \(\mathbb {F}_q^m\). We prove this by induction on \(m \geq 1\).

Base case \(m=1\). This is exactly the degree mantra (Lemma 183): a nonzero univariate polynomial of degree \(\deg (f) \le r\) has at most \(\deg (f)\) roots, so the probability that a uniformly random \(a_1 \in \mathbb {F}_q\) is a root is at most \(\deg (f)/q \le r/q\).

Inductive step. Let \(m {\gt} 1\) and assume the lemma holds for \((m-1)\)-variate polynomials. Write \(f\) as a polynomial in \(X_m\) with coefficients in \(\mathbb {F}_q[X_1,\ldots ,X_{m-1}]\):

\[ f = f_0 X_m^0 + f_1 X_m^1 + \cdots + f_t X_m^t, \]

where each \(f_i(X_1,\ldots ,X_{m-1})\) is a polynomial from \(\mathbb {F}_q[X_1,\ldots ,X_{m-1}]\) with \(\deg (f_i) \leq r-i\), and \(t\) is the largest index such that \(f_t\) is not the zero polynomial. Pick \(\mathbf{a} \in \mathbb {F}_q^m\) in two steps: first pick \((a_1,\ldots ,a_{m-1})\) uniformly at random from \(\mathbb {F}_q^{m-1}\), and then pick \(a_m\) uniformly from \(\mathbb {F}_q\). Let

\[ f^{(a_1,\ldots ,a_{m-1})}(X_m) = f_0(a_1,\ldots ,a_{m-1}) X_m^0 + \cdots + f_t(a_1,\ldots ,a_{m-1}) X_m^t . \]

Consider the two events

\[ \mathcal{E}_1 = \{ (a_1,\ldots ,a_m) \mid f_t(a_1,\ldots ,a_{m-1}) = 0\} , \]

\[ \mathcal{E}_2 = \{ (a_1,\ldots ,a_m) \mid f_t(a_1,\ldots ,a_{m-1}) \neq 0 \text{ and } f^{(a_1,\ldots ,a_{m-1})}(a_m) = 0\} . \]

By the inductive hypothesis applied to \(f_t\) (which is non-zero of degree at most \(r-t\)),

\[ \Pr [\mathcal{E}_1] \leq \frac{r-t}{q}. \]

For every \((a_1,\ldots ,a_{m-1}) \in \mathbb {F}_q^{m-1}\) such that \(f_t(a_1,\ldots ,a_{m-1}) \neq 0\), the univariate polynomial \(f^{(a_1,\ldots ,a_{m-1})}(X_m)\) is non-zero of degree at most \(t\), and so by the degree mantra (base case) it has at most \(t\) roots; hence, over the random choice of \(a_m\), \(\Pr [f^{(a_1,\ldots ,a_{m-1})}(a_m) = 0] \le t/q\) for every such \((a_1,\ldots ,a_{m-1})\), and therefore

\[ \Pr [\mathcal{E}_2] \leq \frac{t}{q}. \]

If neither \(\mathcal{E}_1\) nor \(\mathcal{E}_2\) occurs then \(f(\mathbf{a}) \neq 0\): indeed if \(f(\mathbf{a}) = 0\) then either \(f_t(a_1,\ldots ,a_{m-1}) = 0\) (giving \(\mathcal{E}_1\)) or \(f_t(a_1,\ldots ,a_{m-1}) \neq 0\) and \(f^{(a_1,\ldots ,a_{m-1})}(a_m) = 0\) (giving \(\mathcal{E}_2\)). Hence \(\Pr _{\mathbf{a}}[f(\mathbf{a}) = 0] \leq \Pr [\mathcal{E}_1 \cup \mathcal{E}_2]\), and by the union bound (Lemma 184),

\[ \Pr _{\mathbf{a}}[f(\mathbf{a}) = 0] \leq \Pr [\mathcal{E}_1] + \Pr [\mathcal{E}_2] \leq \frac{r-t}{q} + \frac{t}{q} = \frac{r}{q}. \qedhere \]

Lemma 187 Polynomial distance (binary case), Lemma 9.3.1

Let \(f\) be a non-zero polynomial from \(\mathbb {F}_2[X_1,\ldots ,X_m]\) with \(\deg _{X_i}(f) \leq 1\) for every \(i \in [m]\). Then

\[ |\{ \mathbf{a} \in \mathbb {F}_2^m \mid f(\mathbf{a}) \neq 0\} | \geq 2^{m - \deg (f)}. \]

Proof ▶

This is a special case of the general Polynomial Distance Lemma (Lemma 190) with \(q=2\). Write \(r = \deg (f)\) and let \(s,t\) be the unique non-negative integers with \(t \le q-2 = 0\) and \(s(q-1)+t = r\); since \(q-1=1\) this forces \(t=0\) and \(s=r\). The hypothesis \(\deg _{X_i}(f) \le 1 = q-1\) for all \(i\) matches the hypothesis of Lemma 190, which then gives

\[ |\{ \mathbf{a} \in \mathbb {F}_2^m \mid f(\mathbf{a}) \neq 0\} | \geq (q-t)\cdot q^{m-s-1} = (2-0)\cdot 2^{m - r - 1} = 2^{m-r} = 2^{m-\deg (f)}, \]

as claimed.

Proposition 188 Dimension, binary case, Prop 9.3.2

For any \(r \leq m\), the dimension of the Reed-Muller code \(\mathrm{RM}(2,m,r)\) is exactly \(\sum _{i=0}^{r} \binom {m}{i}\).

Proof ▶

Over \(\mathbb {F}_2\) the individual-degree constraint \(\deg _{X_i} \le q-1 = 1\) means every variable appears with exponent \(0\) or \(1\) in a reduced monomial, so the monomials of individual degree at most \(1\) are in bijection with subsets \(S \subseteq [m]\) (via \(S \mapsto \prod _{i\in S} X_i\)), and the total degree of the monomial corresponding to \(S\) is \(|S|\). Hence the dimension of \(\mathrm{RM}(2,m,r)\), which is the number of such monomials of total degree at most \(r\), equals the number of subsets of \([m]\) of size at most \(r\), namely \(\sum _{i=0}^{r}\binom {m}{i}\).

Theorem 189 Reed-Muller codes over the binary field, Thm 9.3.3

For every \(r \leq m\), the Reed-Muller code \(\mathrm{RM}(2,m,r)\) is a code of block length \(2^m\), dimension \(\sum _{i=0}^{r}\binom {m}{i}\) and distance \(2^{m-r}\).

Proof ▶

The block length \(2^m\) and dimension \(\sum _{i=0}^r \binom {m}{i}\) are immediate from the definition of \(\mathrm{RM}(2,m,r)\) (Definition 182) and Proposition 188. Since \(\mathrm{RM}(2,m,r)\) is linear, its minimum distance equals the minimum weight of a non-zero codeword, i.e. the minimum, over non-zero \(f \in \mathbb {F}_2[X_1,\ldots ,X_m]\) of individual degree at most \(1\) and total degree at most \(r\), of \(|\{ \mathbf{a} \in \mathbb {F}_2^m \mid f(\mathbf{a}) \ne 0\} |\). Every such \(f\) has \(\deg (f) \leq r\), and hence \(\deg _{X_i}(f) \le 1\) for all \(i\), so Lemma 187 applies to give \(|\{ \mathbf{a} \mid f(\mathbf{a}) \ne 0\} | \geq 2^{m - \deg (f)} \geq 2^{m-r}\). Thus the distance is at least \(2^{m-r}\); taking \(f = X_1 X_2 \cdots X_r\) (which has \(\deg (f) = r\), individual degree \(1\) in each variable, and by Lemma 187 weight exactly \(2^{m-r}\) since the bound is tight for this polynomial) shows the distance equals exactly \(2^{m-r}\).

Lemma 190 Polynomial distance (general case), Lemma 9.4.1

Let \(f\) be a non-zero polynomial from \(\mathbb {F}_q[X_1,\ldots ,X_m]\) with \(\deg _{X_i}(f) \leq q-1\) for every \(i \in [m]\) and \(\deg (f) \leq r\). Furthermore, let \(s,t\) be the unique non-negative integers such that \(t \leq q-2\) and

\[ s(q-1) + t = r. \]

Then

\[ |\{ \mathbf{a} \in \mathbb {F}_q^m \mid f(\mathbf{a}) \neq 0\} | \geq (q-t)\cdot q^{m-s-1} \geq q^{m - \frac{r}{q-1}}. \]

Hence \(\mathrm{RM}(q,m,r)\) has distance at least \(q^{m - \frac{r}{q-1}}\).

Proof ▶

We prove the first inequality by induction on \(m\).

Base case \(m=1\). By the degree mantra (Lemma 183), the probability that \(f(a_1) \neq 0\) is at least \(\frac{q - \deg (f)}{q} \geq \frac{q-r}{q}\). If \(r {\lt} q-1\) we have \(s=0\) and \(t=r\), and so the target expression \((q-t)\cdot q^{-1} = \frac{q-r}{q} \leq \Pr [f(a_1) \neq 0]\). If \(r = q-1\) we have \(s=1\) and \(t=0\), and \((q-t)\cdot q^{-2} = q\cdot q^{-2} = \frac{q-(q-1)}{q} \le \Pr [f(a_1) \ne 0]\), again by the degree mantra.

Inductive step. Assume the hypothesis holds for \((m-1)\)-variate polynomials. Write \(f = \sum _{i=0}^{b} f_i X_m^i\) where \(f_i \in \mathbb {F}_q[X_1,\ldots ,X_{m-1}]\), \(f_b \neq 0\), \(0 \le b \le q-1\), and \(\deg (f_b) \leq r-b\). Let

\[ \mathcal{E} = \{ (a_1,\ldots ,a_m) \mid f(a_1,\ldots ,a_m) \neq 0\} , \qquad \mathcal{E}_1 = \{ (a_1,\ldots ,a_{m-1}) \mid f_b(a_1,\ldots ,a_{m-1}) \neq 0\} . \]

Bounding \(\Pr [\mathcal{E} \mid \mathcal{E}_1]\). Fix \(a_1,\ldots ,a_{m-1}\) with \(f_b(a_1,\ldots ,a_{m-1}) \neq 0\) and let \(P(Z) = \sum _{i=0}^{b} f_i(a_1,\ldots ,a_{m-1}) Z^i\). Then \(P\) is a non-zero univariate polynomial of degree \(b\), so by the degree mantra it has at most \(b\) roots and \(\Pr _{a_m}[P(a_m) \neq 0] \geq \frac{q-b}{q}\). Since \(\Pr [f(a_1,\ldots ,a_m) = 0 \mid a_1,\ldots ,a_{m-1}] = \Pr _{a_m}[P(a_m) = 0]\), we get

\[ \Pr [\mathcal{E} \mid \mathcal{E}_1] \geq 1 - \frac{b}{q} = \frac{q-b}{q}. \]

Bounding \(\Pr [\mathcal{E}_1]\). Write \(r - b = s'(q-1) + t'\) with \(s', t' \geq 0\) and \(t' \leq q-2\). Since \(\deg (f_b) \leq r-b\) and \(\deg _{X_i}(f_b) \le q-1\), the inductive hypothesis applied to \(f_b\) gives

\[ \Pr [\mathcal{E}_1] \geq (q-t')\cdot q^{-(s'+1)}. \]

Combining. Since \(\Pr [\mathcal{E}] \geq \Pr [\mathcal{E} \text{ and } \mathcal{E}_1] = \Pr [\mathcal{E}_1]\cdot \Pr [\mathcal{E}\mid \mathcal{E}_1]\), we get

\[ \Pr [\mathcal{E}] \geq \frac{q-b}{q}\cdot (q-t')\cdot q^{-(s'+1)}. \]

By Claim 191 (applied with \(q,r,s,t,s',t',b\) satisfying \(r = s(q-1)+t\), \(r-b = s'(q-1)+t'\), \(t,t' \le q-2\), \(b \le q-1\)), this quantity is at least \((q-t)\cdot q^{-(s+1)}\), which establishes the first inequality \(|\{ \mathbf{a} \mid f(\mathbf{a}) \ne 0\} | \ge (q-t) \cdot q^{m-s-1}\) by induction.

Finally the second inequality \((q-t)\cdot q^{-(s+1)} \geq q^{-r/(q-1)}\) (equivalently, after multiplying by \(q^m\), \((q-t) \cdot q^{m-s-1} \ge q^{m - r/(q-1)}\)) is exactly Claim 192.

Lemma 191 Claim 9.4.2

If \(q,r,s,t,s',t',b\) are non-negative integers such that \(r = s(q-1)+t\), \(r-b = s'(q-1)+t'\), \(t,t' \leq q-2\) and \(b \leq q-1\), then

\[ \frac{q-b}{q}\cdot (q-t')\cdot q^{-(s'+1)} \geq (q-t)\cdot q^{-(s+1)}. \]

Proof ▶

Since \(s\) (resp. \(s'\)) is the quotient when dividing \(r\) (resp. \(r-b\)) by \(q-1\), and \(0 \le b \le q-1\), we have either \(s'=s\) or \(s'=s-1\).

Case \(s=s'\). Then \(t = t'+b\), and it suffices to show \(\frac{q-b}{q}\cdot (q-t') \geq q - (t'+b)\), i.e. \((q-b)(q-t') \geq q(q-(t'+b))\). This holds since

\[ (q-b)(q-t') = q^2 - (b+t')q + bt' = q(q-(b+t')) + bt' \geq q(q-(b+t')), \]

using \(bt' \geq 0\).

Case \(s = s'+1\). Then \(t + (q-1) = t' + b\), and it suffices to show

\[ \frac{q-b}{q}\cdot (q-t')\cdot q \geq (q-t)\cdot q = 2q\cdot q - (t'+b+1)\cdot q , \]

i.e. (dividing by \(q\), and writing \(\alpha = q-b \ge 1\), \(\beta = q - t' \ge 1\), using \(b,t' \le q-1\)) that \(\alpha \beta \geq \alpha +\beta - 1\). This holds since \(\alpha \beta = \alpha + \alpha (\beta -1) \geq \alpha + (\beta - 1)\) using \(\alpha \geq 1\) and \(\beta - 1 \geq 0\).

In both cases the inequality holds, proving the claim.

Lemma 192 Claim 9.4.3

Let \(q,r,s,t\) be non-negative real numbers such that \(q \geq 2\), \(r = s(q-1)+t\) and \(t \leq q-2\). Then

\[ (q-t)\cdot q^{-(s+1)} \geq q^{-r/(q-1)}. \]

Proof ▶

Substituting \(r = s(q-1)+t\), it suffices to show \((q-t)\cdot q^{-(s+1)} \geq q^{-(s(q-1)+t)/(q-1)} = q^{-s}\cdot q^{-t/(q-1)}\). Dividing both (non-negative) sides by \(q^{-s}\), it suffices to show

\[ \frac{q-t}{q} \geq q^{-t/(q-1)}. \]

Let \(f_q(t) = \frac{t}{q} + q^{-t/(q-1)} - 1\); we must show \(f_q(t) \leq 0\) for \(0 \le t \le q-2\). We have

\[ f_q'(t) = \frac{1}{q} - \frac{\ln q}{q-1}\, q^{-t/(q-1)}, \qquad f_q''(t) = \left(\frac{\ln q}{q-1}\right)^2 q^{-t/(q-1)} {\gt} 0, \]

so \(f_q\) is convex on \([0,q-2]\) and is therefore maximized at one of the two endpoints of the interval. We have \(f_q(0) = 0 \leq 0\), so it suffices to show \(f_q(q-2) \leq 0\), i.e.

\[ q^{-(q-2)/(q-1)} \leq \frac{2}{q}. \]

Multiplying both sides by \(q\), it suffices to show \(q^{1/(q-1)} \leq 2\), equivalently \(q \leq 2^{q-1}\) for every \(q \geq 2\). This follows from Bernoulli’s inequality \(1+kx \leq (1+x)^k\) (valid for \(x \geq -1\), \(k \geq 1\)) applied with \(x=1\), \(k = q-1\), which gives \(q = 1 + (q-1)\cdot 1 \leq 2^{q-1}\).

Definition 193 The counting set \(S_{q,m,r}\) and \(K_{q,m,r}\)

For integers \(q,m,r\) let

\[ S_{q,m,r} = \left\{ \mathbf{d} = (d_1,\ldots ,d_m) \in \mathbb {Z}^m \mid 0 \leq d_i \leq q-1 \text{ for all } i \in [m], \sum _{i=1}^m d_i \leq r \right\} \]

and let \(K_{q,m,r} = |S_{q,m,r}|\).

Lemma 194 Injectivity of the evaluation map on individual-degree-bounded polynomials

Let \(P \in \mathbb {F}_q[X_1,\ldots ,X_m]\) be such that \(\deg _{X_i}(P) \leq q-1\) for every \(i \in [m]\), and suppose \(P(\mathbf{a}) = 0\) for every \(\mathbf{a} \in \mathbb {F}_q^m\). Then \(P = 0\) as a polynomial. Consequently, for every \(\mathbf{d} \in S_{q,m,r}\) (Definition 193) the evaluations of the monomials \(\mathbf{X}^{\mathbf{d}}\) are linearly independent, and (together with the fact that they span \(\mathrm{RM}(q,m,r)\)) they form a basis for \(\mathrm{RM}(q,m,r)\).

Proof ▶

We prove \(P=0\) by induction on \(m\).

Base case \(m=1\). \(P\) has degree at most \(q-1\) and \(q\) roots (all of \(\mathbb {F}_q\)). By the degree mantra (Lemma 183), a non-zero polynomial of degree at most \(q-1\) has at most \(q-1\) roots; since \(P\) has \(q {\gt} q-1\) roots, \(P\) must be the zero polynomial.

Inductive step. Write \(P = \sum _{i=0}^{q-1} P_i(X_1,\ldots ,X_{m-1}) X_m^i\) with each \(\deg _{X_j}(P_i) \le q-1\) for \(j {\lt} m\). Fix any \((a_1,\ldots ,a_{m-1}) \in \mathbb {F}_q^{m-1}\). The univariate polynomial \(Q(X_m) = \sum _{i=0}^{q-1} P_i(a_1,\ldots ,a_{m-1}) X_m^i\) has degree at most \(q-1\) and satisfies \(Q(a_m) = P(a_1,\ldots ,a_m) = 0\) for every \(a_m \in \mathbb {F}_q\), i.e. \(Q\) has \(q\) roots; by the base-case argument \(Q = 0\), i.e. \(P_i(a_1,\ldots ,a_{m-1}) = 0\) for every \(i\). As \((a_1,\ldots ,a_{m-1})\) was arbitrary, each \(P_i\) vanishes on all of \(\mathbb {F}_q^{m-1}\), and by the inductive hypothesis (applied to each \(P_i\), which has individual degrees at most \(q-1\) in \(X_1,\ldots ,X_{m-1}\)) \(P_i = 0\) for every \(i\). Hence \(P=0\).

For the consequence: any \(\mathbb {F}_q\)-linear combination \(\sum _{\mathbf{d} \in S_{q,m,r}} c_{\mathbf{d}} \mathbf{X}^{\mathbf{d}}\) that evaluates to zero at every point of \(\mathbb {F}_q^m\) is, by the above, the zero polynomial, i.e. all \(c_{\mathbf{d}} = 0\); this is exactly linear independence of the evaluation vectors of the monomials \(\mathbf{X}^{\mathbf d}\), \(\mathbf d \in S_{q,m,r}\). These evaluations also span \(\mathrm{RM}(q,m,r)\), since by definition every \(f \in \mathrm{RM}(q,m,r)\) is the evaluation of some polynomial of individual degree at most \(q-1\) in each variable and total degree at most \(r\), which is an \(\mathbb {F}_q\)-linear combination of the monomials \(\mathbf{X}^{\mathbf{d}}\), \(\mathbf{d} \in S_{q,m,r}\).

Proposition 195 Dimension, general case, Prop 9.4.4

For every prime power \(q\) and integers \(m \geq 1\) and \(r \geq 0\), the dimension of the code \(\mathrm{RM}(q,m,r)\) is \(K_{q,m,r}\).

Proof ▶

By Lemma 194, the evaluations of \(\mathbf{X}^{\mathbf{d}}\) for \(\mathbf{d} \in S_{q,m,r}\) form a basis for \(\mathrm{RM}(q,m,r)\), so the dimension of \(\mathrm{RM}(q,m,r)\) equals \(|S_{q,m,r}| = K_{q,m,r}\).

Lemma 196 Monotonicity of \(K_{q,m,r}\)

For fixed \(m\), \(K_{q,m,r}\) is monotone non-decreasing in \(q\) and in \(r\).

Proof ▶

Monotone in \(r\). If \(r \le r'\) then every \(\mathbf{d} \in S_{q,m,r}\) satisfies \(\sum _i d_i \le r \le r'\) and \(0 \le d_i \le q-1\), so \(\mathbf{d} \in S_{q,m,r'}\); hence \(S_{q,m,r} \subseteq S_{q,m,r'}\) and \(K_{q,m,r} \le K_{q,m,r'}\).

Monotone in \(q\). If \(q \le q'\) then every \(\mathbf{d} \in S_{q,m,r}\) satisfies \(0 \le d_i \le q-1 \le q'-1\) for all \(i\) and \(\sum _i d_i \le r\), so \(\mathbf{d} \in S_{q',m,r}\); hence \(S_{q,m,r} \subseteq S_{q',m,r}\) and \(K_{q,m,r} \le K_{q',m,r}\).

Proposition 197 Bounds on the dimension, Prop 9.4.5

For integers \(q \geq 2\), \(m \geq 1\) and \(r \geq 0\), let

\[ K_{q,m,r}^{+} \triangleq \min \left\{ q^m, \binom {m+r}{r} \right\} \]

and let

\[ K_{q,m,r}^{-} \triangleq \begin{cases} \max \left\{ q^m/2,\ q^m - K_{q,m,(q-1)m-r}^{+} \right\} & \text{if } r \geq (q-1)m/2, \\[4pt] \max \left\{ \binom {m}{r},\ \tfrac {1}{2}\left( \left\lfloor \tfrac {2r+m}{m} \right\rfloor \right)^m \right\} & \text{if } r {\lt} (q-1)m/2. \end{cases} \]

Then there are universal constants \(c_1, c_2\) (\(c_1 {\lt} 3.1\) and \(c_2 {\lt} 8.2\) suffice) such that

\[ K_{q,m,r}^{-} \leq K_{q,m,r} \leq K_{q,m,r}^{+} \leq c_1 \cdot \left(K_{q,m,r}^{-}\right)^{c_2}. \]

Proof ▶

We prove the three inequalities \(K_{q,m,r} \le K_{q,m,r}^+\), \(K_{q,m,r}^- \le K_{q,m,r}\), and \(K_{q,m,r}^+ \le c_1 (K_{q,m,r}^-)^{c_2}\) in turn, using monotonicity of \(K_{q,m,r}\) (Lemma 196) throughout.

Upper bound \(K_{q,m,r} \leq K_{q,m,r}^+\). On one hand, ignoring the total-degree restriction, \(K_{q,m,r} \leq K_{q,m,(q-1)m} = q^m\). On the other hand, ignoring the individual-degree restriction, \(K_{q,m,r} \leq K_{r+1,m,r} = \binom {m+r}{r}\) (this is exactly the count in Proposition 185, since with \(q' = r+1 {\gt} r\) the individual-degree constraint is vacuous). Taking the minimum gives \(K_{q,m,r} \le K_{q,m,r}^+\).

Lower bound \(K_{q,m,r}^- \leq K_{q,m,r}\), case \(r \geq (q-1)m/2\). Consider the map \(\mathbf{d} = (d_1,\ldots ,d_m) \mapsto \overline{\mathbf{d}} = (q-1-d_1,\ldots ,q-1-d_m)\) on vectors with \(0 \le d_i \le q-1\). This is a bijection on \(\{ 0,\ldots ,q-1\} ^m\) that sends \(\sum _i d_i {\gt} r\) to \(\sum _i \overline{d_i} {\lt} (q-1)m - r\). Hence exactly one of \(\mathbf{d} \in S_{q,m,r}\) or \(\overline{\mathbf{d}} \in S_{q,m,(q-1)m-r}\) holds for each \(\mathbf{d} \in \{ 0,\ldots ,q-1\} ^m\), giving

\[ K_{q,m,r} = q^m - K_{q,m,(q-1)m-r}. \]

Since \(r \geq (q-1)m/2\) we have \((q-1)m - r \leq r\), so by monotonicity in \(r\) (Lemma 196), \(K_{q,m,(q-1)m-r} \leq K_{q,m,r}\), whence, combined with the identity above, \(2K_{q,m,r} \geq K_{q,m,r} + K_{q,m,(q-1)m-r} = q^m\), i.e. \(K_{q,m,r} \geq q^m/2\). Also, using \(K_{q,m,(q-1)m-r} \le K^+_{q,m,(q-1)m-r}\) from the upper bound proved above, \(K_{q,m,r} = q^m - K_{q,m,(q-1)m-r} \geq q^m - K^+_{q,m,(q-1)m-r}\). Together, \(K_{q,m,r} \geq K_{q,m,r}^-\) in this case.

Lower bound \(K_{q,m,r}^- \leq K_{q,m,r}\), case \(r {\lt} (q-1)m/2\). Let \(q' = \lfloor \frac{2r+m}{m} \rfloor \). Then \(r \geq (q'-1)m/2\), so by the previous case (applied with \(q'\) in place of \(q\)) \(K_{q',m,r} \geq (q')^m/2\); by monotonicity in \(q\) (Lemma 196), \(K_{q,m,r} \geq K_{q',m,r} \geq (q')^m/2 = \frac{1}{2} \left(\lfloor \frac{2r+m}{m}\rfloor \right)^m\). Also, using Proposition 188, \(K_{q,m,r} \geq K_{2,m,r} = \sum _{i=0}^r \binom {m}{i} \geq \binom {m}{r}\) (again using monotonicity in \(q\)). Taking the maximum of the two lower bounds gives \(K_{q,m,r} \geq K_{q,m,r}^-\) in this case.

Upper bound \(K_{q,m,r}^+ \leq c_1 (K_{q,m,r}^-)^{c_2}\). We split into three ranges.

Range \(r \geq (q-1)m/2\). Here \(\frac{q^m}{2} \le K_{q,m,r}^- \le K_{q,m,r}^+ \le q^m\), so \(K_{q,m,r}^+ \leq 2K_{q,m,r}^-\).

Range \(r {\lt} m/2\). Here \(K_{q,m,r}^- \geq \binom {m}{r} \geq (m/r)^r \geq 2^r\). Also, by the standard bound \(\binom {n}{k} \leq (en/k)^k\),

\[ \binom {m+r}{r} \leq \left( \frac{e(m+r)}{r} \right)^r \leq \left( e \cdot \tfrac {3}{2}\cdot \tfrac {m}{r} \right)^r = \left(\tfrac {3e}{2}\right)^r \cdot \left(\tfrac {m}{r}\right)^r, \]

using \(m + r {\lt} \frac{3}{2}m\) in this range. From \(2^r \le K_{q,m,r}^-\) we get \(\left( \tfrac {3e}{2}\right)^r \le (K_{q,m,r}^-)^{\log _2(3e/2)}\), and combining with \((m/r)^r \le K_{q,m,r}^-\) and \(K_{q,m,r}^+ \le \binom {m+r}{r}\),

\[ K_{q,m,r}^+ \leq \left(\tfrac {3e}{2}\right)^r \cdot \left(\tfrac {m}{r}\right)^r \leq (K_{q,m,r}^-)^{1+\log _2(3e/2)}. \]

Range \(m/2 \le r {\lt} (q-1)m/2\). Here \(\left\lfloor \tfrac {2r+m}{m} \right\rfloor = 1 + \left\lfloor \tfrac {2r}{m} \right\rfloor \geq 1 + \tfrac {r}{m} = \tfrac {m+r}{m}\), so

\[ K_{q,m,r}^- \geq \frac{1}{2}\left( \left\lfloor \tfrac {2r+m}{m} \right\rfloor \right)^m \geq \frac{1}{2}\left(\tfrac {m+r}{m}\right)^m \geq \frac{1}{2}\left(\tfrac {3}{2}\right)^m. \]

On the other hand, using \(\binom {n}{k}\le (en/k)^k\) again,

\[ K_{q,m,r}^+ \leq \binom {m+r}{m} \leq \left(\tfrac {e(m+r)}{m}\right)^m = e^m \cdot \left(\tfrac {m+r}{m}\right)^m. \]

Since \(\left(\tfrac {m+r}{m}\right)^m \leq 2K_{q,m,r}^-\) and, from \(K_{q,m,r}^- \geq \tfrac 12 (3/2)^m\) (i.e. \(2K^-_{q,m,r} \ge (3/2)^m\)), we get \(e^m \le \left(2K^-_{q,m,r}\right)^{\log _{3/2} e} \leq (2K_{q,m,r}^-)^{\log _2(3e/2)}\) after adjusting the exponent constant as in the previous range, we get \(K_{q,m,r}^+ \leq (2K_{q,m,r}^-)^{1+\log _2(3e/2)}\).

Combining the three ranges, in all cases \(K_{q,m,r}^+ \leq c_1 (K_{q,m,r}^-)^{c_2}\) holds for \(c_2 = 1+\log _2(3e/2) {\lt} 3.1\) and \(c_1 = 2^{c_2} {\lt} 8.2\).

Example 198 RM codes of constant alphabet size and (relative) distance, Example 9.4.6

Fix \(q\) and \(r {\lt} q-1\) and consider \(m \to \infty \). Then the Reed-Muller codes \(\mathrm{RM}(q,m,r)\) are \([N,K,D]_q\) codes with block length \(N = q^m\), distance \(D = \delta \cdot N\) for \(\delta = 1 - r/q\), with dimension

\[ K \geq \binom {m}{r} \geq \left(\frac{m}{r}\right)^r = \left(\frac{\log _q N}{r}\right)^r. \]

Proof ▶

By Lemma 186 (applicable since \(r {\lt} q-1 {\lt} q\)), the fraction of zeroes of a non-zero degree-\(\le r\) polynomial is at most \(r/q\), so the distance of \(\mathrm{RM}(q,m,r)\) is at least \(N(1-r/q) = \delta N\). By Proposition 197, in the regime \(r {\lt} (q-1)m/2\) (which holds once \(m\) is large enough, as \(r,q\) are fixed), \(K = K_{q,m,r} \geq K_{q,m,r}^- \geq \binom {m}{r} \geq (m/r)^r = (\log _q N / r)^r\), using \(N = q^m\).

Example 199 Binary RM codes of rate close to \(1\) with constant (absolute) distance, Example 9.4.7

Fix \(q=2\) and \(d\) and let \(m \to \infty \). Then the Reed-Muller codes \(\mathrm{RM}(2,m,m-d)\) are \([N,K,D]_2\) codes with \(N = 2^m\), \(D = 2^d\) and

\[ K \geq N - \binom {\log _2 N + d}{d} \geq N - (\log _2 N)^d. \]

The rate \(K/N \to 1\) as \(N \to \infty \).

Proof ▶

Setting \(r = m-d\) in Theorem 189, \(\mathrm{RM}(2,m,m-d)\) has block length \(N = 2^m\), distance \(D = 2^{m-r} = 2^d\), and dimension

\[ K = \sum _{i=0}^{m-d} \binom {m}{i} = 2^m - \sum _{i=m-d+1}^{m} \binom {m}{i} = N - \sum _{j=0}^{d-1} \binom {m}{j}. \]

Each term \(\binom {m}{j}\) for \(j {\lt} d\) is at most \(\binom {m}{d-1} \le \binom {m+d}{d}\) (a crude bound sufficient here), and summing \(d\) such terms gives \(\sum _{j=0}^{d-1}\binom {m}{j} \le \binom {m+d}{d}\) for \(d,m\) in the relevant range; hence \(K \geq N - \binom {m+d}{d}\). Substituting \(m = \log _2 N\) gives \(K \geq N - \binom {\log _2 N + d}{d}\), and using \(\binom {n}{d} \le n^d\) gives \(K \geq N - (\log _2 N + d)^d \geq N - (\log _2 N)^d \cdot O(1)\), which for fixed \(d\) and \(N \to \infty \) still gives \(K/N \to 1\) (a precise constant for the bound \(K \ge N - (\log _2 N)^d\) is worked out in Exercise 9.12). As \(N \to \infty \) with \(d\) fixed, \((\log _2 N)^d / N \to 0\), so \(K/N \to 1\).

Example 200 RM codes of constant rate and relative distance over polynomially small alphabets, Example 9.4.8

Given any \(\varepsilon {\gt} 0\), let \(m = \lceil 1/\varepsilon \rceil \) and consider \(q \to \infty \) with \(r = q/2\). Then the Reed-Muller codes \(\mathrm{RM}(q,m,r)\) are \([N,K,D]_q\) codes with \(N = q^m\), \(D = N/2\) and

\[ K \geq \frac{1}{2}\left(\frac{q+m}{m}\right)^m \geq \frac{1}{2m^m}\cdot N. \]

Expressed in terms of \(N\) and \(\varepsilon \), the codes have length \(N\), dimension \(\Omega \left(\varepsilon ^{1/\varepsilon }\right) \cdot N\) and relative distance \(1/2\) over an alphabet of size \(N^{\varepsilon }\).

Proof ▶

Here \(r = q/2 \geq (q-1)m/2\) once \(m \geq 1\) and \(q\) is large, so Proposition 197 gives (using the case \(r \geq (q-1)m/2\), together with monotonicity to compare against the individual-degree-unrestricted count)

\[ K = K_{q,m,r} \geq K_{q,m,r}^- \geq \max \left\{ q^m/2,\ q^m - K^+_{q,m,(q-1)m-r} \right\} \geq \frac{1}{2}\left(\frac{q+m}{m}\right)^m, \]

the last step following the same style of bound as in the range analysis of Proposition 197, giving \(K \geq \tfrac 12 \left(\tfrac {q+m}{m}\right)^m \geq \tfrac {1}{2m^m}\cdot q^m = \tfrac {1}{2m^m}\cdot N\). By Lemma 190, the distance is at least \(q^{m - r/(q-1)}\); as \(q \to \infty \) with \(m\) fixed and \(r=q/2\), \(r/(q-1) \to m/2\), so the distance is \(q^{m/2 + o(1)} = N^{1/2+o(1)}\), and for \(q\) large enough this is at least \(N/2\) as claimed in the book. Writing \(N^\varepsilon \approx q\) (since \(m \approx 1/\varepsilon \) and \(N = q^m\)), the dimension is \(\Omega (\varepsilon ^{1/\varepsilon }) \cdot N\) and the alphabet has size \(N^\varepsilon \).

Example 201 RM codes over polylogarithmic alphabets with polynomial dimension, Example 9.4.9

Given \(0 {\lt} \varepsilon {\lt} 1\), let \(q \to \infty \) and let \(r = q/2\) and \(m = q^{\varepsilon }\). Then the Reed-Muller codes \(\mathrm{RM}(q,m,r)\) are \([N,K,D]_q\) codes with \(N=q^m\), \(D = N/2\) and

\[ K \geq \frac{1}{2}\left(\frac{q+m}{m}\right)^m \geq \frac{1}{2}\cdot N^{1-\varepsilon }. \]

Expressed in terms of \(N\) and \(\varepsilon \), the codes have length \(N\), dimension \(\Omega \left(N^{1-\varepsilon }\right)\) and relative distance \(1/2\) over an alphabet of size \((\log N)^{1/\varepsilon }\).

Proof ▶

As in Example 200, distance \(N/2\) and the dimension bound \(K \geq \tfrac 12 \left(\tfrac {q+m}{m}\right)^m\) follow from Lemma 190 and Proposition 197. Since \(m = q^\varepsilon = o(q)\), \(\tfrac {q+m}{m} \geq \tfrac {q}{m} = q^{1-\varepsilon }\), giving \(K \geq \tfrac 12 (q^{1-\varepsilon })^m = \tfrac 12 (q^m)^{1-\varepsilon } = \tfrac 12 N^{1-\varepsilon }\). Since \(N = q^m = q^{q^\varepsilon }\), we have \(\log N \approx q^\varepsilon \log q\), so \(q \approx (\log N)^{1/\varepsilon }\) up to lower-order terms, giving an alphabet of size \((\log N)^{1/\varepsilon }\).