When Do Low-Rate Concatenated Codes Approach the Gilbert–Varshamov Bound?

6 A High Min-Entropy Sufficient Condition

Definition 52 Empirical symbol distribution

For a word \(c \in \mathbb {F}_q^n\), the empirical distribution \(\mathcal{D}_c\) on \(\mathbb {F}_q\) is given by \(\Pr _{\mathcal{D}_c}[\sigma ] = \Pr _{\alpha \sim [n]}[c_\alpha = \sigma ]\), the fraction of coordinates of \(c\) equal to \(\sigma \).

Definition 53 Min-entropy

For \(c \in \mathbb {F}_q^n\), the min-entropy of \(\mathcal{D}_c\) is \(H_\infty (c) \triangleq -\log _2 \max _\sigma \Pr _{\mathcal{D}_c}[\sigma ]\), so that \(H_\infty (c) \leq \log _2 q\).

Definition 54 Smoothed min-entropy

For a smoothness parameter \(\eta {\gt} 0\), the smoothed min-entropy is

\[ H_\infty ^\eta (\mathcal{D}_c) \triangleq \max _{\mathcal{D}_{c'} : \Delta _{\mathrm{TV}}(\mathcal{D}_c,\mathcal{D}_{c'})\leq \eta } H_\infty (\mathcal{D}_{c'}), \]

where the maximum is over distributions within total variation distance \(\eta \).

For any \(n \in \mathbb {N}\), \(\frac{8}{\sqrt n}\leq \gamma \leq \frac13\), integers \(\frac{24\log n}{\gamma }\leq k\leq \frac{n}{5}\), and \(2^{2k/3}\leq T\leq 2^{(1-\gamma )k}\), a random linear code \(\mathcal{C}\subseteq \mathbb {F}_2^n\) of dimension \(k\) satisfies the following with probability \(1 - 2^{-\Omega (\gamma k + \gamma ^2 n + \sqrt n)}\). For \(0 \leq j \leq n\) let \(\Delta _j\) be the number of codewords of weight \(j\), and let \(j^\star \) be the minimal \(j\) with \(\sum _{i=0}^j \Delta _i \geq T\). Then:

  1. \(j^\star \geq \alpha n\) for some \(\alpha \geq h_2^{-1}\! \big(1 - 2\cdot \frac{k-\log T}{n}\big)\);

  2. \(\dfrac {\sum _{i=0}^{j^\star } i\, \Delta _i}{\sum _{i=0}^{j^\star }\Delta _i} \geq (1 - 2\gamma )\, j^\star \);

  3. \(\dfrac {\Delta _{j^\star +1}}{\sum _{i=0}^{j^\star }\Delta _i}\leq 2^{\sqrt n}\).

Proof

As in Theorem 43 we work with codes whose weight distribution deviates only slightly from that of a uniformly random code, but here we need both an upper and a lower bound on the \(\Delta _i\).

Write \(p = 2^{-(n-k)}\) and \(p' = 2^{-(n-k)}(1 - 2^{-k})\). For \(1 \leq i \leq n\), \(\binom {n}{i}p' \leq \mathbb {E}_\mathcal{C}[\Delta _i] = \binom {n}{i}\frac{2^k-1}{2^n-1}\leq \binom {n}{i}p\), and \(\operatorname {Var}_\mathcal{C}[\Delta _i]\leq \binom {n}{i}p\) by the negative correlation of distinct nonzero vectors (Lemma 19). Fix \(\tau {\gt} 0\). Markov (Lemma 23) gives \(\Pr [\Delta _i \geq 2^{\tau n}\binom {n}{i}p]\leq 2^{-\tau n}\), and Chebyshev (Lemma 24) gives \(\Pr [\Delta _i \leq \binom {n}{i}p/2^{\tau n}]\leq \frac{1}{(1 - 2^{-\tau n} - 2^{-k})^2\binom {n}{i}p}\). Fixing \(1 \leq \ell \leq n/2\) and union bounding (Lemma 25), with probability at least \(1 - \frac{n}{2^{\tau n}} - \frac{n}{(1 - 2^{-\tau n} - 2^{-k})^2\binom {n}{\ell }p}\),

\begin{align} \Delta _i & \leq 2^{\tau n}\binom {n}{i}p \quad \text{for all } 1 \leq i \leq n, \tag {19}\\ \Delta _i & \geq \binom {n}{i}p/2^{\tau n}\quad \text{for all } \ell \leq i\leq n/2. \tag {20} \end{align}

Choosing \(\tau = \min \big\{ \frac18\gamma ^2(h_2^{-1}(1 - k/n))^2, \frac{k\gamma }{n} - \frac{\log n}{n}\big\} \) and \(\ell = \lfloor h_2^{-1}(1 - \frac23\frac{k}{n})n\rfloor \), the failure probability is \(2^{-\Omega (\gamma k + \gamma ^2 n + \sqrt n)}\), using \(h_2^{-1}(1-x)\geq \frac12 - 5x^2\) (Lemma 36) to lower bound \(h_2^{-1}(1 - k/n)\geq \frac14\) from \(k \leq n/5\), and the bounds \(\binom {n}{\ell }p \geq 2^{k/4}\) from \(k\) large.

Item 1. Equation (19) makes \(\mathcal{C}^\perp \) effectively \(\tau \)-nice, and by (19), to bound \(j^\star \) it suffices to solve \(\sum _{i=0}^j\binom {n}{i} 2^{-(n-k-\tau n)}\geq T\) for \(j\). Writing \(j = \alpha n\) and using \(\sum _{i\leq \alpha n}\binom {n}{i}\leq 2^{nh_2(\alpha )}\) (Lemma 32; sharpened to \(2^{h_2(\alpha )n - \frac12 \log n - 1}\)), the smallest such \(\alpha \) satisfies \(\alpha \geq h_2^{-1}\big(1 - \tau - \frac{k - \log T + \log n}{n}\big)\geq h_2^{-1}\big(1 - 2\frac{k - \log T}{n}\big)\), using \(\tau \leq \frac{k\gamma }{n} - \frac{\log n}{n}\leq \frac{k - \log T - \log n}{n}\).

Item 2. Let \(j' = \lceil \alpha (1 - \gamma )n\rceil \). By (19), \(A \triangleq \sum _{i=0}^{j'-1}\Delta _i \leq \sum _{i=0}^{j'-1}\binom {n}{i} 2^{-(n-k+\tau n)}\leq 2^{n(h_2(\alpha (1-\gamma )) + \tau + k/n - 1)}\). By Item 1 and \(T \geq 2^{2k/3}\), \(j^\star = \alpha n \geq h_2^{-1}(1 - 2\frac{k-\log T}{n}) n \geq h_2^{-1}(1 - \frac23\frac{k}{n})n \geq \ell \), so (20) yields \(\Delta _{j^\star }\geq \binom {n}{j^\star }p\, 2^{-\tau n}\), hence \(B \triangleq \sum _{i=j'+1}^{j^\star }\Delta _i \geq \Delta _{j^\star }\geq 2^{n(h_2(\alpha ) - \frac{\log (2n)}{2n} - 1 + k/n - \tau )}\) (Lemma 33). Therefore

\[ \frac{\sum _{i=0}^{j^\star } i\Delta _i}{\sum _{i=0}^{j^\star }\Delta _i} \geq \frac{j'\sum _{i=j'}^{j^\star }\Delta _i}{\sum _{i=0}^{j^\star }\Delta _i} = j'\frac{B}{A+B}\geq (1-\gamma )j^\star \frac{1}{1 + A/B}. \]

Now \(\frac{\log (A/B)}{n}\leq h_2(\alpha (1-\gamma )) - h_2(\alpha ) + 2\tau + \frac{\log (2n)}{2n}\leq h_2(\alpha (1-\gamma )) - h_2(\alpha ) + 4\tau \leq -(\alpha \gamma )^2 + 4\tau \leq -\frac12\gamma ^2(h_2^{-1}(1 - k/n))^2 \triangleq -\theta \), using \(h_2'\geq 0\), \(h_2''\leq -2\), Item 1, and the choice of \(\tau \). Hence \(\frac{\sum i\Delta _i}{\sum \Delta _i}\geq \frac{(1-\gamma )j^\star }{1 + 2^{-\theta n}}\geq (1-2\gamma )j^\star \), since \(2^{-\theta n}\leq \gamma \) from the lower bound on \(k\).

Item 3. For some \(\tau {\gt} 0\), (19) gives \(\Delta _{j^\star +1}\leq 2^{\tau n}\binom {n}{j^\star +1}p\) and (20) gives \(\Delta _{i}\geq 2^{-\tau n}\binom {n}{i}p\), with probability at least \(1 - \frac{1}{2^{\tau n}} - \frac{1}{(1 - 2^{-\tau n} - 2^{-k})^2\binom {n}{j^\star }p}\). Then

\[ \frac{\Delta _{j^\star +1}}{\sum _{i=0}^{j^\star +1}\Delta _i}\leq \frac{\Delta _{j^\star +1}}{\Delta _{j^\star }}\leq \frac{2^{\tau n}\binom {n}{j^\star +1}p}{2^{-\tau n}\binom {n}{j^\star }p} = 2^{2\tau n}\frac{n - j^\star }{j^\star + 1}\leq 2^{2\tau n + 2}, \]

using \(j^\star \geq \frac14 n\). Setting \(\tau = \frac{1}{4\sqrt n}\) makes this at most \(2^{\sqrt n}\), and the success probability is at least \(1 - 2^{-\frac14\sqrt n} - \frac14 2^{-k/4}\), using \(\binom {n}{j^\star } 2^{-n+k}\geq 2^{-k/4}\).

Fix any sufficiently small \(\varepsilon {\gt} 0\). For integers \(k,q \in \mathbb {N}\) with \(q = 2^{k_0}\), let \(n_0 = k_0/\varepsilon \) and \(n = k/\varepsilon \). Let \(\mathcal{C}_{\mathrm{out}}\subseteq \mathbb {F}_q^n\) be an \(\mathbb {F}_2\)-linear code of rate \(\varepsilon \), and let \(\mathcal{C}_{\mathrm{in}}\subseteq \mathbb {F}_2^{n_0}\) be a random linear code of rate \(\varepsilon \). Assume there exist constants \(\bar c_\gamma , \bar c_\eta \) such that for every nonzero \(c \in \mathcal{C}_{\mathrm{out}}\),

\[ H_\infty ^{\bar c_\eta \varepsilon }(c)\geq (1 - \bar c_\gamma \varepsilon )\log q, \]

and \(n_0 = \Omega _{\bar c_\gamma }\big(\frac{\log (1/\varepsilon )}{\varepsilon ^2}\big)\). Then with probability at least \(1 - 2^{-\Omega _{\bar c_\gamma }(\varepsilon ^2 n_0 + \sqrt{n_0})}\) over \(\mathcal{C}_{\mathrm{in}}\), the concatenated code \(\mathcal{C}= \mathcal{C}_{\mathrm{out}}\circ \mathcal{C}_{\mathrm{in}}\) has relative distance \(\frac12 - O_{\bar c_\gamma ,\bar c_\eta }(\varepsilon )\).

Proof

Let \(\gamma = \bar c_\gamma \varepsilon \) and \(\eta = \bar c_\eta \varepsilon \). Since \(\mathcal{C}\) is linear it suffices to lower bound the weight of every nonzero codeword. Fix nonzero \(c \in \mathcal{C}_{\mathrm{out}}\) and consider \(w = (\mathcal{C}_{\mathrm{in}}(c_1),\dots ,\mathcal{C}_{\mathrm{in}}(c_n))\). Order \(\mathbb {F}_q= \{ \sigma _0,\dots ,\sigma _{q-1}\} \) with \(\Pr _{\mathcal{D}_c}[\sigma _0]\geq \cdots \geq \Pr _{\mathcal{D}_c}[\sigma _{q-1}]\) and set \(p_t(c) = \Pr _{\mathcal{D}_c}[\sigma _t]\).

Worst-case inner map. It suffices to lower bound \(\operatorname {weight}(w)\) under a worst-case (not necessarily linear) assignment of symbols of \(\mathbb {F}_q\) to codewords of \(\mathcal{C}_{\mathrm{in}}\): the most frequent symbols are mapped to the lowest-weight codewords. For \(j \in \{ 0,\dots ,n_0\} \) let \(\Delta _j = |\{ x\in \mathcal{C}_{\mathrm{in}}: \operatorname {weight}(x) = j\} |\) and \(\ell _j = \Delta _0 + \cdots + \Delta _j\) (with \(\ell _{-1} = 0\)). Assigning the symbols \(\{ \sigma _{\ell _{j-1}},\dots ,\sigma _{\ell _j - 1}\} \) to the weight-\(j\) codewords, the weight of \(w\) satisfies \(\operatorname {weight}(w)\geq d(c)\), where with \(d_{\mathrm{in}}\) the minimum distance of \(\mathcal{C}_{\mathrm{in}}\),

\begin{equation} d(c) = n\sum _{j = d_{\mathrm{in}}}^{n_0}\Big(\Big(\sum _{t=\ell _{j-1}}^{\ell _j-1} p_t(c)\Big)\cdot j\Big). \tag {16} \end{equation}
3

Worst-case distribution. Minimising over symbol distributions \(\mathbf p = (p_0\geq \cdots \geq p_{q-1}\geq 0)\) obeying the smoothed entropy constraint \(H_\infty ^\eta (\mathbf p)\leq b\) with \(b = (1-\gamma )\log q\),

\begin{equation} d(c)\geq d \triangleq \min _{\mathbf p : H_\infty ^\eta (\mathbf p)\leq b} n\sum _{j=d_{\mathrm{in}}}^{n_0}\Big(\Big(\sum _{t=\ell _{j-1}}^{\ell _j-1}p_t\Big) \cdot j\Big). \tag {17} \end{equation}
4

The minimiser puts mass uniformly on the \(2^b\) lowest-weight symbols and shifts an \(\eta \)-fraction to the zero codeword’s symbol: \(p_t = \eta + 2^{-b}\) for \(t = 0\), \(p_t = 2^{-b}\) for \(1 \leq t \leq (1-\eta )2^b - 1\), and \(0\) otherwise. Set \(T = (1-\eta )2^b\), \(j^\star = \min \{ j \leq n_0 : \ell _j \geq T\} \), and define \(b'\) by \(\ell _{j^\star -1} = (1-\eta )2^{b'} \triangleq T'\). The worst case for \(b'\) is no better, and \(\frac{T}{T'}\leq 1 + \frac{\Delta _{j^\star }}{\ell _{j^\star -1}}\). Then

\begin{equation} d\geq n\sum _{j=d_{\mathrm{in}}}^{j^\star -1}\Big(\Big(\sum _{t=\ell _{j-1}}^{\ell _j-1} 2^{-b}\Big)j\Big) = \frac{n}{2^{b'}}\sum _{j=d_{\mathrm{in}}}^{j^\star -1} (\Delta _j\cdot j). \tag {18} \end{equation}
5

Applying Lemma 55. Apply Lemma 55 to \(\mathcal{C}_{\mathrm{in}}\) with this \(\gamma \) and \(T'\) (valid since \(\gamma \geq 8/\sqrt{n_0}\) and \(k_0 \geq 24\log n_0/\gamma \) by the lower bound on \(n_0\)). With probability \(1 - 2^{-\Omega (\gamma k_0 + \gamma ^2 n_0 + \sqrt{n_0})} = 1 - 2^{-\Omega _{\bar c_\gamma }(\varepsilon ^2 n_0 + \sqrt{n_0})}\) the conclusions hold. Item 1 gives \(j^\star - 1 = \alpha n_0\) with

\[ \alpha \geq h_2^{-1}\Big(1 - 2\tfrac {\log q - \log T’}{n_0}\Big) \geq h_2^{-1}\Big(1 - 2\varepsilon \big(\gamma + \tfrac {2\eta }{\log q}\big) - \varepsilon \Big) - \tfrac {2}{\sqrt{n_0}} \geq h_2^{-1}(1 - 4\varepsilon \gamma ) - \tfrac {2}{\sqrt{n_0}} \geq \tfrac 12 - \sqrt{\gamma \varepsilon \ln 2}, \]

using \(\log T - \log T'\leq \sqrt{n_0} + 1\) (Item 3), \(\log \frac{1}{1-\eta }\leq 2\eta \), \(n_0 \geq 4\varepsilon ^{-2}\), and \(h_2^{-1}(1 - x^2)\leq \frac12 - \frac{\sqrt{\ln 2}}{2}x\) (Lemma 35). Item 2 gives \(\sum _{j=0}^{j^\star -1}\Delta _j j\geq (1-2\gamma )(j^\star -1)\sum _{j=0}^{j^\star -1} \Delta _j\geq (1-2\gamma )\alpha n_0 T'\). Returning to (18),

\[ \frac{d}{n_0 n}\geq \frac{1}{n_0 2^{b'}}\sum _{j=0}^{j^\star -1}\Delta _j j \geq (1-2\gamma )\alpha \frac{T'}{2^{b'}} = (1-2\gamma )\alpha (1-\eta ), \]

using \(T' = (1-\eta )2^{b'}\). Since \(N = n_0 n\), the relative distance is at least

\[ \frac{d}{N}\geq (1-2\gamma )(1-\eta )\alpha \geq (1-2\gamma )(1-\eta )\Big(\tfrac 12 - \sqrt{\gamma \varepsilon \ln 2}\Big) \geq \tfrac 12 - \big(\sqrt{\bar c_\gamma \ln 2} + \bar c_\gamma + \tfrac 12\bar c_\eta \big)\varepsilon = \tfrac 12 - O_{\bar c_\gamma ,\bar c_\eta }(\varepsilon ), \]

as desired.

Proposition 57 Random outer codes are mildly smooth, Remark 9

Let \(\mathcal{C}_{\mathrm{out}}\) be a random linear code of dimension \(k = \varepsilon n\) over \(\mathbb {F}_q\) and let \(\gamma = \frac12\varepsilon \), \(q = O(n)\). With high probability the symbols of every nonzero codeword of \(\mathcal{C}_{\mathrm{out}}\) are not contained in any set of size smaller than \(q^{1-\gamma }\); equivalently every nonzero codeword has \(H_\infty (c)\geq (1-\gamma )\log _2 q\) on the support level required.

Proof

For a fixed nonzero codeword, the probability that all \(n\) of its symbols fall inside a fixed set \(S \subseteq \mathbb {F}_q\) of size \(q^{1-\gamma }\) is at most \((|S|/q)^n = q^{-\gamma n}\); summing over the \(\binom {q}{q^{1-\gamma }}\) choices of \(S\) gives \(\sum _{i=1}^{q^{1-\gamma }}\binom {q}{i}(i/q)^n\approx q^{-\gamma (n - q^{1-\gamma })}\). Taking a union bound over the \(q^k = 2^{\varepsilon n\log q}\) nonzero codewords, with \(\gamma = \frac12\varepsilon \) and \(q = O(n)\), no nonzero codeword violates the constraint with high probability.