Expander Codes

11 The expansion of random graphs (Appendix II)

Theorem 43 Theorem 25: upper bound on expansion

Let \(B\) be a bipartite graph between \(n\) \(c\)-regular vertices and \((c/d)n\) \(d\)-regular vertices. For all \(0{\lt}\alpha {\lt}1\), there exists a set of \(\alpha n\) \(c\)-regular vertices with at most

\[ n\, \frac{c}{d}\bigl(1-(1-\alpha )^d\bigr) + O(1) \]

neighbours.

Proof

Choose a set \(X\) of \(\alpha n\) \(c\)-regular vertices uniformly at random. Consider a fixed \(d\)-regular vertex; each of its \(d\) neighbours lies in \(X\) with probability \(\alpha \), so the probability it is not a neighbour of \(X\) is

\[ \prod _{i=0}^{d-1}\frac{n-\alpha n-i}{n-i}, \]

which tends to \((1-\alpha )^d\) as \(n\to \infty \). Hence the expected number of \(d\)-regular non-neighbours tends to \(\frac{c}{d}n(1-\alpha )^d\), so the expected number of neighbours is \(\frac{c}{d}n\bigl(1-(1-\alpha )^d\bigr)+O(1)\). Some choice of \(X\) achieves at most the expectation, giving the bound. The bound becomes tight as \(c\) grows.

Theorem 44 Theorem 26 (= Proposition 1): expansion of random graphs

Let \(B\) be a randomly chosen \((c,d)\)-regular bipartite graph between \(n\) variables and \((c/d)n\) constraints. Then for all \(0{\lt}\alpha {\lt}1\), with exponentially high probability all sets of \(\alpha n\) variables in \(B\) have at least

\[ n\! \left(\frac{c}{d}\bigl(1-(1-\alpha )^d\bigr) - \sqrt{\frac{2c\alpha \, H(\alpha )}{\log _2 e}}\right) \]

neighbours, where \(H(\cdot )\) is the binary entropy function.

Proof

Fix a set \(V\) of \(\alpha n\) variables. The probability that a given constraint is a neighbour of \(V\) is at least \(1-(1-\alpha )^d\), so the expected number of neighbours of \(V\) is at least \(n\frac{c}{d}\bigl(1-(1-\alpha )^d\bigr)\).

Each of the \(c\alpha n\) edges leaving \(V\) has its endpoint revealed one at a time. Let \(X_i\) be the expected size of the neighbour set of \(V\) given that the first \(i\) edges leaving \(V\) have been revealed. Then \(X_0,\dots ,X_{c\alpha n}\) is a martingale with \(|X_{i+1}-X_i|\le 1\), \(X_0\) is the expectation above, and \(X_{c\alpha n}\) is the actual number of neighbours. By Azuma’s inequality (Lemma 5),

\[ \Pr \bigl[\, E[X_{c\alpha n}] - X_{c\alpha n} {\gt} \lambda \sqrt{c\alpha n}\, \bigr] {\lt} e^{-\lambda ^2/2}. \]

There are only \(\binom {n}{\alpha n}\) choices of \(V\), so it suffices to choose \(\lambda \) with \(\binom {n}{\alpha n}e^{-\lambda ^2/2}\ll 1\). By Stirling’s formula \(\binom {n}{\alpha n}\le 2^{nH(\alpha )}\), so the condition holds for large \(n\) once \(\frac{nH(\alpha )}{\log _2 e}{\lt}\frac{\lambda ^2}{2}\), i.e. \(\lambda {\gt} \sqrt{2nH(\alpha )/\log _2 e}\). Substituting this \(\lambda \) into the deviation \(\lambda \sqrt{c\alpha n}\) and dividing by \(n\) gives the stated shortfall \(\sqrt{2c\alpha H(\alpha )/\log _2 e}\) below the expectation, uniformly over all \(V\) with exponentially high probability.