← All blueprints

Expander Graphs and Their Applications

13 Metric embedding

Definition 238 Metric space
#

A set \(X\) with \(d:X\times X\to \mathbb {R}^+\) symmetric, vanishing only on the diagonal, and satisfying the triangle inequality.

Definition 239 Distortion of an embedding

For \(f:X\to (\mathbb {R}^n,\ell _2)\), \(\mathrm{distortion}(f)= \mathrm{expansion}(f)\cdot \mathrm{contraction}(f)\), the product of the largest ratios \(\| f(x)-f(y)\| /d(x,y)\) and \(d(x,y)/\| f(x)-f(y)\| \).

Claim 240 A star metric is not isometrically \(\ell _2\)-embeddable

The \(4\)-point metric with one centre at distance \(1\) and the others at distance \(2\) has no isometric Euclidean embedding.

Proof

Each leaf–centre–leaf triple would be collinear, forcing all four points onto a line, contradicting the mutual distance \(2\) of the leaves.

Definition 241 Least \(\ell _2\) distortion

\(c_2(X,d)\) is the least distortion of an embedding of \((X,d)\) into Euclidean space (achievable with \(n\le |X|\)).

Theorem 242 13.1, Bourgain’s embedding

Every \(n\)-point metric embeds into Euclidean space with distortion \(O(\log n)\).

Definition 243 \(\ell _p\) metric

An \(\ell _2\) (\(\ell _1\)) metric is one isometric to a finite subset of \((\mathbb {R}^n,\ell _2)\) (resp. \(\ell _1\)).

Theorem 244 13.3, Johnson–Lindenstrauss

Any \(n\)-point \(\ell _2\) metric embeds into \(O(\log n/\epsilon ^2)\) dimensions with distortion \(\le 1+\epsilon \).

Theorem 245 13.4, \(c_2\) is polynomial-time computable (LLR)

\(c_2(X,d)\) is computable in polynomial time by semidefinite programming.

Proof

Scaling contraction to \(1\), \(\mathrm{distortion}\le \gamma \) iff there is a PSD Gram matrix \(Z\) with \(d(x_i,x_j)^2\le z_{ii}+z_{jj}-2z_{ij}\le \gamma ^2 d(x_i,x_j)^2\); this feasibility problem is solved by the ellipsoid method.

Claim 246 13.5, Self-duality of the PSD cone

\(Z\) is PSD iff \(\sum _{ij}q_{ij}z_{ij}\ge 0\) for every PSD \(Q\).

Proof

(\(\Leftarrow \)) take \(Q=vv^\top \) to get \(v^\top Zv\ge 0\). (\(\Rightarrow \)) every PSD \(Q\) is a sum of rank-one PSD matrices \(vv^\top \), on each of which the inequality holds.

Theorem 247 13.6, SDP-dual formula for \(c_2\) (LLR)

\(c_2(X,d)=\max _{P\in \mathrm{PSD},\, P\mathbf1=0} \sqrt{\frac{\sum _{p_{ij}{\gt}0}p_{ij}d(x_i,x_j)^2}{-\sum _{p_{ij}{\lt}0}p_{ij}d(x_i,x_j)^2}}\).

Proof

Dualizing the feasibility problem of Theorem 245: a contradicting nonnegative combination selects a PSD \(P\) with \(P\mathbf1=0\) (to cancel diagonal terms), yielding \(0\ge \sum _{p_{ij}{\gt}0}p_{ij}d^2+\gamma ^2 \sum _{p_{ij}{\lt}0}p_{ij}d^2\), contradictory exactly below the stated ratio.

Definition 248 \(c_2\) of a graph

\(c_2(G)=c_2(V(G),d_G)\) for the shortest-path metric.

Claim 249 Enflo: distortion of the cube

\(c_2(Q_r)=\sqrt r\).

Proof

The identity embedding gives distortion \(\sqrt r\). For the lower bound take \(P\) with \(P(x,x)=r-1\), \(P(x,y)=-1\) for edges, \(+1\) for antipodes; then \(P\mathbf1=0\), \(P\) is PSD, and Theorem 247 gives \(\sqrt{2^rr^2/(2^rr)}=\sqrt r\).

Lemma 250 Dirac’s theorem
#

A graph on \(n\) vertices with minimum degree \(\ge n/2\) has a Hamiltonian cycle.

Lemma 251 13.7, Far-distance perfect matching

For \(k\)-regular \(G\) of even order \(n\), the graph \(H\) joining vertices at \(G\)-distance \(\ge \lfloor \log _kn\rfloor \) has a perfect matching.

Proof

At most \(n/2\) vertices lie within distance \(\lfloor \log _kn\rfloor -1\), so \(H\) has minimum degree \(\ge n/2\); by Dirac’s theorem \(H\) has a Hamiltonian cycle, hence (even \(n\)) a perfect matching.

Theorem 252 13.8, Expanders have \(\Omega (\log n)\) distortion (LLR)

If \(G\) is an \((n,k)\)-graph with \(\lambda _2\le k-\epsilon \), then \(c_2(G)=\Omega (\log n)\).

Proof

Let \(B\) be a perfect matching of the far-graph \(H\) (Lemma 251) and \(P=kI-A_G+\tfrac \epsilon 2(B-I)\). Then \(P\mathbf1=0\), and for \(x\perp \mathbf1\), \(x^\top (kI-A_G)x\ge \epsilon \| x\| ^2\) while \(x^\top (B-I)x\ge -2\| x\| ^2\), so \(P\) is PSD. In Theorem 247, \(-\sum _{p_{ij}{\lt}0}d^2p_{ij}=kn\) and \(\sum _{p_{ij}{\gt}0}d^2p_{ij}\ge \tfrac \epsilon 2n\lfloor \log _kn\rfloor ^2\), giving \(c_2(G)=\Omega (\log n)\).

Theorem 253 13.9, Poincaré inequality for graphs

For \(k\)-regular \(G\) and any \(f:V\to \mathbb {R}^n\), \(\mathbb {E}_{u,v}\| f(u)-f(v)\| ^2\le \tfrac k{k-\lambda _2}\mathbb {E}_{(u,v)\in E}\| f(u)-f(v)\| ^2\).

Proof

Reduce to scalar zero-average \(f\) (both sides additive over coordinates and shift-invariant); the inequality is then the variational characterization of \(\lambda _2\).

Problem 254 Computing the expansion ratio

Computing \(h(G)\) is co-\(\mathbf{NP}\)-hard; up to a constant it reduces to the sparsest cut \(\min _S|E(S,\overline S)|/(|S|(n-|S|))\).

Definition 255 All-pairs multicommodity flow
#

Ship \(\delta \) units between every pair under unit edge capacities; \(\delta _{\max } (G)\) is the maximum feasible \(\delta \) (a linear program).

Definition 256 Cut cone and cut metrics

The cut cone is the convex cone of \(\ell _1\) metrics; its extreme rays are the cut metrics \(d_S(x,y)=\mathbf1[|\{ x,y\} \cap S|=1]\).

\(\delta _{\max }(G)\le \min _S\frac{|E(S,\overline S)|}{|S|(n-|S|)}\le O(\delta _{\max }(G)\log n)\).

Proof

The left inequality is capacity counting. By LP duality \(\delta _{\max }\) is the minimum of \(\sum _{(i,j)\in E}d_{ij}/\sum _{ij}d_{ij}\) over metrics; over \(\ell _1\) metrics the minimum is attained at a cut metric, giving the sparsest cut, and by Bourgain’s theorem restricting to \(\ell _1\) loses only \(O(\log n)\).

Definition 258 Negative-type metric

\((X,d)\) is of negative type if \(\sqrt d\) is an \(\ell _2\) metric; every \(\ell _1\) metric is of negative type.

Theorem 259 13.12, Negative-type into \(\ell _1\) (refuted conjecture)

Whether every negative-type metric embeds into \(\ell _1\) with bounded distortion (refuted by Khot–Vishnoi with an \((\log \log n)^c\) lower bound).