13 Metric embedding

Definition 238 Metric space

✓

A set \(X\) with \(d:X\times X\to \mathbb {R}^+\) symmetric, vanishing only on the diagonal, and satisfying the triangle inequality.

Definition 239 Distortion of an embedding

For \(f:X\to (\mathbb {R}^n,\ell _2)\), \(\mathrm{distortion}(f)= \mathrm{expansion}(f)\cdot \mathrm{contraction}(f)\), the product of the largest ratios \(\| f(x)-f(y)\| /d(x,y)\) and \(d(x,y)/\| f(x)-f(y)\| \).

Claim 240 A star metric is not isometrically \(\ell _2\)-embeddable

The \(4\)-point metric with one centre at distance \(1\) and the others at distance \(2\) has no isometric Euclidean embedding.

Proof ▶

Each leaf–centre–leaf triple would be collinear, forcing all four points onto a line, contradicting the mutual distance \(2\) of the leaves.

Definition 241 Least \(\ell _2\) distortion

\(c_2(X,d)\) is the least distortion of an embedding of \((X,d)\) into Euclidean space (achievable with \(n\le |X|\)).

Theorem 242 13.1, Bourgain’s embedding

Every \(n\)-point metric embeds into Euclidean space with distortion \(O(\log n)\).

Definition 243 \(\ell _p\) metric

An \(\ell _2\) (\(\ell _1\)) metric is one isometric to a finite subset of \((\mathbb {R}^n,\ell _2)\) (resp. \(\ell _1\)).

Theorem 244 13.3, Johnson–Lindenstrauss

Any \(n\)-point \(\ell _2\) metric embeds into \(O(\log n/\epsilon ^2)\) dimensions with distortion \(\le 1+\epsilon \).

Theorem 245 13.4, \(c_2\) is polynomial-time computable (LLR)

\(c_2(X,d)\) is computable in polynomial time by semidefinite programming.

Proof ▶

Scaling contraction to \(1\), \(\mathrm{distortion}\le \gamma \) iff there is a PSD Gram matrix \(Z\) with \(d(x_i,x_j)^2\le z_{ii}+z_{jj}-2z_{ij}\le \gamma ^2 d(x_i,x_j)^2\); this feasibility problem is solved by the ellipsoid method.

Claim 246 13.5, Self-duality of the PSD cone

\(Z\) is PSD iff \(\sum _{ij}q_{ij}z_{ij}\ge 0\) for every PSD \(Q\).

Proof ▶

(\(\Leftarrow \)) take \(Q=vv^\top \) to get \(v^\top Zv\ge 0\). (\(\Rightarrow \)) every PSD \(Q\) is a sum of rank-one PSD matrices \(vv^\top \), on each of which the inequality holds.

Theorem 247 13.6, SDP-dual formula for \(c_2\) (LLR)

\(c_2(X,d)=\max _{P\in \mathrm{PSD},\, P\mathbf1=0} \sqrt{\frac{\sum _{p_{ij}{\gt}0}p_{ij}d(x_i,x_j)^2}{-\sum _{p_{ij}{\lt}0}p_{ij}d(x_i,x_j)^2}}\).

Proof ▶

Dualizing the feasibility problem of Theorem 245: a contradicting nonnegative combination selects a PSD \(P\) with \(P\mathbf1=0\) (to cancel diagonal terms), yielding \(0\ge \sum _{p_{ij}{\gt}0}p_{ij}d^2+\gamma ^2 \sum _{p_{ij}{\lt}0}p_{ij}d^2\), contradictory exactly below the stated ratio.

Definition 248 \(c_2\) of a graph

\(c_2(G)=c_2(V(G),d_G)\) for the shortest-path metric.

Claim 249 Enflo: distortion of the cube

\(c_2(Q_r)=\sqrt r\).

Proof ▶

The identity embedding gives distortion \(\sqrt r\). For the lower bound take \(P\) with \(P(x,x)=r-1\), \(P(x,y)=-1\) for edges, \(+1\) for antipodes; then \(P\mathbf1=0\), \(P\) is PSD, and Theorem 247 gives \(\sqrt{2^rr^2/(2^rr)}=\sqrt r\).

Lemma 250 Dirac’s theorem

✓

A graph on \(n\) vertices with minimum degree \(\ge n/2\) has a Hamiltonian cycle.

Lemma 251 13.7, Far-distance perfect matching

For \(k\)-regular \(G\) of even order \(n\), the graph \(H\) joining vertices at \(G\)-distance \(\ge \lfloor \log _kn\rfloor \) has a perfect matching.

Proof ▶

At most \(n/2\) vertices lie within distance \(\lfloor \log _kn\rfloor -1\), so \(H\) has minimum degree \(\ge n/2\); by Dirac’s theorem \(H\) has a Hamiltonian cycle, hence (even \(n\)) a perfect matching.

Theorem 252 13.8, Expanders have \(\Omega (\log n)\) distortion (LLR)

If \(G\) is an \((n,k)\)-graph with \(\lambda _2\le k-\epsilon \), then \(c_2(G)=\Omega (\log n)\).

Proof ▶

Let \(B\) be a perfect matching of the far-graph \(H\) (Lemma 251) and \(P=kI-A_G+\tfrac \epsilon 2(B-I)\). Then \(P\mathbf1=0\), and for \(x\perp \mathbf1\), \(x^\top (kI-A_G)x\ge \epsilon \| x\| ^2\) while \(x^\top (B-I)x\ge -2\| x\| ^2\), so \(P\) is PSD. In Theorem 247, \(-\sum _{p_{ij}{\lt}0}d^2p_{ij}=kn\) and \(\sum _{p_{ij}{\gt}0}d^2p_{ij}\ge \tfrac \epsilon 2n\lfloor \log _kn\rfloor ^2\), giving \(c_2(G)=\Omega (\log n)\).

Theorem 253 13.9, Poincaré inequality for graphs

For \(k\)-regular \(G\) and any \(f:V\to \mathbb {R}^n\), \(\mathbb {E}_{u,v}\| f(u)-f(v)\| ^2\le \tfrac k{k-\lambda _2}\mathbb {E}_{(u,v)\in E}\| f(u)-f(v)\| ^2\).

Proof ▶

Reduce to scalar zero-average \(f\) (both sides additive over coordinates and shift-invariant); the inequality is then the variational characterization of \(\lambda _2\).

Problem 254 Computing the expansion ratio

Computing \(h(G)\) is co-\(\mathbf{NP}\)-hard; up to a constant it reduces to the sparsest cut \(\min _S|E(S,\overline S)|/(|S|(n-|S|))\).

Definition 255 All-pairs multicommodity flow

Ship \(\delta \) units between every pair under unit edge capacities; \(\delta _{\max } (G)\) is the maximum feasible \(\delta \) (a linear program).

Definition 256 Cut cone and cut metrics

The cut cone is the convex cone of \(\ell _1\) metrics; its extreme rays are the cut metrics \(d_S(x,y)=\mathbf1[|\{ x,y\} \cap S|=1]\).

Theorem 257 13.10, Leighton–Rao

\(\delta _{\max }(G)\le \min _S\frac{|E(S,\overline S)|}{|S|(n-|S|)}\le O(\delta _{\max }(G)\log n)\).

Proof ▶

The left inequality is capacity counting. By LP duality \(\delta _{\max }\) is the minimum of \(\sum _{(i,j)\in E}d_{ij}/\sum _{ij}d_{ij}\) over metrics; over \(\ell _1\) metrics the minimum is attained at a cut metric, giving the sparsest cut, and by Bourgain’s theorem restricting to \(\ell _1\) loses only \(O(\log n)\).

Definition 258 Negative-type metric

\((X,d)\) is of negative type if \(\sqrt d\) is an \(\ell _2\) metric; every \(\ell _1\) metric is of negative type.

Theorem 259 13.12, Negative-type into \(\ell _1\) (refuted conjecture)

Whether every negative-type metric embeds into \(\ell _1\) with bounded distortion (refuted by Khot–Vishnoi with an \((\log \log n)^c\) lower bound).