3.2 The VC Characterization

The main result of this section is:

Theorem 3.7 VC Characterization of PAC Learnability [

Lean: fundamental_theorem

Let \(\mathcal{H}\) be a hypothesis class over domain \(X\). The following are equivalent:

Moreover, if \(d = \mathrm{VCdim}(\mathcal{H}) {\lt} \infty \), then \(\mathcal{H}\) is PAC learnable with sample complexity

\[ m_\mathcal {H}(\varepsilon , \delta ) = O\! \left(\frac{d + \log (1/\delta )}{\varepsilon }\right). \]

The proof proceeds in four stages. Each stage is a separate lemma, and each lemma is a foothold on the ascent.

Stage 1: Finite VC dimension \(\Longrightarrow \) polynomial growth function (Sauer–Shelah).

Stage 2: Polynomial growth function \(\Longrightarrow \) uniform convergence (\(\varepsilon \)-net/symmetrization argument).

Stage 3: Uniform convergence \(\Longrightarrow \) ERM is a PAC learner.

Stage 4 (Converse): Infinite VC dimension \(\Longrightarrow \) not PAC learnable.