Rademacher Complexity #

theorem finite_massart_lemma {m : ℕ} (_hm : 0 < m) {N : ℕ} (hN : 0 < N) (Z : Fin N → SignVector m → ℝ) (σ_param : ℝ) (hσ : 0 < σ_param) (h_mgf : ∀ (j : Fin N) (t : ℝ), 0 ≤ t → 1 / ↑(Fintype.card (SignVector m)) * ∑ sv : SignVector m, Real.exp (t * Z j sv) ≤ Real.exp (t ^ 2 * σ_param ^ 2 / 2)) :

(1 / ↑(Fintype.card (SignVector m)) * ∑ sv : SignVector m, Finset.univ.sup' ⋯ fun (j : Fin N) => Z j sv) ≤ σ_param * √(2 * Real.log ↑N)

Massart finite lemma: E_σ[max_{j ≤ N} Z_j] ≤ σ√(2 log N).

Helper lemmas for Sauer-Shelah exponential bound #

source

theorem sum_choose_le_exp_pow (d m : ℕ) (hd : 0 < d) (hdm : d ≤ m) :

∑ i ∈ Finset.range (d + 1), ↑(m.choose i) ≤ (Real.exp 1 * ↑m / ↑d) ^ d

Pure combinatorial inequality: ∑_{i=0}^d C(m,i) ≤ (em/d)^d for d ≤ m, d ≥ 1.

source

theorem sauer_shelah_exp_bound {X : Type u} (C : ConceptClass X Bool) (d m : ℕ) (hd : 0 < d) (hdm : d ≤ m) (hvc : VCDim X C = ↑d) :

↑(GrowthFunction X C m) ≤ (Real.exp 1 * ↑m / ↑d) ^ d

Sauer-Shelah exponential bound: GrowthFunction(C,m) ≤ (em/d)^d.

VCDim → Rademacher bound #

source

theorem vcdim_bounds_rademacher_quantitative (X : Type u) [MeasurableSpace X] (C : ConceptClass X Bool) (D : MeasureTheory.Measure X) (m : ℕ) (hm : 0 < m) (d : ℕ) (hd : VCDim X C = ↑d) (hd_pos : 0 < d) (hdm : d ≤ m) [MeasureTheory.IsProbabilityMeasure (MeasureTheory.Measure.pi fun (x : Fin m) => D)] :

RademacherComplexity X C D m ≤ √(2 * ↑d * Real.log (Real.exp 1 * ↑m / ↑d) / ↑m)

VC dimension upper bounds Rademacher complexity: Rad ≤ √(2d·log(em/d)/m).

The proof decomposes into: (1) Pointwise: EmpRad(xs) ≤ B for all xs [Massart + Sauer-Shelah] (2) Integral: Rad = ∫ EmpRad ≤ ∫ B = B [probability measure]

Step (2) is proved. Step (1) for B ≥ 1 follows from EmpRad ≤ 1. Step (1) for B < 1 requires Massart finite lemma + Sauer-Shelah growth bound.

Rademacher ↔ PAC #

source

theorem rademacher_lower_bound_on_shattered (X : Type u) [MeasurableSpace X] [MeasurableSingletonClass X] (C : ConceptClass X Bool) (T : Finset X) (hT : Shatters X C T) (m : ℕ) (hm : 0 < m) (hT_large : 4 * m ^ 2 + 1 ≤ T.card) :

∃ (D : MeasureTheory.Measure X), MeasureTheory.IsProbabilityMeasure D ∧ 1 / 2 ≤ RademacherComplexity X C D m

Adversarial Rademacher lower bound on shattered sets. For |T| >= 4m^2 + 1, exists D with Rad_m(C,D) >= 1/2.

Proof: D = uniform on T. Product measure = uniform on T^m. On injective samples from T (shattered): EmpRad = 1 (by empRad_eq_one_of_injective_in_shattered). EmpRad ≥ 0 everywhere (by empRad_nonneg). Birthday bound: P[injective m draws from n ≥ 4m²+1 points] ≥ 1 - m(m-1)/(2n) ≥ 7/8 ≥ 1/2. So ∫ EmpRad ≥ P[injective] · 1 + P[¬injective] · 0 ≥ 1/2.

source

theorem vcdim_finite_imp_rademacher_vanishing (X : Type u) [MeasurableSpace X] (C : ConceptClass X Bool) (hvcdim : VCDim X C < ⊤) (ε : ℝ) :

ε > 0 → ∃ (m₀ : ℕ), ∀ (D : MeasureTheory.Measure X), MeasureTheory.IsProbabilityMeasure D → ∀ m ≥ m₀, RademacherComplexity X C D m < ε

VCDim finite → Rademacher vanishes uniformly. The bound m₀ depends only on d and ε, NOT on D.

Documentation

FLT_Proofs.Complexity.Rademacher

Rademacher Complexity #

Main results #

Helpers for VCDim → Rademacher bound (Massart + Sauer-Shelah) #

Helper lemmas for Sauer-Shelah exponential bound #

VCDim → Rademacher bound #

Rademacher ↔ PAC #