Finite Sample Approximation for FinitePMF #
Infrastructure for approximating FinitePMFs by empirical distributions of finite samples.
Main results #
expectation_approx_of_tv: TV bound implies test approximationtv_bound_implies_all_tests: TV ≤ ε implies all tests ε-approximatedboolFamilyToFinsetFamily/Finset.boolVCDim: VC dimension for Boolean function families
Expectations #
Expected value ∑ h, μ.prob h * f h of a real-valued test under a finitely supported
distribution. The base expectation primitive of the approximation layer; specialised to
indicator tests in boolTestExpectation.
Equations
- trueExpectation μ f = ∑ h : H, μ.prob h * f h
Instances For
Expected value of a Bool-valued test under a finite distribution, via the
indicator embedding if f h then 1 else 0. The central quantity of the finite-VC
approximation layer: a TV bound on distributions translates to a uniform bound on test
expectations via expectation_approx_of_tv.
Equations
- boolTestExpectation μ f = trueExpectation μ fun (h : H) => if f h = true then 1 else 0
Instances For
A convex combination of values in {0, 1} is nonnegative.
A convex combination of values in {0, 1} is at most 1.
Total Variation Distance #
Total variation distance ∑ h, |μ.prob h - ν.prob h| between two finitely
supported distributions, in its L1 form. (The probabilists' normalised TV is half of
this; the L1 form is what composes directly with the bound in expectation_approx_of_tv
and so is carried through the rest of the layer.) The approximation metric for finite
VC approximation: two distributions close in tvDistance have nearly identical
expectations on every Bool-valued test, which is what the compression argument needs
to substitute an empirical distribution for the true one.
Instances For
Total variation distance is nonnegative.
Total variation distance is symmetric.
Total variation distance from a distribution to itself is zero.
Key Approximation Lemma #
Transfer principle. If tvDistance μ ν ≤ δ, then for every Bool-valued test
f, |E_μ[f] - E_ν[f]| ≤ δ. This is the inequality that lets the approximate-minimax
route to compression replace a target distribution by an empirical distribution: any
high-probability event under μ is approximately high-probability under ν, with
slack controlled by the TV distance.
Uniform variant of expectation_approx_of_tv: a single tvDistance bound suffices
for every test in any finite family simultaneously, with no union bound. The form
actually used by the compression characterisation, where the family is the restricted
concept class.
Expectation of empiricalPMF equals average #
Bridges the FinitePMF view and the sample-average view: the expectation of a
Bool-valued test under the empirical PMF of a sample equals the sample average
(1/T) ∑_t f (s_t). This lets the MWU updates and the approximation transfer
principle live in the same distributional framework.
Approximate Minimax Connection #
Identifies the game-theoretic payoff (a row distribution against a fixed column in the Bool game) with the corresponding test expectation. The translation that lets the MWU regret bound be applied directly to the compression problem.
VC Dimension for Boolean Function Families #
Maps a finite family of Bool-valued functions to its image as a family of
accepting sets. The set-system view is what Mathlib's Finset.Shatters and
Finset.vcDim consume, so this is the entry point from the function-class view to the
combinatorial VC machinery.
Equations
- boolFamilyToFinsetFamily A = Finset.image (fun (f : H → Bool) => {h : H | f h = true}) A
Instances For
VC dimension of a finite Bool-valued family, computed via the set-system image
boolFamilyToFinsetFamily and Mathlib's Finset.vcDim. Declared noncomputable
because the underlying vcDim is.
Equations
Instances For
Note #
The VC uniform approximation theorem is in FLT_Proofs.Complexity.FiniteSupportUC
as finite_support_vc_approx, proved via the H ⊕ ℕ infinite-envelope route.