MWU update step and one-step potential bound #
Given a column c that the row player hits (M r c = true), the MWU
update multiplies weights c by 1 - η and leaves all other weights
unchanged. The one-step potential bound Φ' ≤ Φ · (1 - η · v) follows
from the row player's best-response property composed with the hypothesis
that every PMF-valued column mixture admits a pure response with payoff
at least v.
One step of the multiplicative-weights update: columns c with M r c = true have
their weights scaled by 1 - η; all other weights are unchanged.
Equations
Instances For
Best-response payoff in terms of weights: a pure response against the normalized PMF
gives expected payoff at least v · Φ when weighed by the un-normalized weight vector.
One-step MWU potential bound: after one update, the potential is multiplied by
(1 - η · v) or less.