Generalisation Error
We can define generalisation error as the discrepancy between
and . The Hoeffding Inequality characterises the generalization
error with a probabilistic bound:
Pick a tolerance level , and assert with probability
that
Notice the error bound depends on , the size of the hypothesis set . Most
learning models have infinite , including the simple perceptron. Hence, to
study generalisation in such models, we need to derive a counterpart that deals
with infinite .
Notice that the factor was obtained by taking the disjunction of events. Let
be the bad event that . Notice
that these bad events are often strongly overlapping, and the disjunction of
these events form a much smaller area.
The mathematical theory of generalisation hinges on this observation.
Upon accounting for the overlaps of different hypotheses, we will be
able to replace the number of hypotheses with an effective finite
number, even while is infinite.