Jethro's Braindump

Concept Learning

A concept is a boolean-valued function over a set of input instances (each comprising input attributes). Concept learning is a form of supervised learning. Infer an unknown boolean-valued function from training-examples.

Hypothesis

There is a trade-off between expressive power and smaller hypothesis space. Large hypothesis spaces are bad, because search is going to take a long time, and also requires more data. Humans exploit structure in the hypothesis space to guide search and learn faster.

A hypothesis h is consistent with a set of training examples D iff h(x)=c(x) for all <x,c(x)>∈D.

Inductive Learning

Any hypothesis found to approximate the target function well over a sufficient large set of training examples will also approximate the target function well over other unobserved examples.

The goal is to search for a hypothesis hH that is consistent with D.

Exploit Structure in Concept Learning

hj is more general than or equal to hk (denoted hjgphk) iff any input instance x that satisfies hj also satisfies hk.

This is relation is a partial order.

Find-S Algorithm

Intuition: Start with the most specific hypothesis h. Whenever it wrongly classifies a positive training example, we “minimally” generalize it to satisfy its input instance.

Limitations

  1. Can’t tell whether Find-S has learnt the target concept
  2. Can’t tell when training examples are inconsistent
  3. Picks a maximally specific h
  4. Depending on H, there may be several solutions

Version Space

VSH,D=hH|h is consistent with D

  • If cH, then D can reduce VSH,D to c.
  • If D is insufficient, then VSH,D represents the uncertainty of what the target concept is
  • VSH,D contains all consistent hypotheses, including maximally specific hypotheses

The general boundary G of VSH,D is the set of maximally general members of H consistent with D.

The specific boundary S of VSH,D is the set of maximally general members of H consistent with D.

VSH,D=hH|sSgGgghgs

List-Then-Eliminate Algorithm

Iterate through all hypotheses in H, and eliminate any hypothesis found inconsistent with any training example. This algorithm is often prohibitively expensive.

Candidate-Elimination Algorithm

Start with most general and specific hypotheses. Each training example “minimally” generalizes S and specializes G to remove inconsistent hypotheses from version space.