Framework
In a typical "inductive learning task", the learner uses a given set
of "labeled instances" (eg, each might be a description of a particular
patient, labeled with the correct diagnosis) to learn a classifier
that will accurately label new (unlabeled) instances drawn from the same
distribution. (Here, this means other patients in this population.)
Much of the formal work is in the context of
"PAC Learning":
The learner is given some class of possible classifiers
(eg, perceptrons, or decision-trees, or ...) as well as error and confidence
constants. It then draws a sufficient number of labeled instances
(from an oracle that produces such instances according to the underlying
distribution, and labels each with the correct label, based on the unknown
target classifier), which it uses to identify a classifier from the given
class. Any classifier will have some error --- which is the probability
that it will misclassify an instance, over instances drawn from the underlying
distribution. (Note the target concept presumably has 0 error.)
For certain classes, we can guarantee that the classifier returned by the
learner will, with high probability, have small error.
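For a finite class of classifiers, this guarantee can be made concrete with the
standard sample-complexity bound: m >= (1/epsilon)(ln |H| + ln(1/delta)) labeled
instances suffice for any learner that returns a classifier consistent with the
sample. A minimal sketch (the bound is standard; the function name is our own):

```python
import math

def pac_sample_size(hypothesis_count, epsilon, delta):
    """Number of labeled instances sufficient for a consistent learner
    over a finite class of `hypothesis_count` classifiers to return,
    with probability at least 1 - delta, a classifier whose error is
    at most epsilon (the standard finite-class PAC bound)."""
    return math.ceil((math.log(hypothesis_count) + math.log(1.0 / delta))
                     / epsilon)
```

For example, a class of 2^10 classifiers with epsilon = 0.1 and delta = 0.05
needs only about 100 instances; note the dependence on |H| is only logarithmic.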
We often measure the quality of a performance system (or ``agent'') by
how well it performs on average --- eg, how often an expert system
returns the appropriate diagnosis, or a web crawling agent finds the most
relevant articles, etc. A learner's task is to find the best such
agent, often from a large space of possible agents (eg, the learner may
be seeking the best parameter setting, or the most appropriate set of heuristics
to use, etc.). As it is often difficult, or intractable, to find
the globally optimal agent, many practical learning systems instead hill-climb
to a local optimum. Even this task is problematic, as the hill-climber
must know the distribution of tasks that will be encountered to decide
whether to climb from one agent to another;
unfortunately, this information is typically not known a priori.
The Palo algorithm approximates this hill-climbing search when
the ``utility function'' (used to evaluate each agent's performance) can
only be estimated by sampling. It can efficiently return an agent
that is, with high probability, essentially a local optimum.
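The idea behind such statistically-guided hill-climbing can be sketched as
follows (a simplified illustration of the general technique, not the actual
Palo implementation): estimate the utility difference between the current agent
and a neighboring agent by sampling tasks, and move only once a Hoeffding-style
confidence interval shows the observed advantage is statistically real.

```python
import math

def hoeffding_radius(n, delta, utility_range):
    """Half-width of a (1 - delta) confidence interval on the mean of
    n samples bounded in a range of width `utility_range`
    (Hoeffding's inequality)."""
    return utility_range * math.sqrt(math.log(2.0 / delta) / (2.0 * n))

def climb_step(current, neighbor, utility, delta=0.05, max_samples=10000):
    """Decide, by sampling, whether `neighbor` outperforms `current`.
    `utility(agent)` returns that agent's utility, in [0, 1], on one
    randomly drawn task.  Returns the agent that is, with probability
    at least 1 - delta, at least as good as the other."""
    diffs = []
    while len(diffs) < max_samples:
        diffs.append(utility(neighbor) - utility(current))
        n = len(diffs)
        mean = sum(diffs) / n
        eps = hoeffding_radius(n, delta, utility_range=2.0)  # diffs lie in [-1, 1]
        if mean - eps > 0:
            return neighbor   # confidently better: climb
        if mean + eps < 0:
            return current    # confidently not better: stay
    return current  # could not distinguish within the sample budget
```

Iterating `climb_step` over neighbors until none is confidently better yields an
agent that is, with high probability, essentially a local optimum.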
Making efficient use of training samples
Many learners work in batch mode: returning a classifier only after observing
a specified (large) number of labeled training instances. Note this
quantity of instances must be sufficient to PAC-learn any
classifier in the class, given any sequence of training instances.
We consider sequential learners, which can observe these instances
one-at-a-time and decide autonomously whether to halt and return a hypothesis,
or continue training. We prove that these algorithms require many
fewer training samples (on average) than previous PAC-learning techniques,
while maintaining the exact same distribution-free worst-case guarantees.
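One way a sequential learner can decide when to halt is a ``survival run''
stopping rule: refit whenever the current hypothesis misclassifies a fresh
instance, and halt once it has correctly predicted enough consecutive fresh
instances that, by a union bound over runs, the PAC guarantee holds. A sketch
(the run lengths and the delta-splitting scheme below are one simple choice,
not the specific algorithm from our work):

```python
import math

def sequential_learn(draw_labeled, fit, epsilon, delta):
    """Sequential PAC learner.  `draw_labeled()` returns one labeled
    instance (x, y); `fit(examples)` returns a hypothesis consistent
    with `examples`.  Halts once the current hypothesis survives r
    consecutive fresh instances, with r chosen so a hypothesis of
    error > epsilon survives run i with probability <= delta / 2**i;
    summing over runs, the total failure probability is <= delta."""
    examples = []
    run = 0
    while True:
        run += 1
        h = fit(examples)
        # run length: (1 - eps)**r <= exp(-eps * r) <= delta / 2**run
        r = math.ceil((run * math.log(2.0) + math.log(1.0 / delta)) / epsilon)
        survived = 0
        while survived < r:
            x, y = draw_labeled()
            if h(x) == y:
                survived += 1
            else:
                examples.append((x, y))
                break  # refit and start a new, longer run
        else:
            return h  # survived the whole run: PAC guarantee holds
```

On easy targets this halts after far fewer instances than the worst-case batch
bound requires, while retaining the same distribution-free guarantee.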
Most theoretical analyses assume that both training and performance examples
are complete --- ie, that the value of every attribute is known to both
learner and classifier. As noted throughout the learning and data-mining
communities, real-world data is usually incomplete. We address
this discrepancy by formally analyzing the task of learning to classify
incompletely specified performance examples --- considering, for example,
the questions
- If the desired classification algorithm must classify partially-specified
  instances, which learning algorithm should be used?
- Should this learning algorithm use partially-specified instances
  (exactly like the ones its classifier will have to classify), or instances
  that have been ``filled in'' (ie, which have no missing values)?
Most classification algorithms are ``passive'', in that they assign a class-label
to each instance based only on the description given, even if that description
is incomplete. In contrast, an active classifier can
--- at some cost --- obtain the values of missing attributes, before deciding
upon a class label. This can be particularly useful when considering, for
example, sending a web-crawler out to find some critical information,
or performing some expensive information-gathering test or experiment.
The expected utility of using an active classifier depends on both the
cost required to obtain the additional attribute values and the penalty
incurred if the classifier outputs the wrong classification. We analyze
the problem of learning optimal
active classifiers, using a variant of the probably-approximately-correct
(PAC) model.
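As a toy illustration of this tradeoff, consider a single binary attribute:
querying it is worthwhile exactly when the expected reduction in
misclassification penalty exceeds its cost. (The function names and
probabilities below are illustrative assumptions, not from our analysis.)

```python
def expected_utility_passive(p_positive, penalty):
    """Best passive action: predict the more likely label;
    expected utility is -penalty * P(wrong)."""
    return -penalty * min(p_positive, 1.0 - p_positive)

def expected_utility_active(p_value, p_pos_given, cost, penalty):
    """Pay `cost` to observe a binary attribute, then predict the more
    likely label given its value.  p_value = P(attribute = 1);
    p_pos_given[v] = P(label = 1 | attribute = v)."""
    exp_penalty = sum(
        p * penalty * min(p_pos_given[v], 1.0 - p_pos_given[v])
        for v, p in ((1, p_value), (0, 1.0 - p_value)))
    return -cost - exp_penalty

def should_query(p_positive, p_value, p_pos_given, cost, penalty):
    """Query the attribute iff doing so has higher expected utility."""
    return (expected_utility_active(p_value, p_pos_given, cost, penalty)
            > expected_utility_passive(p_positive, penalty))
```

For instance, with a perfectly predictive attribute and a penalty of 10, paying
a cost of 1 to query is worthwhile, but paying 6 is not; an optimal active
classifier must make this comparison for every reachable information state.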
Effectively Exploiting Domain and Context Information
Most learning and data-mining algorithms build new classifiers ``from scratch''.
This is clearly inefficient if one already has a good, but not completely
correct, classifier (or theory); here, a more efficient learner would instead
begin with that initial theory, and revise it as required to accommodate
new, more trusted information. This is the essence of the Machine
Learning area of ``Theory Revision''.
In addition to implementing and evaluating a (now deployed) theory
revision system, we have studied both the sample and computational complexity
of this task, proving, for example, that it is not even approximable
--- ie, assuming P ≠ NP, no efficient algorithm can find a revision
that is even close to optimal.