Why graphical models?
Tags | CS 228 |
---|
Graphical models do what?
- represent: they allow you to represent a system
- learn: they allow you to take information (training data) and encode it into the network
- inference: they allow you to take what you know and “query” the model to get an important answer
The joint distribution
We might start with a joint distribution . How many parameters do you need, if each variable can take on values?
Well, to encode each variable, you need numbers, which means that you need entries in the table. Now, you technically only need pices of information because the last point you can infer. This is pretty bad, as things become intractable really quick. It also means that you will have a hard time fitting any sort of model to it.
One (failed) approach
Why don’t we just represent it as the chain rule? This is known as factorization
.
We look at this, and we see that we need , which is still exponential in (in fact, it evaulates to be the same thing). Therefore, there is no free lunch.
Markov Assumption
Philosophically, the markov assumption states that the future can be entirely determined from the present states; the past is irrelevant.
Therefore, .
This requires parameters! It is no longer exponential!
The key assumption is that, conditioned on the present, the past does not matter. There is still no free lunch; this is a somewhat strong (but reasonable) assumption to make of the real world.