Machine Learning: Maximum Likelihood Estimation (MLE) and Maximum a Posteriori (MAP) Estimation
MLE doesn't work well with sparse data because the estimated P(Xi | Y)
can be zero for any feature value never seen in training, and a single zero factor zeroes out the whole product.
(for example, Xi = birthdate, and the value Jan_25_1992 never appears in the training set)
P(Y=1 | X1, ..., Xn) = P(Y=1) * ∏_i P(Xi | Y=1) / P(X1, ..., Xn)
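A minimal sketch of the zero-probability problem, using hypothetical toy data (the feature names and helper below are illustrative, not from the notes):

```python
# MLE for P(Xi | Y) in naive Bayes is just a relative frequency,
# so an unseen feature value gets probability exactly 0,
# which zeroes the whole product ∏_i P(Xi | Y=1).
train = [(("sunny", "Jan_01"), 1),
         (("rainy", "Jan_02"), 0),
         (("sunny", "Jan_03"), 1)]

def mle_cond_prob(value, feature_idx, label):
    rows = [x for x, y in train if y == label]
    count = sum(1 for x in rows if x[feature_idx] == value)
    return count / len(rows)  # 0.0 if value never seen with this label

# A birthdate never observed with Y=1 kills P(Y=1 | X):
print(mle_cond_prob("Jan_25_1992", 1, 1))  # 0.0
```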
We can solve this by placing a prior on θ and using MAP estimation.
- avoids overfitting (acts as regularization / shrinkage)
- tends toward the MLE asymptotically (as n → ∞)
- gives only a point estimate (no representation of uncertainty in θ): it may pick a narrow spike of the posterior simply because that mode has higher density
- not invariant under reparameterization
- requires assuming a prior on θ
In other words, the MAP estimate is a weighted combination of the sample mean and the prior mean:
when n -> 0 we get (approximately) the prior mean,
but when n -> ∞ we get (approximately) the sample mean, i.e. the MLE.
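A small sketch of this interpolation for a Bernoulli parameter with a Beta(α, β) prior (the prior values α = β = 5 below are an arbitrary choice for illustration):

```python
# MAP estimate (posterior mode) for a coin's bias theta with a
# Beta(alpha, beta) prior, after observing k heads in n flips:
#   theta_MAP = (k + alpha - 1) / (n + alpha + beta - 2)
def map_estimate(k, n, alpha=5.0, beta=5.0):
    return (k + alpha - 1) / (n + alpha + beta - 2)

# n -> 0: the estimate falls back to the prior mode (here 0.5)
print(map_estimate(k=0, n=0))        # 0.5
# n large: the estimate approaches the MLE k/n
print(map_estimate(k=900, n=1000))   # ~0.897, close to 0.9
```

With no data the prior dominates; with lots of data the pseudo-counts α − 1 and β − 1 become negligible and the MAP estimate converges to the MLE.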
- The Cramér-Rao Lower Bound
- Central Limit Theorem: sums of independent random variables tend toward a normal distribution
- Likelihood Ratio Test (compares the null hypothesis with the ML estimate)
- Wald Test
- etc
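A minimal sketch of a likelihood ratio test for a Bernoulli parameter, assuming the classical Wilks result that 2·(loglik(p̂) − loglik(p0)) is asymptotically chi-square with 1 degree of freedom (the data values below are made up):

```python
import math

# H0: p = 0.5 versus the unrestricted MLE p_hat = k/n.
def loglik(p, k, n):
    return k * math.log(p) + (n - k) * math.log(1 - p)

def lrt_pvalue(k, n, p0=0.5):
    p_hat = k / n
    stat = 2 * (loglik(p_hat, k, n) - loglik(p0, k, n))
    # chi-square(1 df) survival function equals erfc(sqrt(stat / 2))
    return math.erfc(math.sqrt(stat / 2))

# 60 heads in 100 flips: moderate evidence against p = 0.5
print(lrt_pvalue(k=60, n=100))  # around 0.045
```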