Refs

In much of what follows, I use Θ and $z$ interchangeably.

Why latent variables

We want to model $p(X;\theta)$, but we also explicitly model latent variables/features $Z$.
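
To spell out how the two fit together (a sketch, assuming a discrete $Z$; for a continuous $Z$ the sum becomes an integral), the model over the observed data is the marginal of the joint over $X$ and $Z$:

$$p(X;\theta) = \sum_{Z} p(X, Z;\theta) = \sum_{Z} p(X \mid Z;\theta)\, p(Z;\theta)$$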

Example: GMMs (Gaussian mixture models), for clustering and unsupervised learning

image.png
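
A minimal sketch of the GMM case, to make the role of the latent variable concrete. It assumes a 1-D mixture with two components; the weights, means, and standard deviations are made-up illustrative values, and the function names are hypothetical rather than taken from any library. Each point is generated by first drawing a hidden cluster label $z$ and then drawing $x$ given $z$; summing the joint over $z$ gives the marginal $p(x)$, and normalizing gives the posterior over clusters.

```python
import numpy as np


def sample_gmm(weights, means, stds, n, rng):
    """Generate n points by first drawing the latent cluster, then the observation."""
    # z ~ Categorical(weights): the latent cluster assignment
    z = rng.choice(len(weights), size=n, p=weights)
    # x | z ~ Normal(means[z], stds[z])
    x = rng.normal(means[z], stds[z])
    return x, z


def normal_pdf(x, mean, std):
    """Density of a univariate Gaussian."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2 * np.pi))


def marginal_and_posterior(x, weights, means, stds):
    """Return p(x_i) and p(z = k | x_i) for every point and component."""
    # joint[i, k] = p(z = k) * p(x_i | z = k)
    joint = weights[None, :] * normal_pdf(x[:, None], means[None, :], stds[None, :])
    # Marginalize out the latent variable: p(x_i) = sum_k p(x_i, z = k)
    marginal = joint.sum(axis=1)
    # Bayes' rule: p(z = k | x_i) = joint / marginal
    posterior = joint / marginal[:, None]
    return marginal, posterior


# Illustrative (assumed) parameters for a two-component mixture
weights = np.array([0.3, 0.7])
means = np.array([-2.0, 3.0])
stds = np.array([0.5, 1.0])

rng = np.random.default_rng(0)
x, z = sample_gmm(weights, means, stds, 1000, rng)
marginal, posterior = marginal_and_posterior(x, weights, means, stds)

# Hard cluster assignment: the most probable latent value for each point
predicted = posterior.argmax(axis=1)
```

In a real clustering setting `z` is never observed; it is kept here only so the posterior assignments in `predicted` can be compared against the labels that actually generated the data.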

Another motivation:

image.png

Can also use more concrete examples if they’re easier to think about:

image.png

Overview

tldr: