pixel-rnn Multi-dimensional LSTMs for image generative models. TODO: 4. Further optimize Mixture of Gaussian 5. Inference Code BUGS: Why there can be negative loss? Probabilities that larger than 1?