Skip to content

konstantinos-p/cold_posteriors_pac_bayes

Repository files navigation

Cold Posteriors through PAC-Bayes.

We investigate the cold posterior effect through the lens of PAC-Bayes generalization bounds. We argue that in the non-asymptotic setting, when the number of training samples is (relatively) small, discussions of the cold posterior effect should take into account that approximate Bayesian inference does not readily provide guarantees of performance on out-of-sample data. Instead, out-of-sample error is better described through a generalization bound. In this context, we explore the connections of the ELBO objective from variational inference and the PAC-Bayes objectives. We note that, while the ELBO and PAC-Bayes objectives are similar, the latter objectives naturally contain a temperature parameter $\lambda$ which is not restricted to be $\lambda=1$. For both regression and classification tasks, in the case of isotropic Laplace approximations to the posterior, we show how this PAC-Bayesian interpretation of the temperature parameter captures the cold posterior effect.

Fig.1 - $\mathcal{B}_{\mathrm{original}}$ as a function of the $\lambda$ parameter for different datasets. We plot the Empirical Risk, Moment and KL terms, as well as the $\mathcal{B}_{\mathrm{original}}$ bound values for different $\lambda$. We see that for increasing values of $\lambda$, the empirical risk decreases, while the Moment term increases. Interestingly the KL term (which is the KL divergence + a constant and divided by $n\lambda$) also decreases as we increase $\lambda$. As the KL term dominates the bound, the overall effect is that bound decreases as $\lambda$ increases and is tightest for the largest value of $\lambda$.

📝 Citation

When citing this repository on your scientific publications please use the following BibTeX citation:

@article{pitas2022cold,
  title={Cold Posteriors through PAC-Bayes},
  author={Pitas, Konstantinos and Arbel, Julyan},
  journal={arXiv preprint arXiv:2206.11173},
  year={2022}
}

✉️ Contact Information

You can contact me at any of my social network profiles:

Or via email at pitas.konstantinos@inria.fr

⚠️ Disclaimer

This Python package has been made for research purposes.

About

The code for the paper "Cold Posteriors through PAC-Bayes". https://arxiv.org/pdf/2206.11173.pdf

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages