Surprising predict() behaviour #533

Closed
markgoodhead opened this issue Jun 11, 2022 · 8 comments
Labels
Discussion (Issue open for discussion, still not ready for a PR on it.)

Comments

@markgoodhead
Contributor

I'm not sure if this is an issue as such, more a comment / point of discussion. As a fairly new user of Bambi, I found the behaviour of predict() very unintuitive, because in the Python data science ecosystem the fit()/predict() interface carries very strong connotations of the standard sklearn API.

I appreciate that the Bayesian approach differs from the usual ML approach in that you're generally interested not just in point estimates but in the full posterior predictive distribution. However, I think it would be very beginner friendly to have the option to treat the model like a frequentist/ML sklearn-style model, for which predict() produces point estimates. Even if you don't care too much about the full posterior and uncertainty estimation, there are still reasons to prefer the Bayesian approach if you're only after point estimates.

One proposal would be to add a "point-estimate-prediction" option alongside "mean" and "pps" which instead returns a numpy array of per-data-point predictions. Whilst this would just be equivalent to doing pps and then taking the mean, I think it would help beginners tremendously, as otherwise they have to understand the usual InferenceData/xarray structure to do it themselves, which has its own learning curve.
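To make the contrast concrete, here is a minimal sketch of the workflow a newcomer currently has to discover. This assumes Bambi's Model.predict(idata, kind=..., data=...) signature; the data frames and the response name "y" are placeholders, and the exact variable name inside the posterior_predictive group can differ.

import bambi as bmb

# fit a model as usual; df_train and df_new are placeholder data frames
model = bmb.Model("y ~ x", df_train)
idata = model.fit()

# kind="pps" samples from the posterior predictive and stores the draws in
# idata.posterior_predictive rather than returning a numpy array of predictions
model.predict(idata, kind="pps", data=df_new)

# collapse the chain and draw dimensions to get one point estimate per row of df_new
point_predictions = idata.posterior_predictive["y"].mean(("chain", "draw")).values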

@aloctavodia
Collaborator

Working with distributions and samples is the bread and butter of computational Bayesians. What about adding or extending an example about this? The more examples we have about how to use InferenceData, the less steep that learning curve will be.

@aloctavodia added the Discussion label on Jun 11, 2022
@tomicapretto
Collaborator

tomicapretto commented Jun 12, 2022

I understand your point, and I agree it's harder to think about samples. What's more, we have chains and draws, and new data structures as well, which I think makes things more complicated.

On the other hand, I don't think we should change anything in the .predict() method itself. It should always return a non-post-processed Bayesian prediction.

I think the way to make Bambi more appealing to newcomers (both beginners and people experienced with frequentist frameworks) is to provide utility functions that do much if not all of the heavy lifting (for example #517).

In this particular case, I think there could be another utility function that post-processes the InferenceData to return point estimates. This way, users will be aware that Bambi returns a whole posterior (as samples) but they are explicitly converting it to point estimates (even if they don't implement that conversion by hand).
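As a rough illustration only (the helper's name and where it lives are exactly what the questions below are about), such a function could be as small as:

def point_estimates(idata, var_name):
    # Average the posterior predictive samples over the chain and draw
    # dimensions, returning one value per observation as a numpy array
    return idata.posterior_predictive[var_name].mean(("chain", "draw")).values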

Some open questions

  • How should this function behave?
  • In which module do we put this function?
  • Do we create a new one? Do we want to have a utility module?

@markgoodhead
Contributor Author

Yes, I think a utility method is a good solution and would address this quite simply. I think it'd be quite small, as the analogous code I use in my PyMC models is just:

y_test = pm.sample_posterior_predictive(results)
return y_test.posterior_predictive.y.mean(axis=(0,1)).values

If it were a brand new library, I'd argue for keeping predict() closer to the sklearn interface, because that's what's 'surprising' to new users; I'd bet a high percentage of people without a Bayesian/PyMC background but with a Python data science/ML background see fit/predict and immediately have a misconception about how the predict() function will work. However, I appreciate that breaking backwards compatibility to address this is probably too annoying to existing Bambi users who expect the current behaviour, so a small utility function adding an sklearn-API-like predict call is a good compromise, I think.

In terms of where it should exist, ideally it'd be another method on the Bambi Model object, named something like predict_sklearn() or predict_point_estimates() (better naming suggestions welcomed!), so that when another new user does what I did (calls predict(), then goes "huh, where's my numpy array of predictions?!") and heads to the API reference documentation, they'll see this function directly below/above predict() and can go "Ah, that's the one I should use!".

I'll produce a small example branch with this feature (as I think it should be fairly straightforward) to better demonstrate what I'm thinking of.
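As a hypothetical sketch only (the method name comes from the discussion above, the response variable "y" is a placeholder, and this is not an agreed API), it would essentially be a thin wrapper around the existing predict() call:

def predict_point_estimates(self, idata, data=None):
    # Run the usual Bayesian prediction, then collapse the posterior
    # predictive samples into one point estimate per data point
    self.predict(idata, kind="pps", data=data)
    return idata.posterior_predictive["y"].mean(("chain", "draw")).values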

@markgoodhead
Contributor Author

#535

Here's an example of what I was thinking of. I appreciate it's incredibly simple (literally a one-liner once you've called predict()), but this one-liner took me a few hours to work out the first time I used Bambi 😅

@aloctavodia
Collaborator

aloctavodia commented Jun 13, 2022

Notice that with InferenceData/xarray you can use labels.

y_test.posterior_predictive["y"].mean(("chain", "draw"))

This generally requires writing more characters, but the result is easier to read. For example, here it is clear that the intention is to average over both chains and draws.

Another comment: in ArviZ we have discussed hiding the chain information from the user. The main reason is to simplify working with InferenceData, because as a general rule users do not care about directly accessing individual chains. The chain information is useful mostly for diagnosing samples, so ArviZ can still access the chains internally.
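As a concrete example of what hiding the chains can look like in practice, plain xarray already lets you flatten the two sampling dimensions into one (variable names follow the snippets above):

# stack chains and draws into a single "sample" dimension, so downstream code
# never has to reason about individual chains
flat = y_test.posterior_predictive["y"].stack(sample=("chain", "draw"))
flat.mean("sample")  # same point estimates as averaging over chain and draw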

@tomicapretto
Collaborator

I'm going to close the issue because we're not changing the way .predict() behaves.

@tomicapretto closed this as not planned on Jan 5, 2023
@gshotwell

I'm pretty new to Bayesian modelling, but I did find this behaviour quite confusing in Bambi. Maybe a good solution would be to have a vignette that walks through how to practically work with the xarray objects?

@tomicapretto
Collaborator

@gshotwell that's definitely a good idea, thanks!
