[WIP] Fix HSGP predictions #780

tomicapretto · 2024-02-18T19:36:43Z

Fixes predictions when HSGP contains a by variable.

It makes sure the underlying design matrix is not modified until the data of all HSGP components are used.
Modifies get_model_covariates so it also looks at the named arguments of function calls.
- For this, this PR in formulae was also needed Interpret True, False, and None as python literals formulae#107

~~TODO: implement tests?~~

Edit : it closes #776

GStechschulte · 2024-02-19T11:36:48Z

Thanks a lot @tomicapretto 👍🏼

* Update code of conduct * update changelog

…nto fix_hsgp_prediction

tomicapretto · 2024-02-23T16:36:42Z

@GStechschulte could you try this?

import bambi as bmb
import numpy as np
import pandas as pd

df = pd.read_csv("tests/data/gam_data.csv")

rng = np.random.default_rng(1234)
df["fac2"] = rng.choice(["a", "b", "c"], size=df.shape[0])

formula = "y ~ 1 + x0 + hsgp(x1, by=fac, m=10, c=2) + hsgp(x1, by=fac2, m=10, c=2)"
model = bmb.Model(formula, df, categorical=["fac"])
idata = model.fit(tune=500, draws=500, target_accept=0.9)

Plot 1

bmb.interpret.plot_predictions(
    model, 
    idata, 
    conditional="x1", 
    subplot_kwargs={"main": "x1", "group": "fac2", "panel": "fac2"},
);

Plot 2

bmb.interpret.plot_predictions(
    model, 
    idata, 
    conditional={
        "x1": np.linspace(0, 1, num=100),
        "fac2": ["a", "b", "c"]
    }, 
    legend=False,
    subplot_kwargs={"main": "x1", "group": "fac2", "panel": "fac2"},
);

I was expecting to get the second plot with the code for the first plot. I think we got the result we got because we first generate the data, and only then, we use the subplot_kwargs? At that point, it's just too late, you only have one value of fac2

codecov-commenter · 2024-02-23T17:10:08Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 90.16%. Comparing base (b5b9f09) to head (bdb48d8).
Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #780      +/-   ##
==========================================
+ Coverage   89.86%   90.16%   +0.29%     
==========================================
  Files          46       46              
  Lines        3810     3814       +4     
==========================================
+ Hits         3424     3439      +15     
+ Misses        386      375      -11

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

GStechschulte · 2024-02-23T17:32:55Z

@tomicapretto thanks! Plot 1 is displaying correctly. It is because you are not explicitly passing fac2 to conditional. Which results in, as you stated, a single default value computed for fac2. The single value cannot have any subplots.

This is the behavior both interpret and marginaleffects uses if a covariate was specified in the model, but not passed to conditional.

bmb.interpret.plot_predictions(
    model, 
    idata, 
    conditional=["x1", "fac2"], 
    subplot_kwargs={"main": "x1", "group": "fac2", "panel": "fac2"},
    legend=False
);

tomicapretto · 2024-02-23T18:58:31Z

Thanks @GStechschulte! I think this is done.

I know the test is actually testing many things at the same time, not just the fix. But I think it's not possible to write a test for the fix in particular, and if possible, it would be so complicated.

GStechschulte

LGTM! However, my knowledge on the implementation of HSGP in Bambi is a bit lacking.

GStechschulte · 2024-02-29T15:07:53Z

Thanks @GStechschulte! I think this is done.

I know the test is actually testing many things at the same time, not just the fix. But I think it's not possible to write a test for the fix in particular, and if possible, it would be so complicated.

Yup, I agree. It is also nice to have the text for interpret in there.

* Delete all HSGP slices at the same time * Make interpret consider kwargs in function calls * Update code of conduct (bambinos#783) * Update code of conduct * update changelog * Update formulae to >=0.5.3 * start a test for the hsgp and 'by' * update changelog

* use bayeux to access a wide range of samplers * use bayeux to access a wide range of samplers * add notebook links to family table (#774) * access methods programatically * clean bayeux idata to be consistent with pymc model coords * rename alternative sampler args in tests * change docstring to reflect bayeux sampler names * bayeux dependencies are numpyro/jax/jaxlib/blackjax * rename idata coords and dims to PyMC model * add JAX based sampler dependencies * Update code of conduct (#783) * Update code of conduct * update changelog * [WIP] Fix HSGP predictions (#780) * Delete all HSGP slices at the same time * Make interpret consider kwargs in function calls * Update code of conduct (#783) * Update code of conduct * update changelog * Update formulae to >=0.5.3 * start a test for the hsgp and 'by' * update changelog * bayeux 0.1.9 updates * bump bayeux version * remove TFP methods, optimizers, and resolve pylint errors * alternative backends docs * tests for JAX based samplers except TFP * add TFP backend example * add TFP MCMC methods * don't use flowmc, chees, meads for categorical model * call model.backend.inference_methods to show list of samplers * docstring changes * inference_methods attribute and change JAX random seed * Add FutureWarning to inference_method parameter * black formatting and resolve pylint errors * fix package name * drop 3.9 and add 3.12 to testing matrix * change Python versions in requires-python and target-version * remove python 3.11 black target-version * pin requires-python to <3.13 * pip upgrade setuptools * Bump PyMC to 5.12 * Upgrade black and pylint * remove upgrading of setup tools --------- Co-authored-by: Tomás Capretto <tomicapretto@gmail.com>

tomicapretto added 2 commits February 6, 2024 20:28

Delete all HSGP slices at the same time

b1566b3

Make interpret consider kwargs in function calls

842cac2

tomicapretto added 4 commits February 21, 2024 15:16

Update code of conduct (bambinos#783)

d209f4c

* Update code of conduct * update changelog

Merge branch 'fix_hsgp_prediction' of github.com:tomicapretto/bambi i…

6f868fe

…nto fix_hsgp_prediction

Update formulae to >=0.5.3

3aecb7f

start a test for the hsgp and 'by'

fa95fb9

update changelog

bdb48d8

tomicapretto marked this pull request as ready for review February 23, 2024 18:55

tomicapretto requested a review from GStechschulte February 23, 2024 18:56

GStechschulte approved these changes Feb 29, 2024

View reviewed changes

GStechschulte merged commit ff685b7 into bambinos:main Feb 29, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Fix HSGP predictions #780

[WIP] Fix HSGP predictions #780

tomicapretto commented Feb 18, 2024 •

edited

Loading

GStechschulte commented Feb 19, 2024

tomicapretto commented Feb 23, 2024

codecov-commenter commented Feb 23, 2024 •

edited

Loading

GStechschulte commented Feb 23, 2024 •

edited

Loading

tomicapretto commented Feb 23, 2024

GStechschulte left a comment

GStechschulte commented Feb 29, 2024

[WIP] Fix HSGP predictions #780

[WIP] Fix HSGP predictions #780

Conversation

tomicapretto commented Feb 18, 2024 • edited Loading

GStechschulte commented Feb 19, 2024

tomicapretto commented Feb 23, 2024

codecov-commenter commented Feb 23, 2024 • edited Loading

Codecov Report

GStechschulte commented Feb 23, 2024 • edited Loading

tomicapretto commented Feb 23, 2024

GStechschulte left a comment

Choose a reason for hiding this comment

GStechschulte commented Feb 29, 2024

tomicapretto commented Feb 18, 2024 •

edited

Loading

codecov-commenter commented Feb 23, 2024 •

edited

Loading

GStechschulte commented Feb 23, 2024 •

edited

Loading