laplace method #800

jgabry · 2023-07-30T01:09:36Z

Submission Checklist

Run unit tests
Declare copyright holder and agree to license (see below)

Summary

Closes #760
Builds off of PR #799

Implements new method CmdStanModel$laplace() with new fitted model class CmdStanLaplace.
Follows the design from @WardBrian in #760 (comment).

Copyright and Licensing

Please list the copyright holder for the work you are submitting
(this will be you or your assignee, such as a university or company):
Columbia University

By submitting this pull request, the copyright holder is agreeing to
license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

closes #760

jgabry · 2023-07-30T01:10:47Z

Example usage

file <- file.path(cmdstan_path(), "examples/bernoulli/bernoulli.stan")
mod <- cmdstan_model(file)
mod$print()

stan_data <- list(N = 10, y = c(0,1,0,0,0,0,0,0,0,1))

# pass CmdStanMLE from optimize to 'mode' argument of laplace
fit_mode <- mod$optimize(data = stan_data, jacobian = TRUE)
fit_laplace <- mod$laplace(data = stan_data, mode = fit_mode)
fit_laplace$summary()
fit_laplace$draws("theta")

# can also pass CSV file to 'mode' argument
fit_laplace <- mod$laplace(data = stan_data, mode = fit_mode$output_files())
fit_laplace$summary()

# if mode isn't specified optimize is run internally first
# can pass arguments to optimize via opt_args
fit_laplace <- mod$laplace(data = stan_data, opt_args = list(iter = 200))
fit_laplace$summary()

codecov-commenter · 2023-07-30T02:32:03Z

Codecov Report

Merging #800 (8a9ae96) into master (17678d5) will increase coverage by 0.30%.
The diff coverage is 98.54%.

❗ Current head 8a9ae96 differs from pull request most recent head 5c2dbbd. Consider uploading reports for the commit 5c2dbbd to get more accurate results

@@            Coverage Diff             @@
##           master     #800      +/-   ##
==========================================
+ Coverage   88.19%   88.49%   +0.30%     
==========================================
  Files          12       12              
  Lines        4218     4347     +129     
==========================================
+ Hits         3720     3847     +127     
- Misses        498      500       +2

Files Changed	Coverage Δ
R/example.R	`97.56% <ø> (ø)`
R/fit.R	`96.06% <88.88%> (-0.16%)`	⬇️
R/model.R	`92.31% <98.41%> (+0.49%)`	⬆️
R/args.R	`97.87% <100.00%> (+0.11%)`	⬆️
R/csv.R	`96.62% <100.00%> (+0.19%)`	⬆️
R/run.R	`93.90% <100.00%> (+0.01%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

jgabry · 2023-07-31T17:01:15Z

I think this is ready for review. I've implemented everything that I can think of. Anything I need to do when adding a new method that I'm forgetting?

jgabry · 2023-07-31T23:58:26Z

Any ideas why everything is passing except with the WSL backend?

will continue to accept output_samples to not break backwards compatibility

avehtari · 2023-08-03T16:19:51Z

I'm still in progress of testing, but one observation now. The laplace method has argument draws but the variational method has output_samples and furthermore the posterior package resample_draws has ndraws. Could we have some common argument name?

jgabry · 2023-08-03T16:35:04Z

I agree. Those are the names CmdStan uses, but I actually added the draws argument to variational in a recent commit to match Laplace. output_samples will still work but we can deprecate it eventually.

avehtari · 2023-08-04T08:46:52Z

I guess this should be done in CmdStan, but mentioning now here. It would be nice to be able to be able to parallelize the computation for the draws from the approximation similarly way as $sampling() can do. Often the optimization is fast, but also often obtaining 4000 draws and computing lp__ and lp_approx__ takes time that is similar to getting 1000 MCMC draws.

avehtari · 2023-08-04T10:25:46Z

Here's an example to do resample importance sampling

file <- file.path(cmdstan_path(), "examples/bernoulli/bernoulli.stan")
mod <- cmdstan_model(file)
mod$print()

stan_data <- list(N = 10, y = c(0,1,0,0,0,0,0,0,0,0))
fit_mode <- mod$optimize(data = stan_data, jacobian = TRUE)
# 1000 draws from the approximation
fit_laplace <- mod$laplace(data = stan_data, mode = fit_mode, refresh=0)
draws_laplace <- fit_laplace$draws(variables="theta")
# Compute Pareto smoothed importance sampling weights
(psis_laplace <- psis(log_ratios=fit_laplace$draws("lp__")-fit_laplace$draws("lp_approx__"), r_eff=1))
# Resample 1000 draws using simple resampling without resampling
draws_psis <- resample_draws(draws_laplace,
                             weights = exp(psis_laplace$log_weights),
                             ndraws = 1000,
                             method = 'simple')

summarize_draws(draws_laplace, default_summary_measures())
summarize_draws(draws_psis, default_summary_measures())

library(patchwork)
p1 <- mcmc_areas(draws_laplace) + xlim(c(0,0.7))
p2 <- mcmc_areas(draws_psis) + xlim(c(0,0.7))
p1 / p2

avehtari · 2023-08-04T12:49:29Z

Is it possible to get draws without evaluating lp__ and lp_approx__? In the cases when I would trust the normal approximation, there is unnecessary but computationally potentially very costly evaluation of lp__.

jgabry · 2023-08-04T16:37:17Z

I guess this should be done in CmdStan, but mentioning now here. It would be nice to be able to be able to parallelize the computation for the draws from the approximation similarly way as $sampling() can do. Often the optimization is fast, but also often obtaining 4000 draws and computing lp__ and lp_approx__ takes time that is similar to getting 1000 MCMC draws.

I agree. But yeah I think this is more of a CmdStan issue than a CmdStanR issue.

jgabry · 2023-08-04T16:38:21Z

Is it possible to get draws without evaluating lp__ and lp_approx__? In the cases when I would trust the normal approximation, there is unnecessary but computationally potentially very costly evaluation of lp__.

Not by changing anything in CmdStanR unfortunately. All we're doing is providing access to what CmdStan has already computed and written to CSV, so this would need to happen in CmdStan.

jgabry · 2023-08-04T16:40:02Z

Here's an example to do resample importance sampling

file <- file.path(cmdstan_path(), "examples/bernoulli/bernoulli.stan")
mod <- cmdstan_model(file)
mod$print()

stan_data <- list(N = 10, y = c(0,1,0,0,0,0,0,0,0,0))
fit_mode <- mod$optimize(data = stan_data, jacobian = TRUE)
# 1000 draws from the approximation
fit_laplace <- mod$laplace(data = stan_data, mode = fit_mode, refresh=0)
draws_laplace <- fit_laplace$draws(variables="theta")
# Compute Pareto smoothed importance sampling weights
(psis_laplace <- psis(log_ratios=fit_laplace$draws("lp__")-fit_laplace$draws("lp_approx__"), r_eff=1))
# Resample 1000 draws using simple resampling without resampling
draws_psis <- resample_draws(draws_laplace,
                             weights = exp(psis_laplace$log_weights),
                             ndraws = 1000,
                             method = 'simple')

summarize_draws(draws_laplace, default_summary_measures())
summarize_draws(draws_psis, default_summary_measures())

library(patchwork)
p1 <- mcmc_areas(draws_laplace) + xlim(c(0,0.7))
p2 <- mcmc_areas(draws_psis) + xlim(c(0,0.7))
p1 / p2

Cool, thanks! Maybe we should put this in the documentation somewhere.

avehtari

Looks good. Added one suggestion

R/fit.R

jgabry · 2023-09-19T16:05:58Z

@andrjohns When you have time would you be able to check if you can figure out why this is failing on WSL only? Thank you!

andrjohns · 2023-09-20T06:39:58Z

Ack sorry for forgetting about this! I'll have time to dig in on Friday

jgabry · 2023-09-20T14:20:45Z

No worries, and thanks!

andrjohns · 2023-09-25T04:54:04Z

Just a heads up that I'm still working on this, I'm on an M1 Mac as my daily driver now so I'm just ironing out some kinks on getting WSL running in my windows VM

jgabry · 2023-09-25T15:39:29Z

Ok thanks for working on this @andrjohns

andrjohns · 2023-09-25T15:39:56Z

@jgabry ended up being a super minor issue of path handling, all good to go now!

jgabry · 2023-09-25T18:39:25Z

Awesome, thanks @andrjohns! Glad it turned out to be a simple fix. Everything is passing so I'll go ahead and merge.

initial attempt at laplace method

9c2643c

closes #760

jgabry added 2 commits July 29, 2023 19:19

Update model.R

0b2a677

Update model-method-laplace.Rd

9ea8fd2

jgabry mentioned this pull request Jul 30, 2023

enable jacobian argument for optimization #799

Merged

2 tasks

jgabry added 9 commits July 30, 2023 11:46

Merge branch 'master' into laplace-sample

2528523

Delete fit-temp.rds

04cfb4a

fix link to optimize method doc

3558eda

Update model.R

e821436

tests for laplace CmdStanModel method

64501a0

more tests for laplace method

b66f4d1

fix doc

e229fc4

a few more tests

7984814

Update _pkgdown.yml

aa286c3

jgabry marked this pull request as ready for review July 31, 2023 17:00

jgabry requested review from rok-cesnovar and andrjohns July 31, 2023 17:00

fix r cmd check warning

513444c

jgabry added 4 commits August 2, 2023 16:28

Merge branch 'master' into laplace-sample

0c0ce6f

change output_samples in variational to draws for consistency

101810e

will continue to accept output_samples to not break backwards compatibility

add laplace section to vignette

54ec3ad

fix vignette error

c9d8fc2

fix failing test

3c3a333

Merge branch 'master' into laplace-sample

18e92f9

avehtari reviewed Aug 22, 2023

View reviewed changes

R/fit.R Outdated Show resolved Hide resolved

jgabry added 5 commits August 22, 2023 12:12

update doc with Aki's suggestion

4d172ae

Merge branch 'master' into laplace-sample

d8eab9e

Debug on WSL: Turn off running vignette so unit tests run

5ae78ca

Merge branch 'master' into laplace-sample

f447e92

undo turning off vignette

e13fb1a

jgabry changed the title ~~Add laplace method~~ laplace method Sep 14, 2023

Merge branch 'master' into laplace-sample

467054f

Merge branch 'master' into laplace-sample

5c2dbbd

andrjohns added 2 commits September 25, 2023 11:23

Update file path for WSL

620458f

Fix non-WSL asserts

1a0a97d

jgabry merged commit a63e418 into master Sep 25, 2023
12 checks passed

jgabry deleted the laplace-sample branch September 25, 2023 18:39

jgabry mentioned this pull request Sep 25, 2023

Pathfinder #848

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

laplace method #800

laplace method #800

jgabry commented Jul 30, 2023 •

edited

Loading

jgabry commented Jul 30, 2023 •

edited

Loading

codecov-commenter commented Jul 30, 2023 •

edited

Loading

jgabry commented Jul 31, 2023

jgabry commented Jul 31, 2023

avehtari commented Aug 3, 2023

jgabry commented Aug 3, 2023

avehtari commented Aug 4, 2023

avehtari commented Aug 4, 2023 •

edited

Loading

avehtari commented Aug 4, 2023

jgabry commented Aug 4, 2023

jgabry commented Aug 4, 2023

jgabry commented Aug 4, 2023

avehtari left a comment

jgabry commented Sep 19, 2023

andrjohns commented Sep 20, 2023

jgabry commented Sep 20, 2023

andrjohns commented Sep 25, 2023

jgabry commented Sep 25, 2023

andrjohns commented Sep 25, 2023

jgabry commented Sep 25, 2023

laplace method #800

laplace method #800

Conversation

jgabry commented Jul 30, 2023 • edited Loading

Submission Checklist

Summary

Copyright and Licensing

jgabry commented Jul 30, 2023 • edited Loading

Example usage

codecov-commenter commented Jul 30, 2023 • edited Loading

Codecov Report

jgabry commented Jul 31, 2023

jgabry commented Jul 31, 2023

avehtari commented Aug 3, 2023

jgabry commented Aug 3, 2023

avehtari commented Aug 4, 2023

avehtari commented Aug 4, 2023 • edited Loading

avehtari commented Aug 4, 2023

jgabry commented Aug 4, 2023

jgabry commented Aug 4, 2023

jgabry commented Aug 4, 2023

avehtari left a comment

Choose a reason for hiding this comment

jgabry commented Sep 19, 2023

andrjohns commented Sep 20, 2023

jgabry commented Sep 20, 2023

andrjohns commented Sep 25, 2023

jgabry commented Sep 25, 2023

andrjohns commented Sep 25, 2023

jgabry commented Sep 25, 2023

jgabry commented Jul 30, 2023 •

edited

Loading

jgabry commented Jul 30, 2023 •

edited

Loading

codecov-commenter commented Jul 30, 2023 •

edited

Loading

avehtari commented Aug 4, 2023 •

edited

Loading