Prior in Gaussian random walk #34

ivandebono · 2020-10-09T22:29:24Z

Can you explain the choice of prior for the Gaussian random walk? How did you choose this value for sigma?

           log_r_t = pm.GaussianRandomWalk(
                "log_r_t",
                sigma=0.035,
                dims=["date"] )

The text was updated successfully, but these errors were encountered:

michaelosthege · 2020-10-12T15:08:24Z

The sigma here was chosen by Kevin while iterating on the model. This sigma just works well.

If you manage to put a prior on this & fit/sample the sigma, that would be very interesting.

ivandebono · 2020-10-13T07:23:37Z

What do you mean when you say it works well? Has it been fitted to some data?

I found that changing the fixed parameters in the model makes a significant difference to the final result. So it's not just sigma, but also the seed population, the minimum exposure, and the maximum of the generation interval.

michaelosthege · 2020-10-13T07:26:55Z

For generation interval it's expected to make a difference. I expect the seed to make a difference for the first month and the sigma for the smoothing.
If you can share some plots &or make a case for how to improve the model, that would be greatly appreciated!

ivandebono · 2020-10-14T22:39:51Z

Using United Kingdom data, I differences were not that significant, especially towards the end of the time series.

The original values in the code:

Now some different values. Sigma is a random variable with a uniform prior. Generation interval prior runs from 0 to 30 days. Lower limit of exposure is 0.05.

Fixed sigma, and generation interval prior cutoff at 30 days.

Cutoff at 40 days.

I have some of suggestions for the code, some of which I'm trying to implement myself. But I recognise the difficulty.

A correction to find the number of true positives (given that most tests are now greater than cases, and most are PCR). I implemented this separately, using PyMC3.
A cross-correlation with deaths. This is tricky, because the case fatality rate depends heavily on the age profile of the cases. However, we could set the CFR as a random variable.
A question: Why do you use the median of the posterior sample for R(t) rather than the mean?
Some kind of extrapolation of the test volume backwards, towards the beginning of the time series when data are unavailable.
A question: The method relies on tests being greater than cases. Now this isn't always the case, especially in European data sets. How does this affect the estimate of R(t)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prior in Gaussian random walk #34

Prior in Gaussian random walk #34

ivandebono commented Oct 9, 2020

michaelosthege commented Oct 12, 2020

ivandebono commented Oct 13, 2020

michaelosthege commented Oct 13, 2020

ivandebono commented Oct 14, 2020

Prior in Gaussian random walk #34

Prior in Gaussian random walk #34

Comments

ivandebono commented Oct 9, 2020

michaelosthege commented Oct 12, 2020

ivandebono commented Oct 13, 2020

michaelosthege commented Oct 13, 2020

ivandebono commented Oct 14, 2020