-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[New] Relationship of OxCGRT index and parameter values (short-term prediction) #280
Comments
This is just an experimental analysis to find the relasionship of the parameters and government responses. This is related to the discussion with @ilyasst and @joydisette in #3 We need to perform machine/deep learning. #204 and #205 are also related. However, parameter estimation for all countries is a very time-consuming task. We have too many countries and the number of phases are incleasing every day. #225 must be solved in advance. |
Yes, I think #225 must definitely be solved in advance. You mean machine learning for pattern recognition of the relationships?
|
As it is and this is just an experimental analysis to find the solutions to predict the parameter values using government responses with deep learning. They are for feature selection. I do not know which index is necessary in the solution. Correlation table, scatter plot and deep learning is helpful to find the useful index. |
|
For example, if a country applies quarantine measures on X day, we know that most probably the daily new cases will decrease in around 2-3 weeks after that X day. But the estimation analysis will give us increased cases for that period, since the parameters will be different and we simply cannot forecast this. Except if we somehow insert this index into the simulator analysis as an extra input parameter and affect the simulated cases. Because these simulated cases otherwise are not realistic and representative of our current knowledge that extra measures are in effect. We need to let the model know about that and the best way would be that index. |
I intended to analyse the relationship with
What do you think about these steps? Sentences in bold will be the most difficult part. |
Why is necessary to have a web service/RESTful API for such relationship? Can't we run these in advance once for a specific set of countries and then find this relationship on-the-fly? Regarding the above algorithm I think this would suffice as a standalone solution and would enable the model to consider the various government measures in effect. I think this is vital to be implemented soon. And if not a complete solution, for a starting point it would be enough to apply this to a single future phase (one predicted set of parameters) or to the next month, for short-term impact. Another question more general, what is the physical meaning of the estimated parameters? Do they make sense, are the parameters logical? For example, for Greece the Rt now is 19.5 . What does this mean? Is it logical that one individual can infect other 19 people? Or is it just a value with no realistic meaning, that serves only for fitting of the data to the model? |
Which do you want to use for this analysis, If If If another class, please explain the detailed steps of your idea.
Reproduction number is a index to know whether outbreaking (Rt > 1) or not. Rho, sigma and kappa are functions of control factors as explained in Factors of model parameters section of my Kaggle Notebook. |
I think for a first solution implementation a |
[MEMO]
|
May I suggest another way to predict future values? |
Yes, time series forcasting only with parameter values is an alternative. However, I tried a prototype of this solution in the bottom of the notebook I mentioned in the previous comment and failed in forcasting as shown in line 97. How can we improve it? |
You mean a prototype of which solution, the alternative I described or the one you had in mind with the index? |
Perhaps a time varying autoregressive model would be more appropriate for fitting |
Linear regression part was for the idea with OxCGRT scores. This is not related to the alternative you nentioned. |
MEMO: https://gist.github.com/lisphilar/8f492770cd4c306b081873ca71b7871d |
I tired time series forcasting without OxCGRT scores, but it seems difficult to forecast parameter values with this solution because parameter values show wild ups and downs. |
This is very good I think!
It is a nice first approach I think. Also try to use AutoARIMA as well in the first model selection (they have same score with exponential smoothing). A general note, I think the RMSLE by itself is not that much credible, because the parameter values are very small. |
How OxCGRT index is combined and used in forecasting? |
This seems very promising indeed!! |
I try to use recovery period (=17 days) rather than 14 days as |
In order to calculate the delay? Hmm.. if you compared the dates per country when the index was changed rapidly or critical measures were imposed to the dates of the phases (from S-R trend amalysis) or the dates when parameters changed rapidly? Like applying change point analysis but in parameters-index plane instead of S-R. Averaging of these change points duration then could indicate such delay period. |
It seems a difficult issue and this will be solved in the future versions... I created pull request #471 as the first step. Usage: snl = cs.Scenario(jhu_data, population_data, "Japan")
snl.trend()
snl.estimate(cs.SIRF)
snl.predict(oxcgrt_data)
snl.summary()
snl.simulate()
snl.history("Rt") I may rename |
#471 was merged and tutorial of |
In this example notebook, this code was used to include the delay period of 14 days:
However, this delay is different for each country and the 'end' of the effects from Policy Measures is also very different. I made a short overview at the bottom of this notebook to identify the 'ending' effect of policy measures: Just wanted to share this for any new ideas of implementations. |
Yes as far as I know the delay should not be a fixed value, but calculated dynamically for each country. In this first implementation the delay is set to the recovery period just to have a first working functionality. This will have to be revised.
|
The delay period will be reworked with issue #513. |
Very very interesting. We will move forward to the new issue. |
What we need to document?
I am referring to https://lisphilar.github.io/covid19-sir/usage_policy.html, in specific to the
(Experimental): Relationship of OxCGRT index and parameter values
section.What all these results actually mean? Could you provide a more detailed documentation and analysis, about the OxCGRT index usage and how it affects the parameter values for each country, with examples and practical explanations?
The results have to be reworked/scaled, in order to display properly all the points around 0-100 zone and to ignore the high outliers.
The text was updated successfully, but these errors were encountered: