fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean #974

laresbernardo · 2024-05-13T07:36:17Z

Fix on #960

gufengzhou · 2024-05-20T02:32:48Z

Thanks for updating @laresbernardo . Would you please test the current change vs the main branch on refresh each to see if everything is identical? It'd be identical when you compare the JSONs and the list "hyper_values" are having the same values. Also when testing refresh, make sure to test 2 separate refresh steps (rf1 & rf2) to see if the whole chaining etc work properly. Thanks!

laresbernardo · 2024-05-20T11:40:06Z

Hey @gufengzhou

Yes, they give the same hyper_values. The decimals are rounded somewhere, so we get 0.5769212583 vs 0.5769213, but looking good.

The issue here is that Robyn$listInit$OutputCollect$hyper_updated is not available when you start building the first refresh model. That is why the user was having issues and crashed. We only have Robyn$listInit$OutputCollect$hyper_fixed which is TRUE and the attr(,"hypParamSamName") has the names of the updated hyperparameters (including penalties, etc). Changing the code a little bit I was able to get around this error but the following question arises: should the penalties/lambda hyps be fixed or should be allow for the model to find new ones within the (unexisting) bounds freedom? If they are fixed I can commit a change that will fix it all; if not, then I have to fix it so that maybe we use the fixed value +- the bounds freedom in those specific hyps. Let me know your thoughts...

gufengzhou · 2024-05-21T16:30:28Z

Thanks for digging. Theoretically refresh should also iterate through penalty. we just didn't have users going this far so far.

…ameter to overwrite default calculation

laresbernardo · 2024-05-22T09:15:57Z

Alright, did some tweaks all around and I feel more confident on refreshing results. I've:

feat: created a new bounds_freedom parameter so that users can overwrite the flexibility on hyperparameters bounds for refreshed models (default remains the same)
feat: more flexibility on chain logic so original file may be elsewhere
fix: fixed hyperparameters for lambda, penalties, train_test are now flexible, using the refresh flexibility bounds. Results are way more consistent and similar to original model
fix: issues with refresh plots when chain is not followed correctly

laresbernardo · 2024-05-28T07:44:54Z

@gufengzhou can you please check the Fitted vs actual calculations when refreshing? For some reason, the baseline is a large number and the predictions are small numbers (as if they haven't been scaled - check rowSums(xDecompVec)); so you get almost a straight line. Funny thing is that we haven't specifically changed anything there... unless I'm getting terrible fit because of the hyps constraints(?)

amanrai2508 · 2024-05-28T11:12:19Z

Hi @laresbernardo

Can you check this issue as well , I am getting way different result when I am doing data refresh (for 2 weeks of data)
: #985

- negative trend is not interpretable for MMM - force negative coef when trend is negative to get positive decomp

…dstock feat: instead of Inf, use channel_constr_up, which by default is 10 for target_efficiency

gufengzhou · 2024-06-13T13:29:43Z

@gufengzhou can you please check the Fitted vs actual calculations when refreshing? For some reason, the baseline is a large number and the predictions are small numbers (as if they haven't been scaled - check rowSums(xDecompVec)); so you get almost a straight line. Funny thing is that we haven't specifically changed anything there... unless I'm getting terrible fit because of the hyps constraints(?)

If you check report_aggregated.csv, you'll see that the intercept are wrong. It was dropped in the initial model but not dropped in the refresh. The dropping of intercept happens in the refit function within robyn_mmm @laresbernardo

amanrai2508 · 2024-06-14T06:50:03Z

Hi @laresbernardo @gufengzhou
This is the same issue that I am facing as well : #985
If you need any help with respect to testing, I can help you with this. (Robyn refresh is flawed currently because of this)

@gufengzhou can you please check the Fitted vs actual calculations when refreshing? For some reason, the baseline is a large number and the predictions are small numbers (as if they haven't been scaled - check rowSums(xDecompVec)); so you get almost a straight line. Funny thing is that we haven't specifically changed anything there... unless I'm getting terrible fit because of the hyps constraints(?)

The refactoring of initBounds & listOutputPrev in refresh_hyps was wrong in 774c18d

laresbernardo · 2024-06-15T19:48:27Z

Ok. It looks good now. @amanrai2508 would you mind updating Robyn to this branch (Robyn::robyn_update(ref = "bl02") + refresh R session) and testing the refresh functionality? Before we merge I'd like to hear from you. Thanks!

amanrai2508 · 2024-06-16T15:45:51Z

Hi @laresbernardo
Now the refresh looks good. (For some data it is off but for most of the data it is working)
Just wanted to know why we are plotting onepager of a solutionid where topsolution is False ( Though the results are similar in this solution as well)

laresbernardo · 2024-06-17T08:05:33Z

Could you please check if model 1_142_5 is the solution with the lowest DECOMP.RSSD error across all Pareto-front models? The top solutions are the minimum combined error models per cluster, which may not match.

amanrai2508 · 2024-06-17T09:35:45Z

Yeah, it has the lowest DECOMP.RSSD.
It looks fine on my end @laresbernardo ,now the values are more credible.

* build(deps): bump braces from 3.0.2 to 3.0.3 in /website (#997) * fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean (#974) * fix: refresh hyps check #960 + use data available in json * fix: update based on gz's comments * fix: fixed penalties and other fixed hyps on refreshing models * fix: refresh plot when chain is broken + feat: new bounds_freedom parameter to overwrite default calculation * fix: import and store original model when not in original plot_dir * recode: applied styler::tidyverse_style() to clean code for CRAN * fix: paid_media_total calc * fix: print ExportedModel only when available * fix: deal with negative trend - negative trend is not interpretable for MMM - force negative coef when trend is negative to get positive decomp * fix: upper constraint issue on BA for target_efficiency and weibull adstock feat: instead of Inf, use channel_constr_up, which by default is 10 for target_efficiency * fix: reverse wrong bounds update in refresh_hyps The refactoring of initBounds & listOutputPrev in refresh_hyps was wrong in 774c18d * recode: apply styler::tidyverse_style() --------- Co-authored-by: gufengzhou <gufengzhou@gmail.com> --------- Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: gufengzhou <gufengzhou@gmail.com>

fix: refresh hyps check #960 + use data available in json

5ef8c50

laresbernardo added the bug Something isn't working label May 13, 2024

laresbernardo requested a review from gufengzhou May 13, 2024 07:36

laresbernardo self-assigned this May 13, 2024

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 13, 2024

fix: update based on gz's comments

ba2a318

Merge branch 'main' into bl02

6699559

laresbernardo added 3 commits May 22, 2024 10:26

fix: fixed penalties and other fixed hyps on refreshing models

774c18d

fix: refresh plot when chain is broken + feat: new bounds_freedom par…

5895a5e

…ameter to overwrite default calculation

fix: import and store original model when not in original plot_dir

3d94c85

recode: applied styler::tidyverse_style() to clean code for CRAN

fb21795

laresbernardo assigned gufengzhou May 28, 2024

laresbernardo and others added 4 commits June 3, 2024 18:01

fix: paid_media_total calc

a061843

fix: print ExportedModel only when available

3401c1a

fix: deal with negative trend

57d14db

- negative trend is not interpretable for MMM - force negative coef when trend is negative to get positive decomp

fix: upper constraint issue on BA for target_efficiency and weibull a…

d649356

…dstock feat: instead of Inf, use channel_constr_up, which by default is 10 for target_efficiency

laresbernardo changed the title ~~fix: refresh hyps check + use data available in json~~ fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean Jun 11, 2024

gufengzhou approved these changes Jun 12, 2024

View reviewed changes

gufengzhou and others added 2 commits June 15, 2024 12:27

fix: reverse wrong bounds update in refresh_hyps

3221bc1

The refactoring of initBounds & listOutputPrev in refresh_hyps was wrong in 774c18d

recode: apply styler::tidyverse_style()

6e865a7

laresbernardo merged commit 3bc8422 into main Jun 17, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean #974

fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean #974

laresbernardo commented May 13, 2024

gufengzhou commented May 20, 2024

laresbernardo commented May 20, 2024

gufengzhou commented May 21, 2024

laresbernardo commented May 22, 2024

laresbernardo commented May 28, 2024 •

edited

Loading

amanrai2508 commented May 28, 2024

gufengzhou commented Jun 13, 2024 •

edited

Loading

amanrai2508 commented Jun 14, 2024

laresbernardo commented Jun 15, 2024

amanrai2508 commented Jun 16, 2024 •

edited

Loading

laresbernardo commented Jun 17, 2024

amanrai2508 commented Jun 17, 2024

fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean #974

fix: refresh hyps check + use data available in json + refresh hyps + upper constraints fix when higher than mean #974

Conversation

laresbernardo commented May 13, 2024

gufengzhou commented May 20, 2024

laresbernardo commented May 20, 2024

gufengzhou commented May 21, 2024

laresbernardo commented May 22, 2024

laresbernardo commented May 28, 2024 • edited Loading

amanrai2508 commented May 28, 2024

gufengzhou commented Jun 13, 2024 • edited Loading

amanrai2508 commented Jun 14, 2024

laresbernardo commented Jun 15, 2024

amanrai2508 commented Jun 16, 2024 • edited Loading

laresbernardo commented Jun 17, 2024

amanrai2508 commented Jun 17, 2024

laresbernardo commented May 28, 2024 •

edited

Loading

gufengzhou commented Jun 13, 2024 •

edited

Loading

amanrai2508 commented Jun 16, 2024 •

edited

Loading