Fixed all the bugs of save_resume #1917
Conversation
bremen79 commented Jun 6, 2019
- The state of FTRL models is now saved (added a parameter to save_load_online_state)
- Fixed a bug in save_load_online_state: the state is now saved even for features that have w=0. This affected --l1 and FTRL models that accumulate gradients for equal weights
- Moved total_weight from gd to the vw struct, so FTRL can use and save it
- Added save_resume lines for ftrl, pistol, coin
…n by average length of the feature vectors
…than default one in oaa and cbify
This looks good other than the additional use of global state. Can we avoid it?
```diff
@@ -559,14 +559,14 @@ float get_pred_per_update(gd& g, example& ec)
   if (!stateless)
   {
     g.all->normalized_sum_norm_x += ((double)ec.weight) * nd.norm_x;
-    g.total_weight += ec.weight;
+    g.all->total_weight += ec.weight;
```
Why prefer a global?
The general rule of thumb is to use variables that are as local as possible, to minimize context.
It is not clear to me how to solve this; I am open to suggestions.
On the one hand, the vw struct already contains similar quantities: power_t, invariant_updates, normalized_sum_norm_x, and others. These are specific to gd, yet they live in a global place.
On the other hand, the problem comes from reusing GD::save_load_online_state in ftrl: we don't have access to the ftrl data that way. We could duplicate and customize the entire save-state function in ftrl, but that seems painful. Or we could extend GD::save_load_online_state with even more optional inputs, which also seems like a bad idea.
Of the three options, an extra argument seems preferable to either a global or code duplication.
Code duplication seems particularly bad: it's a recipe for non-maintainability.
The global variable moves in the wrong direction; we are working towards atomizing the reductions so they can be composed with other learning algorithms.
The extra-arguments approach seems the best. In the long term, we'd probably want to adjust the arguments so they are semantic rather than algorithm-specific. Basically, instead of having ftrl, we'd have "the number of floats per weight to store", etc. But this is a minor refactoring consistent with the extra-arguments approach.
Closing in favor of #1919, which tweaks this one.