
feat: spin off automl predict_only_model to standard cb model #4279

Merged
46 commits merged into VowpalWabbit:master, Dec 6, 2022

Conversation

bassmang (Member)

No description provided.

bassmang marked this pull request as draft November 10, 2022 22:17
bassmang requested a review from lalo November 15, 2022 19:04
bassmang marked this pull request as ready for review November 16, 2022 14:41
uint64_t multiplier = static_cast<uint64_t>(data.cm->wpp) << data.cm->weights.stride_shift();
for (uint32_t index = 0; index < data.cm->weights.mask(); index += multiplier)
{
if (data.cm->weights[index] != 0.0f)
Collaborator:

this might not be compatible with ftrl/coin - you would need to check all the other values in the same chunk just like here

if (*v != 0. || (&(*v))[1] != 0. || (&(*v))[2] != 0. || (&(*v))[3] != 0. || (&(*v))[4] != 0. ||

Collaborator:

also might be worth checking if GD writes to model if we have something like w[index] == 0.0f, w[index+1] != 0.0f, w[index+2] != 0.0f, w[index+3] != 0.0f.

bassmang (Member Author):

updated to check all weights within the chunk as opposed to just the first
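
For illustration, a minimal sketch of that chunk-wide check (not the exact code merged here); WeightsT stands in for a dense weight container assumed to expose operator[] and stride_shift() like the one quoted above:

#include <cstdint>

// Returns true if any slot in the stride-sized chunk starting at `index`
// is non-zero. Checking only weights[index] would miss the extra per-weight
// state that optimizers such as ftrl/coin keep in the remaining slots.
template <typename WeightsT>
bool chunk_has_nonzero(const WeightsT& weights, uint64_t index)
{
  const uint32_t chunk_size = static_cast<uint32_t>(1) << weights.stride_shift();
  for (uint32_t slot = 0; slot < chunk_size; ++slot)
  {
    if (weights[index + slot] != 0.0f) { return true; }
  }
  return false;
}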

]
},
{
"id": 426,
Collaborator:

you can also recycle this unit test:

BOOST_AUTO_TEST_CASE(automl_equal_no_automl_w_iterations)

the line 280 model is -b 20 and the line 279 one is -b 18, so if you run --predict_only_model on the line 280 model it should be equal to the line 279 one -- as in exactly the same, bit by bit.

bassmang (Member Author):

made a test automl_equal_spin_off_model which is similar and shows the weights matching
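
As a side note, the bit-by-bit comparison mentioned above could be expressed with a helper along these lines (hypothetical, assumed to sit inside the existing Boost.Test harness; it works on two flattened weight dumps rather than real VW model objects):

#include <boost/test/unit_test.hpp>
#include <cstring>
#include <vector>

// Hypothetical helper: require two flattened weight dumps to be exactly the
// same bit by bit, which is stricter than float equality (it also
// distinguishes -0.0f from 0.0f).
void check_weights_bitwise_equal(const std::vector<float>& lhs, const std::vector<float>& rhs)
{
  BOOST_REQUIRE_EQUAL(lhs.size(), rhs.size());
  BOOST_CHECK(std::memcmp(lhs.data(), rhs.data(), lhs.size() * sizeof(float)) == 0);
}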

clear_non_champ_weights(data.cm->weights, data.cm->estimators.size(), data.cm->wpp);

uint64_t multiplier = static_cast<uint64_t>(data.cm->wpp) << data.cm->weights.stride_shift();
for (uint32_t index = 0; index < data.cm->weights.mask(); index += multiplier)
Collaborator:

should this be index *= multiplier?

olgavrou (Collaborator), Nov 17, 2022:

also maybe create a function/iterator here that encapsulates this multiplier indexing

bassmang (Member Author):

it is supposed to be +=; maybe multiplier isn't the right word
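
To make the naming point concrete: multiplier is the width of one chunk (wpp weights per problem times the stride), so advancing the index by it jumps from the start of one chunk to the start of the next. A standalone sketch with made-up example values:

#include <cstdint>
#include <iostream>

int main()
{
  // Example values only, not taken from this PR: 4 weights per problem and
  // a stride_shift of 2, i.e. 4 slots per weight.
  const uint64_t wpp = 4;
  const uint64_t stride_shift = 2;

  // One chunk spans wpp << stride_shift contiguous slots, so stepping the
  // index by this amount lands on the first slot of each successive chunk.
  const uint64_t chunk_width = wpp << stride_shift;  // 16

  for (uint64_t index = 0; index < 4 * chunk_width; index += chunk_width)
  {
    std::cout << "chunk starts at slot " << index << '\n';  // 0, 16, 32, 48
  }
  return 0;
}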

if (data.cm->weights[index] != 0.0f)
{
uint32_t cb_ind = index / data.cm->wpp;
for (uint32_t stride = 0; stride < (static_cast<uint32_t>(1) << data.cm->weights.stride_shift()); ++stride)
Collaborator:

do you want the stride here or the indices between two strides? If it is the latter, using stride here for the indexing parameter may be misleading

bassmang (Member Author):

updated to stride_ind; also moved this function to array_parameters_dense.cc
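
Roughly, the loop being discussed remaps each surviving champ chunk into the layout of a standalone cb model. A simplified, hypothetical version over a flat float buffer (remap_champ_weights is not the merged function, and the destination arithmetic in the real code may differ):

#include <cstdint>
#include <vector>

// Hypothetical, simplified remap: copy the chunk starting at each wpp-group
// boundary down to the slot it would occupy in a single-model layout
// (index / wpp), clearing the old location afterwards.
void remap_champ_weights(std::vector<float>& weights, uint64_t wpp, uint32_t stride_shift)
{
  const uint64_t stride = static_cast<uint64_t>(1) << stride_shift;
  const uint64_t chunk_width = wpp * stride;
  for (uint64_t index = 0; index + chunk_width <= weights.size(); index += chunk_width)
  {
    const uint64_t cb_ind = index / wpp;
    for (uint64_t stride_ind = 0; stride_ind < stride; ++stride_ind)
    {
      weights[cb_ind + stride_ind] = weights[index + stride_ind];
      if (cb_ind != index) { weights[index + stride_ind] = 0.0f; }
    }
  }
}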

clear_non_champ_weights(data.cm->weights, data.cm->estimators.size(), data.cm->wpp);

uint64_t multiplier = static_cast<uint64_t>(data.cm->wpp) << data.cm->weights.stride_shift();
for (uint32_t index = 0; index < data.cm->weights.mask(); index += multiplier)
Collaborator:

are the iterators from array_parameters_dense useful here?

bassmang (Member Author):

this seems possible but might complicate things
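
For what it's worth, a tiny hypothetical wrapper shows what encapsulating the multiplier indexing (as suggested above) might look like; it is not based on the actual array_parameters_dense iterators:

#include <cstdint>
#include <vector>

// Hypothetical helper: walk a flat weight buffer one chunk at a time,
// hiding the "index += wpp << stride_shift" arithmetic behind next().
class chunk_cursor
{
public:
  chunk_cursor(std::vector<float>& weights, uint64_t wpp, uint32_t stride_shift)
      : _weights(weights), _chunk_width(wpp << stride_shift)
  {
  }

  bool done() const { return _index >= _weights.size(); }
  void next() { _index += _chunk_width; }
  uint64_t index() const { return _index; }
  float& first_slot() { return _weights[_index]; }

private:
  std::vector<float>& _weights;
  uint64_t _chunk_width;
  uint64_t _index = 0;
};

// Usage: for (chunk_cursor c(w, wpp, shift); !c.done(); c.next()) { ... }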

],
"depends_on": [
427
]
Collaborator:

it would be good if these model diffs used larger files with more features and interactions, to allow hash collisions to happen and check this more thoroughly (something like the ccb_lots_of_interactions.dat file)

bassmang (Member Author):

this feature isn't compatible with ccb yet. Also, it would be difficult to compare these in runtests since each interaction would need to be enumerated on the command line when comparing to a standard model

Collaborator:

I wasn't suggesting we use the ccb file, but a file like that one, with multiple namespaces to explore, that is a bit more complex than the one exercised here and could potentially surface issues

bassmang (Member Author):

updated the test file; it now has 290k interacted features across 6 namespaces:
4fb69a8#diff-94d0ac7a20310e17e8f59703288b6806f838c0d36c8893ac70385c2227ae62d8R1

Collaborator:

cool

@@ -89,6 +89,26 @@ void dense_parameters::clear_offset(const size_t offset, const size_t params_per
}
}

void dense_parameters::adjust_weights_single_model(const size_t params_per_problem, const size_t model_num)
Collaborator:

this method could also be unit tested in weights_test.cc; it seems like a prime candidate for some rigorous unit testing

bassmang (Member Author):

The weights are tested here:
https://github.com/VowpalWabbit/vowpal_wabbit/pull/4279/files#diff-9932fc58f28480ef64a94f6fc4cb26b0b50e7d94c621a3edb00fd0131fcc14dfR333

It doesn't make much sense to test this without initializing weights with both a standard and an automl model, since it converts one to the other
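
In that spirit, a unit test for the simplified remap sketched earlier in this thread might look roughly like the following (hypothetical; it exercises remap_champ_weights from the sketch above rather than the real dense_parameters::adjust_weights_single_model):

#define BOOST_TEST_MODULE adjust_weights_sketch
#include <boost/test/included/unit_test.hpp>
#include <cstdint>
#include <vector>

// Assumes remap_champ_weights from the earlier sketch is declared/included.
BOOST_AUTO_TEST_CASE(remap_champ_weights_moves_chunk_to_single_model_slot)
{
  const uint64_t wpp = 2;
  const uint32_t stride_shift = 1;  // 2 slots per weight
  std::vector<float> weights(16, 0.0f);

  // Place a recognizable chunk at the start of the second wpp-group (slot 4).
  weights[4] = 1.5f;
  weights[5] = 2.5f;

  remap_champ_weights(weights, wpp, stride_shift);

  // After the remap, that chunk should live at slot 4 / wpp == 2.
  BOOST_CHECK_EQUAL(weights[2], 1.5f);
  BOOST_CHECK_EQUAL(weights[3], 2.5f);
}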

bassmang merged commit 583ce44 into VowpalWabbit:master on Dec 6, 2022
bassmang deleted the spin_off_aml branch on December 6, 2022 16:31