
feat: add a training loss calculation to the predict method of PLT reduction #4534

Merged

Conversation

mwydmuch (Contributor)

This is a small change to the PLT reduction that also calculates the training loss in the predict method when labels are available (e.g., when predicting on a holdout dataset), instead of always returning 0, which is a behavior that can cause confusion or problems for some users (as in #4511).
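
To illustrate the idea, here is a minimal, self-contained sketch (not the actual VW PLT code; the Stats and Example types and the example_loss helper are stand-ins for illustration): the predict path keeps its normal output, and when the example carries true labels it also folds the example's loss into the running statistics rather than reporting 0.

// Conceptual sketch only, not the actual VW code: a predict routine that
// also accumulates loss into running statistics when true labels are
// present (e.g., a holdout pass), instead of leaving the reported loss at 0.
#include <vector>

struct Stats            // stand-in for VW's shared statistics (assumption)
{
  double sum_loss = 0.0;
  double weighted_examples = 0.0;
};

struct Example          // stand-in for a multilabel example (assumption)
{
  std::vector<int> true_labels;       // empty when no labels are given
  std::vector<int> predicted_labels;  // filled by the tree traversal
  double weight = 1.0;
};

// Hypothetical per-example loss: number of true labels the prediction missed.
double example_loss(const Example& ec)
{
  double missed = 0.0;
  for (int t : ec.true_labels)
  {
    bool found = false;
    for (int p : ec.predicted_labels)
    {
      if (p == t) { found = true; break; }
    }
    if (!found) { missed += 1.0; }
  }
  return missed;
}

void predict(Example& ec, Stats& sd)
{
  // ... run the usual PLT tree traversal to fill ec.predicted_labels ...
  if (!ec.true_labels.empty())  // labels available: record the loss
  {
    sd.sum_loss += ec.weight * example_loss(ec);
    sd.weighted_examples += ec.weight;
  }
}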

@mwydmuch changed the title from "add a training loss calculation to the predict method of PLT reduction" to "feat: add a training loss calculation to the predict method of PLT reduction" on Mar 16, 2023

double t = p.all->sd->t;
double weighted_holdout_examples = p.all->sd->weighted_holdout_examples;
p.all->sd->weighted_holdout_examples = 0;
Member

Why reset this to zero?

Contributor Author

Good question. This has been there since the initial implementation; it is not a change I introduced in this PR, and I don't remember why it's here. Does it somehow impact the update of the base classifier? That could be a reason for "resetting" this variable. If not, it can probably be removed.

Contributor Author

Ok, it seems that it doesn't impact the training, so I think I will remove these lines.
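
Concretely, the cleanup amounts to dropping the reset line from the quoted snippet while keeping the saved copies (a sketch of the intent, not the exact diff):

// Before: save the counters, then zero the holdout counter.
double t = p.all->sd->t;
double weighted_holdout_examples = p.all->sd->weighted_holdout_examples;
p.all->sd->weighted_holdout_examples = 0;  // unnecessary reset, removed in this PR

// After: only the saved copies remain; shared_data is left untouched here.
double t = p.all->sd->t;
double weighted_holdout_examples = p.all->sd->weighted_holdout_examples;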

Member

Sounds good!

Member
@jackgerrits left a comment

Thanks for this change! The loss being reported now is really helpful.

@mwydmuch (Contributor Author)

Ok, all the checks have passed, so from my point of view this is ready to be merged :)

@jackgerrits merged commit adcaff2 into VowpalWabbit:master on Mar 20, 2023
lalo added a commit that referenced this pull request Mar 20, 2023
commit adcaff2
Author: Marek Wydmuch <marek@wydmuch.poznan.pl>
Date:   Mon Mar 20 15:30:30 2023 +0100

    feat: add a training loss calculation to the predict method of PLT reduction (#4534)

    * add a training loss calculation to the predict method of PLT reduction

    * update PLT demo

    * update the tests for PLT reduction

    * disable the calculation of additional evaluation measures in PLT reduction when true labels are not available

    * apply black formating to plt_demo.py

    * remove unnecessary reset of weighted_holdout_examples variable in PLT reduction

    * revert the change of the path to the exe in plt_demo.py

    * apply black formating again to plt_demo.py

    ---------

    Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>

commit f7a197e
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Fri Mar 17 11:01:17 2023 -0400

    refactor: separate cb_to_cs_adf_mtr and cb_to_cs_adf_dr (#4532)

    * refactor: separate cb_to_cs_adf_mtr and cb_to_cs_adf_dr

    * clang

    * unused

    * remove mtr

commit e5597ae
Author: swaptr <83858160+swaptr@users.noreply.github.com>
Date:   Fri Mar 17 02:21:50 2023 +0530

    fix: fix multiline typo (#4533)

commit 301800a
Author: Eduardo Salinas <edus@microsoft.com>
Date:   Wed Mar 15 12:25:55 2023 -0400

    test: [automl] improve runtest and test changes (#4531)

commit 258731c
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Tue Mar 14 13:28:03 2023 -0400

    chore: Update Version to 9.5.0 (#4529)

commit 49131be
Author: Eduardo Salinas <edus@microsoft.com>
Date:   Tue Mar 14 11:03:24 2023 -0400

    fix: [automl] avoid ccb pulling in generate_interactions (#4524)

    * fix: [automl] avoid ccb pulling in generate_interactions

    * same features in one line minimal repro

    * add assert of reserve size

    * update test file

    * remove include and add comment

    * temp print

    * sorting interactions matters

    * update temp print

    * fix by accounting for slot ns

    * remove prints

    * change comment and remove commented code

    * add sort to test

    * update runtests

    * Squashed commit of the following:

    commit 322a2b1
    Author: Eduardo Salinas <edus@microsoft.com>
    Date:   Mon Mar 13 21:51:49 2023 +0000

        possibly overwrite vw brought in by vw-executor

    commit 0a6baa0
    Author: Eduardo Salinas <edus@microsoft.com>
    Date:   Mon Mar 13 21:25:46 2023 +0000

        add check for metrics

    commit 469cebe
    Author: Eduardo Salinas <edus@microsoft.com>
    Date:   Mon Mar 13 21:22:38 2023 +0000

        update test

    commit 7c0b212
    Author: Eduardo Salinas <edus@microsoft.com>
    Date:   Mon Mar 13 21:11:45 2023 +0000

        format and add handler none

    commit 533e067
    Author: Eduardo Salinas <edus@microsoft.com>
    Date:   Mon Mar 13 20:56:07 2023 +0000

        test: [automl] add ccb test that checks for ft names

    * update python test

    * Update automl_oracle.cc

commit 37f4b19
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Fri Mar 10 17:38:02 2023 -0500

    refactor: remove resize in gd setup (#4526)

    * refactor: remove resize in gd setup

    * rm resize

commit 009831b
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Fri Mar 10 16:57:53 2023 -0500

    fix: multi-model state for cb_adf (#4513)

    * switch to vector

    * fix aml and ep_dec

    * clang

    * reorder

    * clang

    * reorder

commit a31ef14
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Fri Mar 10 14:52:50 2023 -0500

    refactor: rename wpp, ppw, ws, params_per_problem, problem_multiplier, num_learners, increment -> feature_width (#4521)

    * refactor: rename wpp, ppw, ws, params_per_problem, problem_multiplier, num_learners, increment -> interleaves

    * clang

    * clang

    * settings

    * make bottom interleaves the same

    * remove bottom_interleaves

    * fix test

    * feature width

    * clang

commit 8390f48
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Fri Mar 10 12:25:12 2023 -0500

    refactor: dedup dict const (#4525)

    * refactor: dedup dict const

    * clang

commit 2238d70
Author: Jack Gerrits <jackgerrits@users.noreply.github.com>
Date:   Thu Mar 9 13:51:35 2023 -0500

    refactor: add api to set data object associated with learner (#4523)

    * refactor: add api to set data object associated with learner

    * add shared ptr func

commit b622540
Author: Griffin Bassman <griffinbassman@gmail.com>
Date:   Tue Mar 7 12:16:39 2023 -0500

    fix: cbzo ppw fix (#4519)

commit f83cb7f
Author: Jack Gerrits <jackgerrits@users.noreply.github.com>
Date:   Tue Mar 7 11:21:51 2023 -0500

    refactor: automatically set label parser after stack created (#4471)

    * refactor: automatically set label parser after stack created

    * a couple of fixes

    * Put in hack to keep search working

    * formatting

commit 64e5920
Author: olgavrou <olgavrou@gmail.com>
Date:   Fri Mar 3 16:20:05 2023 -0500

    feat: [LAS] with CCB (#4520)

commit 69bf346
Author: Jack Gerrits <jackgerrits@users.noreply.github.com>
Date:   Fri Mar 3 15:17:29 2023 -0500

    refactor: make flat_example an implementation detail of ksvm (#4505)

    * refactor!: make flat_example an implementation detail of ksvm

    * Update memory_tree.cc

    * Absorb flat_example into svm_example

    * revert "Absorb flat_example into svm_example"

    This reverts commit b063feb.

commit f08f1ec
Author: Jack Gerrits <jackgerrits@users.noreply.github.com>
Date:   Fri Mar 3 14:04:48 2023 -0500

    test: fix pytype issue in test runner and utl (#4517)

    * test: fix pytype issue in test runner

    * fix version_number.py type checker issues

commit a8b1d91
Author: Eduardo Salinas <edus@microsoft.com>
Date:   Fri Mar 3 12:59:26 2023 -0500

    fix: [epsdecay] return champ prediction always (#4518)

commit b2276c1
Author: olgavrou <olgavrou@gmail.com>
Date:   Thu Mar 2 20:18:23 2023 -0500

    chore: [LAS] don't force mtr with LAS (#4516)

commit c0ba180
Author: olgavrou <olgavrou@gmail.com>
Date:   Tue Feb 28 11:27:36 2023 -0500

    feat: [LAS] add example ft hash and cache and re-use rows of matrix if actions do not change (#4509)

commit e1a9363
Author: Eduardo Salinas <edus@microsoft.com>
Date:   Mon Feb 27 16:35:09 2023 -0500

    feat: [gd] persist ppw extra state (#4023)

    * feat: [gd] persist ppm state

    * introduce resize_ppw_state

    * wip: move logic down to gd, respect incoming ft_offset

    * replace assert with status quo behaviour

    * implement writing/reading to modelfile

    * remove from predict

    * update test 351 and 411

    * update sensitivity and update

    * remove debug prints

    * update all tests

    * apply fix of other pr

    * use .at() for bounds checking

    * add max_ft_offset and add asserts

    * comment extra assert that is failing

    * remove files

    * fix automl tests

    * more tests

    * tests

    * tests

    * clang

    * fix for predict_only_model automl

    * comment

    * fix ppm printing

    * temporarily remove tests 50 and 68

    * address comments

    * expand width for search

    * fix tests

    * merge

    * revert cb_adf

    * merge

    * fix learner

    * clang

    * search fix

    * clang

    * fix unit tests

    * bump 9.7.1 for version CIs

    * revert to 9.7.0

    * stop search from learning out of bounds

    * expand search num_learners

    * fix search cs test

    * comment

    * revert ext_libs

    * clang

    * comment out saveresume tests

    * pylint

    * comment

    * fix with search

    * fix search

    * clang

    * unused

    * unused

    * commnets

    * fix scope_exit

    * fix cs test

    * revert automl test update

    * remove resize

    * clang

    ---------

    Co-authored-by: Griffin Bassman <griffinbassman@gmail.com>