fix: explore_eval don't learn if logged action not in predicted actions #4262

olgavrou · 2022-11-02T13:22:41Z

No description provided.

olgavrou · 2022-11-02T13:23:17Z

vowpalwabbit/core/src/reductions/explore_eval.cc

@@ -164,6 +165,8 @@ void do_actual_learning(explore_eval& data, multi_learner& base, VW::multi_ex& e
      if (data.known_cost.action == a_s[i].action) { action_probability = a_s[i].score; }
    }

+    if (action_probability == 0) { return; }
+
    float threshold = action_probability / data.known_cost.probability;

    if (!data.fixed_multiplier) { data.multiplier = std::min(data.multiplier, 1 / threshold); }


Now that probabilities can be zero, `threshold` can be zero too, so this `1 / threshold` could divide by zero.
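To make the hazard concrete, here is a minimal standalone sketch of the guard in the hunk above. It is not the VW code: `action_score` and `compute_threshold` are hypothetical stand-ins for `a_s` and the body of `do_actual_learning`, and the positive logged probability is an assumption checked by an assert.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one entry of the predicted action distribution (a_s).
struct action_score
{
  uint32_t action;
  float score;
};

// Writes the threshold and returns true, or returns false when the example
// should be skipped. Hypothetical helper, not part of the VW API.
bool compute_threshold(
    const std::vector<action_score>& a_s, uint32_t logged_action, float logged_probability, float& threshold)
{
  // Assumption for this sketch: the logged probability is strictly positive.
  assert(logged_probability > 0.f);

  float action_probability = 0.f;
  for (const auto& as : a_s)
  {
    if (as.action == logged_action) { action_probability = as.score; }
  }

  // Guard from the diff: a zero probability gives threshold == 0,
  // and a later 1 / threshold would divide by zero.
  if (action_probability == 0) { return false; }

  threshold = action_probability / logged_probability;
  return true;
}
```

A caller would update the multiplier with `1 / threshold` only when the function returns true.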

olgavrou added 4 commits November 1, 2022 00:10

fix: explore_eval skip if prob is zero

9b780b0

clang clang

2fb63a9

don't compare floats just skip if action not found

04959c9

formatting

a5b119c

olgavrou commented Nov 2, 2022

jackgerrits reviewed Nov 2, 2022

olgavrou and others added 2 commits November 2, 2022 09:29

Update vowpalwabbit/core/src/reductions/explore_eval.cc

e01a4ed

Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>

don't compare floats just skip if action not found

51db5fa
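The commit message above describes skipping when the logged action is not found, rather than comparing a float against zero. A rough sketch of that idea follows, assuming a lookup that reports whether a match occurred. The names `action_score` and `find_action_probability` are hypothetical and this is not the merged code.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one entry of the predicted action distribution.
struct action_score
{
  uint32_t action;
  float score;
};

// Returns true and writes the predicted probability if the logged action
// appears among the predicted actions; returns false otherwise, leaving
// `probability` untouched. No floating-point comparison is involved.
bool find_action_probability(const std::vector<action_score>& a_s, uint32_t logged_action, float& probability)
{
  for (const auto& as : a_s)
  {
    if (as.action == logged_action)
    {
      probability = as.score;
      return true;
    }
  }
  return false;
}
```

In this sketch the caller returns early when the lookup fails. A matched action whose score is exactly zero is not caught by this check and would still need separate handling.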

olgavrou changed the title ~~fix: explore_eval don't learn if prob is zero~~ fix: explore_eval don't learn if logged action not in predicted actions Nov 2, 2022

Merge branch 'master' into explore_eval_no_UB

8db8bd9

ataymano approved these changes Nov 8, 2022

jackgerrits approved these changes Nov 8, 2022

olgavrou merged commit 0406c0f into VowpalWabbit:master Nov 8, 2022
