cfg_fpu_dynamic_eval mode includes virtual loss #317

killerducky · 2018-04-14T19:01:07Z

auto fpu_eval = (cfg_fpu_dynamic_eval ? get_eval(color) : net_eval) - fpu_reduction;

This code includes virtual losses in it. There should probably be a get_pure_eval(color) or an argument to get_eval(color, no_virtual_loss) that excludes them. This is making it unclear exactly what is going on because things depend on how many threads are going into the same move etc.

Not super urgent but we should probably clean this up before tuning other FPU related stuff.

Thanks to crem for finding this.

The text was updated successfully, but these errors were encountered:

killerducky · 2018-04-16T12:09:30Z

@jkiliani note this bug effects even -t1 because after you select a node, a virtual loss is applied. Then when selecting the next node, get_eval is called on the current node, which now includes the virtual loss.

jkiliani · 2018-04-17T20:52:11Z

Since it increasingly looks like this bug actually gains instead of losing strength, I'm rather dubious about fixing it to be honest...

jjoshua2 · 2018-04-17T21:02:20Z

I'm only ok with it, if it gains strength with all numbers of threads. We know it works with -t1, but does it fall apart when tcec uses -t43? Or even -t8 which is easy to test.

jkiliani · 2018-04-17T21:04:45Z

Very doubtful, I posted some debug output in dev channel today. The virtual losses massively reduce FPU for nodes far up the search tree, while leaving the FPU for root nodes unchanged. This is the case for all thread counts I looked at, though to be thorough, strength should be tested with multithreading for this.

mooskagh · 2018-05-01T17:50:25Z

While it may improve strength of play, it can hinder training.

Look at this position:

Network id230 evaluates the probability to move Re2 as 0.25%

Without this "virtual loss bug", it does the first visit on this subtree during playout 520, and at playout 810 that move becomes the best. (I expect 2-4 network generations for those moves to become most probable).

With "virtual loss bug" however the first visit to that subtree happens at playout 1501, so with 800 playout training, that move is trained with probability 0.00.

The same would happen with FPU reduction I guess, but hopefully we won't do FPU reduction in training games.

jjoshua2 · 2018-05-01T17:59:47Z

FPU of 0.1 sometimes helps and sometimes hurts elo in my tests, but using .05 seems very conservative. and gaining Using noise and -t=1 should be enough for training to find tactics, so we should probably tune for elo on a variety of nets, instead of tactics puzzles.
But I do see the point that not getting a single playout will mean t=1 won't ever try it. I think this would be an argument for training with 1600 playouts once we stall at 800, or once we implement resign.

killerducky · 2018-05-01T18:27:18Z

@mooskagh FPU reduction is not done when noise is on (implies training) for the root node:

    // Lower the expected eval for moves that are likely not the best.
    // Do not do this if we have introduced noise at this node exactly
    // to explore more.
    if (!is_root || !cfg_noise) {
        fpu_reduction = cfg_fpu_reduction * std::sqrt(total_visited_policy);
    }

This combined with the noise should help the network discover these gaps in it's knowledge.

killerducky · 2018-05-01T18:54:44Z

Sorry I was focused on the fpu_reduction term:
fpu_eval = (cfg_fpu_dynamic_eval ? get_eval(color) : net_eval) - fpu_reduction

But you're pointing out the problem in the other term, get_eval. Yes this seems like a problem.

killerducky mentioned this issue Apr 15, 2018

Add tuning support for FPU reduction #288

Closed

killerducky mentioned this issue May 1, 2018

Add recursive search depth, remove FPU VL bug #466

Merged

killerducky closed this as completed May 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cfg_fpu_dynamic_eval mode includes virtual loss #317

cfg_fpu_dynamic_eval mode includes virtual loss #317

killerducky commented Apr 14, 2018

killerducky commented Apr 16, 2018

jkiliani commented Apr 17, 2018 •

edited

Loading

jjoshua2 commented Apr 17, 2018 •

edited

Loading

jkiliani commented Apr 17, 2018 •

edited

Loading

mooskagh commented May 1, 2018 •

edited

Loading

jjoshua2 commented May 1, 2018

killerducky commented May 1, 2018

killerducky commented May 1, 2018

cfg_fpu_dynamic_eval mode includes virtual loss #317

cfg_fpu_dynamic_eval mode includes virtual loss #317

Comments

killerducky commented Apr 14, 2018

killerducky commented Apr 16, 2018

jkiliani commented Apr 17, 2018 • edited Loading

jjoshua2 commented Apr 17, 2018 • edited Loading

jkiliani commented Apr 17, 2018 • edited Loading

mooskagh commented May 1, 2018 • edited Loading

jjoshua2 commented May 1, 2018

killerducky commented May 1, 2018

killerducky commented May 1, 2018

jkiliani commented Apr 17, 2018 •

edited

Loading

jjoshua2 commented Apr 17, 2018 •

edited

Loading

jkiliani commented Apr 17, 2018 •

edited

Loading

mooskagh commented May 1, 2018 •

edited

Loading