Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CSI: no early return when feasibility check fails on eligible nodes #13274

Merged
merged 1 commit into from
Jun 7, 2022

Commits on Jun 7, 2022

  1. CSI: no early return when feasibility check fails on eligible nodes

    As a performance optimization in the scheduler, feasibility checks
    that apply to an entire class are only checked once for all nodes of
    that class. Other feasibility checks are "available" checks because
    they rely on more ephemeral characteristics and don't contribute to
    the hash for the node class. This currently includes only CSI.
    
    We have a separate fast path for "available" checks when the node has
    already been marked eligible on the basis of class. This fast path has
    a bug where it returns early rather than continuing the loop. This
    causes the entire task group to be rejected.
    
    Fix the bug by not returning early in the fast path and instead jump
    to the top of the loop like all the other code paths in this method.
    Includes a new test exercising topology at whole-scheduler level and a
    fix for an existing test that should've caught this previously.
    tgross committed Jun 7, 2022
    Configuration menu
    Copy the full SHA
    41744a6 View commit details
    Browse the repository at this point in the history