Handle negative free disk space in deciders #48392

Merged

Conversation

DaveCTurner (Contributor)

Today it is possible that the total size of all relocating shards exceeds the
total amount of free disk space. For instance, this may be caused by another
user of the same disk increasing their disk usage, or by the way Elasticsearch
double-counts relocations that are nearly complete, particularly if there are
many concurrent relocations in progress.

The `DiskThresholdDecider` treats negative free space similarly to zero free
space, but it then fails when rendering the messages that explain its decision.
This commit fixes its handling of negative free space.

Fixes #48380
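
For context, a minimal sketch of the failure mode being fixed (illustrative code, not from the PR; `ByteSizeValue` is the real Elasticsearch class, while the wrapper class and the values are invented):

```java
import org.elasticsearch.common.unit.ByteSizeValue;

// Illustrative only: when incoming relocations exceed the reported free
// space, the adjusted free-byte count goes negative, and rendering it as
// a ByteSizeValue in the decider's explanation messages throws.
public class NegativeFreeSpaceSketch {
    public static void main(String[] args) {
        long freeBytes = 100;           // bytes reported free on the node
        long relocatingShardSize = 250; // net bytes of incoming relocations
        long adjustedFree = freeBytes - relocatingShardSize; // -150
        // Throws IllegalArgumentException:
        // "Values less than -1 bytes are not supported"
        System.out.println(new ByteSizeValue(adjustedFree));
    }
}
```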

@DaveCTurner added the >bug, :Distributed Coordination/Allocation, v8.0.0, v7.5.0, v7.6.0, v7.4.2 and v6.8.5 labels on Oct 23, 2019
@elasticmachine (Collaborator)

Pinging @elastic/es-distributed (:Distributed/Allocation)

@ywelsch (Contributor) left a comment

I'm not sure whether we would like `DiskUsage.freeBytes()` to ever return a negative value. Its serialization as a VLong, for example, suggests that the value can never be negative. I know that we only expect this to be negative if we have locally created a `DiskUsage` instance that is not serialized, but that seems trappy either way. I wonder if we should instead add a new class that represents a `DiskUsage` object with relocations taken into account, and use that object for the decision making and logging here. We can do this in a follow-up though.

@DaveCTurner (Contributor, Author)

`DiskUsageTests` mentions cases where the disk usage is negative, although looking at `ESFileStore` I don't think that this can really happen any more. But you're certainly right that negative disk usage stats can't be serialised, and this seems bad.

I added a new class in 9ea918f.
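
For reference, a rough sketch of the shape of that class (the field names match the diff discussed below; the constructor and everything else here is paraphrased, not the exact merged code):

```java
import org.elasticsearch.cluster.DiskUsage;

// Sketch of the wrapper added in 9ea918f (a static nested class of
// DiskThresholdDecider in the real code): a purely local, never-serialized
// view of DiskUsage with the net size of relocating shards folded in, so the
// serialized DiskUsage itself never has to hold a negative value.
class DiskUsageWithRelocations {
    private final DiskUsage diskUsage;      // the node's reported disk stats
    private final long relocatingShardSize; // net relocation size; may be negative

    DiskUsageWithRelocations(DiskUsage diskUsage, long relocatingShardSize) {
        this.diskUsage = diskUsage;
        this.relocatingShardSize = relocatingShardSize;
    }

    // Pre-review version, matching the "-" line in the diff below; the
    // review thread replaces this with an overflow-checked subtraction.
    long getFreeBytes() {
        return diskUsage.getFreeBytes() - relocatingShardSize;
    }
}
```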

@ywelsch (Contributor) commented Oct 23, 2019

> `DiskUsageTests` mentions cases where the disk usage is negative, although looking at `ESFileStore` I don't think that this can really happen any more.

Should we fix the tests then?

I would like to have assertions in `DiskUsage` checking that values are >= 0. A sketch of what that could look like follows.
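
A hypothetical sketch of such assertions (illustrative only, not merged code; the class name is invented, the field list mirrors `DiskUsage`):

```java
// Hypothetical sketch: a DiskUsage-like constructor that asserts the
// reported stats are non-negative, so a locally computed negative value
// fails fast when assertions are enabled.
public class DiskUsageSketch {
    private final String nodeId;
    private final String nodeName;
    private final String path;
    private final long totalBytes;
    private final long freeBytes;

    public DiskUsageSketch(String nodeId, String nodeName, String path, long totalBytes, long freeBytes) {
        assert totalBytes >= 0 : "totalBytes must be non-negative but was [" + totalBytes + "]";
        assert freeBytes >= 0 : "freeBytes must be non-negative but was [" + freeBytes + "]";
        this.nodeId = nodeId;
        this.nodeName = nodeName;
        this.path = path;
        this.totalBytes = totalBytes;
        this.freeBytes = freeBytes;
    }
}
```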

@@ -500,7 +500,11 @@ public String toString() {
         }

         long getFreeBytes() {
-            return diskUsage.getFreeBytes() - relocatingShardSize;
+            try {
+                return Math.subtractExact(diskUsage.getFreeBytes(), relocatingShardSize);
+            } catch (ArithmeticException e) {
+                return Long.MAX_VALUE;
+            }
ywelsch (Contributor)

How could this overflow?
Isn't `relocatingShardSize >= 0` and `0 <= diskUsage.getFreeBytes() <= Long.MAX_VALUE`? Is this to capture the case where `diskUsage.getFreeBytes() == 0 && relocatingShardSize == Long.MAX_VALUE`?

DaveCTurner (Contributor, Author)

No, `relocatingShardSize` is the net size of all relocations, i.e. shards in minus shards out, so it can indeed be negative :(
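
To make the overflow concrete (the values here are illustrative):

```java
// relocatingShardSize is a net figure, so it can be negative when more
// data is relocating off the node than onto it.
long freeBytes = Long.MAX_VALUE;
long relocatingShardSize = -1L;

// Plain subtraction silently wraps around to Long.MIN_VALUE:
long wrapped = freeBytes - relocatingShardSize;

// Math.subtractExact detects the overflow and throws ArithmeticException
// instead of returning a nonsensical value:
long checked = Math.subtractExact(freeBytes, relocatingShardSize);
```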

@DaveCTurner (Contributor, Author)

I opened #48413 to investigate other sources of negative sizes because it needs a deeper look than I can offer right now.

@ywelsch (Contributor) left a comment

LGTM

@DaveCTurner merged commit 36b03a2 into elastic:master on Oct 23, 2019
@DaveCTurner deleted the 2019-10-23-handle-negative-free-space branch on October 23, 2019 at 15:58
DaveCTurner added a commit that referenced this pull request Oct 23, 2019
DaveCTurner added a commit that referenced this pull request Oct 23, 2019
DaveCTurner added a commit that referenced this pull request Oct 23, 2019
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Oct 23, 2019
In elastic#48392 we added a second computation of the sizes of the relocating shards
in `canRemain()` but passed the wrong value for `subtractLeavingShards`. This
fixes that. It also removes some unnecessary logging in a test case added in
the same commit.
DaveCTurner added a commit that referenced this pull request Oct 24, 2019
DaveCTurner added a commit that referenced this pull request Oct 24, 2019
Successfully merging this pull request may close these issues.

IllegalArgumentException: Values less than -1 bytes are not supported on DiskThresholdDecider