[AUTOCUT] Gradle Check Flaky Test Report for RemoteSegmentTransferTrackerTests #14325

opensearch-ci-bot · 2024-06-13T21:37:16Z

Flaky Test Report for `RemoteSegmentTransferTrackerTests`

Noticed the RemoteSegmentTransferTrackerTests has some flaky, failing tests that failed during post-merge actions.

Details

Git Reference	Merged Pull Request	Build Details	Test Name
`16c8806`	13724	41044	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`21d3aaa`	14399	41157	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`51af2e2`	14660	42106	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`68bdb77`	15143	44533	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`bd0deec`	13932	39630	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`c92e125`	14361	41039	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`c9ff7ce`	14827	42817	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`
`e076e66`	15348	44994	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testGetInflightUploadBytes`
`ffa67f9`	14963	43644	`org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate`

The other pull requests, besides those involved in post-merge actions, that contain failing tests with the RemoteSegmentTransferTrackerTests class are:

For more details on the failed tests refer to OpenSearch Gradle Check Metrics dashboard.

The text was updated successfully, but these errors were encountered:

lukas-vlcek · 2024-07-10T16:58:42Z

If no one is working on this one I would like to give it a try. Feel free to assign to me.

Current implementation of [`RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate()`](https://github.com/opensearch-project/OpenSearch/blob/2b17902643738f0d2a75ade7c85cbca94d18ce49/server/src/test/java/org/opensearch/index/remote/RemoteSegmentTransferTrackerTests.java#L139) test rely on some assumptions about how fast the testing code will finish in JVM. Moreover it does not precisely control boundaries of the time span, specifically the start of the span because it is determined by internal implementation of [`RemoteSegmentTransferTracker.getTimeMsLag()`](https://github.com/opensearch-project/OpenSearch/blob/2b17902643738f0d2a75ade7c85cbca94d18ce49/server/src/main/java/org/opensearch/index/remote/RemoteSegmentTransferTracker.java#L262) which indirectly makes call to `System.nanoTime()`. This commit loosens the assumption that the test code execution will finish within +/-20ms. Instead it only assumes that the execution time span won't be shorter than predefined (and controlled) thread sleep interval and any larger interval value is considered a success. The whole point of this test is not to verify execution speed with defined precision. Instead the point is that the [`getTimeMsLag()`](https://github.com/opensearch-project/OpenSearch/blob/2b17902643738f0d2a75ade7c85cbca94d18ce49/server/src/main/java/org/opensearch/index/remote/RemoteSegmentTransferTracker.java#L262) method returns either 0 (for specific conditions) or possitive number (assuming that `remoteRefreshStartTimeMs` is not greater than `System.nanoTime()`). Closes: opensearch-project#14325 Signed-off-by: Lukáš Vlček <lukas.vlcek@aiven.io>

dblock · 2024-09-09T16:05:10Z

[Catch All Triage - 1, 2, 3, 4, 5]

prachi-gaonkar · 2024-10-09T12:41:07Z

Hi Team, we are also getting the same error on ppc64le VM

RemoteSegmentTransferTrackerTests > testComputeTimeLagOnUpdate STANDARD_ERROR
REPRODUCE WITH: ./gradlew ':server:test' --tests "org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate" -Dtests.seed=1489833A008E8B2 -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=bg-BG -Dtests.timezone=Indian/Kerguelen -Druntime.java=21

RemoteSegmentTransferTrackerTests > testComputeTimeLagOnUpdate FAILED
java.lang.AssertionError
at __randomizedtesting.SeedInfo.seed([1489833A008E8B2:531AB8310E2D8828]:0)
at org.junit.Assert.fail(Assert.java:87)
at org.junit.Assert.assertTrue(Assert.java:42)
at org.junit.Assert.assertTrue(Assert.java:53)
at org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate(RemoteSegmentTransferTrackerTests.java:157)

prachi-gaonkar · 2024-10-15T06:16:23Z

Hi Team
is there any updated on this issue?

opensearch-ci-bot added >test-failure Test failure from CI, local build, etc. autocut untriaged labels Jun 13, 2024

prudhvigodithi mentioned this issue Jun 13, 2024

Add additional details on Gradle Check failures autocut issues #13950

Closed

prudhvigodithi added the flaky-test Random test failure that succeeds on second run label Jun 14, 2024

andrross mentioned this issue Jun 17, 2024

[BUG] org.opensearch.index.remote.RemoteSegmentTransferTrackerTests.testComputeTimeLagOnUpdate is flaky #12639

Closed

andrross added the Storage:Remote label Jun 17, 2024

reta mentioned this issue Jun 19, 2024

Tests are failing on top of tree(443cfca) for x86_64. #10014

Closed

gbbafna removed the untriaged label Jun 27, 2024

SwethaGuptha mentioned this issue Jul 3, 2024

Use set for shard routings in batch check. #14533

Merged

3 tasks

akolarkunnu mentioned this issue Jul 9, 2024

[Metadata Immutability] Change different indices lookup objects from array type to lists #14557

Closed

9 tasks

lukas-vlcek mentioned this issue Jul 9, 2024

[BUG] Incorrect cast to int in RemoteSegmentTransferTrackerTests.testStatsObjectCreationViaStream() test #14694

Closed

dblock assigned lukas-vlcek Jul 10, 2024

Pranshu-S mentioned this issue Jul 16, 2024

Optimise TransportNodesAction to not send DiscoveryNodes for NodeStat… #14749

Merged

3 tasks

rajiv-kv mentioned this issue Jul 23, 2024

Enabling term version check on local state for all ClusterManager Read actions - backport 2.16 #14887

Merged

3 tasks

mch2 mentioned this issue Jul 30, 2024

[Backport 2.x] [Derived Fields] Add aggregation support for derived fields #15009

Merged

lukas-vlcek mentioned this issue Aug 9, 2024

Don't rely on test code execution time span for RemoteSegmentTransferTrackerTests #15187

Merged

1 task

linuxpi closed this as completed in #15187 Aug 14, 2024

opensearch-ci-bot reopened this Sep 6, 2024

github-actions bot added the untriaged label Sep 6, 2024

dblock removed the untriaged label Sep 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AUTOCUT] Gradle Check Flaky Test Report for RemoteSegmentTransferTrackerTests #14325

[AUTOCUT] Gradle Check Flaky Test Report for RemoteSegmentTransferTrackerTests #14325

opensearch-ci-bot commented Jun 13, 2024 •

edited

Loading

lukas-vlcek commented Jul 10, 2024

dblock commented Sep 9, 2024

prachi-gaonkar commented Oct 9, 2024

prachi-gaonkar commented Oct 15, 2024

[AUTOCUT] Gradle Check Flaky Test Report for RemoteSegmentTransferTrackerTests #14325

[AUTOCUT] Gradle Check Flaky Test Report for RemoteSegmentTransferTrackerTests #14325

Comments

opensearch-ci-bot commented Jun 13, 2024 • edited Loading

Flaky Test Report for RemoteSegmentTransferTrackerTests

Details

lukas-vlcek commented Jul 10, 2024

dblock commented Sep 9, 2024

prachi-gaonkar commented Oct 9, 2024

prachi-gaonkar commented Oct 15, 2024

opensearch-ci-bot commented Jun 13, 2024 •

edited

Loading

Flaky Test Report for `RemoteSegmentTransferTrackerTests`