Wait for pending ml tasks in docs tests #44123

davidkyle · 2019-07-09T15:16:53Z

#43271 describes the problem where PUTing a ml job or data frame causes a notification document (saying something like Job X created) to be written to the ml-notifications index. This is done async and can occur after the test has finished and the teardown deleting indices has completed causing the index to be recreated and leaking into the next test.

This is a known issue XPackRestIT handles this by waiting for pending tasks to complete. This change adds the same step to DocsClientYamlTestSuiteIT

Unmutes the muted ml and data frame tests and closes #43271

XPackRestIT also has logic to stop datafeeds and close jobs post test that isn't necessary here as none of the tests start a job or data frame but may be required in the future

This reverts commit 071b652.

This reverts commit 2f9e8a8.

elasticmachine · 2019-07-09T15:16:55Z

Pinging @elastic/es-docs

nik9000 · 2019-07-10T15:12:56Z

docs/src/test/java/org/elasticsearch/smoketest/DocsClientYamlTestSuiteIT.java

+    @After
+    public void cleanup() throws Exception {
+        if (isMachineLearningTest() || isDataFrameTest()) {
+            ESRestTestCase.waitForPendingTasks(adminClient());


I wonder how bad it'd be to do this after every test. I don't feel great about relying on stuff in the test name. It just feels a bit too magical.

It is a little bit complicated because Rollups do the wait in the base ESRestTestCase

Additionally some tests leave tasks running. get-follow-info.asciidoc line 38 is a good example as it creates various CCR tasks which will be waited on indefinitely unless the test teardown is run. Interestingly what appears to be happening is the @After method of this class is called before the test teardown

Interestingly what appears to be happening is the @After method of this class is called before the test teardown

Weird!

I'm not a big fan of leaving things running in those tests either. Is there a way you could do something like the rollups here? It looks like it only cares about rollup style jobs. Does ml have something similar?

Yeah rollups filter the waiting tasks with taskName.startsWith("xpack/rollup/job") and we can do something similar with ml jobs but the action causing the leakage in #43271 is indexing a document not an ml task. Waiting for all tasks catches unexpected issues and actually helps debugging tests that have failed due to leakage from a previous test, experience from using this in XPackRestIT has shown that it is very valuable.

If I remove the if (isMachineLearningTest() || isDataFrameTest()) { check then the tests that fail with pending tasks are ccr and rollup. I'll look into what's happening there and maybe there is a way of removing the _if ml ... _ conditional

I took a look at the Rollup and CCR tests, unfortunately it is not possible to wait for pending tasks after every test because those tests require special handling. I cannot see a way to simplify the logic and I think the current code is best as it is explicitly for the ml & data frame tests.

Also as more xpack feature snippet testing is added I would expect more usages of the pattern e.g. if (isSecurityTest()) { // security specific cleanup

Using the test name to determine if the test is an ml test is a valid use. XPackRestIT set the precedent some time ago and it has not caused problems there.

I'm really not a fan of looking at the test name. I know XPackRestIT does it and I think it is sneaky black magic that will cause tests to fail in very difficult ways to trace. One badly named test invoking ml will cause subsequent tests to fail. Sometimes. Randomly.

I'm ok with merging this, but I'd really like a follow up issue to remove it somehow. Because I'm 100% sure somebody is going to lose many hours to debugging errors caused by a funny named test one day.

Can you detect a data frame test or ML test by looking at the public API somehow? Like by looking for jobs or something.....

nik9000 · 2019-07-12T15:20:12Z

docs/src/test/java/org/elasticsearch/smoketest/DocsClientYamlTestSuiteIT.java

+    @After
+    public void cleanup() throws Exception {
+        if (isMachineLearningTest() || isDataFrameTest()) {
+            ESRestTestCase.waitForPendingTasks(adminClient());


I'm really not a fan of looking at the test name. I know XPackRestIT does it and I think it is sneaky black magic that will cause tests to fail in very difficult ways to trace. One badly named test invoking ml will cause subsequent tests to fail. Sometimes. Randomly.

I'm ok with merging this, but I'd really like a follow up issue to remove it somehow. Because I'm 100% sure somebody is going to lose many hours to debugging errors caused by a funny named test one day.

nik9000 · 2019-07-12T15:21:14Z

docs/src/test/java/org/elasticsearch/smoketest/DocsClientYamlTestSuiteIT.java

+    @After
+    public void cleanup() throws Exception {
+        if (isMachineLearningTest() || isDataFrameTest()) {
+            ESRestTestCase.waitForPendingTasks(adminClient());


Can you detect a data frame test or ML test by looking at the public API somehow? Like by looking for jobs or something.....

ML and Data Frame tests should wait for pending tasks

davidkyle added 4 commits July 9, 2019 14:40

Wait for pending tasks

c01e416

Revert "Mute put job docs test"

442b451

This reverts commit 071b652.

Revert "Mute put-transform docs test"

dd72e83

This reverts commit 2f9e8a8.

Remove debug logging and fix test type checks

4f7d8bc

davidkyle added :Docs v8.0.0 v7.3.0 v7.4.0 labels Jul 9, 2019

davidkyle requested a review from nik9000 July 10, 2019 14:35

nik9000 reviewed Jul 10, 2019

View reviewed changes

nik9000 approved these changes Jul 12, 2019

View reviewed changes

davidkyle merged commit 4402cf3 into elastic:master Jul 15, 2019

davidkyle deleted the docs-tests-wait-for-pending branch July 15, 2019 10:58

davidkyle added a commit that referenced this pull request Jul 15, 2019

Wait for pending tasks in docs tests cleanup (#44123)

2382701

ML and Data Frame tests should wait for pending tasks

davidkyle added a commit that referenced this pull request Jul 15, 2019

Wait for pending tasks in docs tests cleanup (#44123)

a31a67e

ML and Data Frame tests should wait for pending tasks

jpountz added the >test Issues or PRs that are addressing/adding tests label Jul 15, 2019

lcawl mentioned this pull request Jan 28, 2020

Simplifying cleanup after code examples #51576

Open

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wait for pending ml tasks in docs tests #44123

Wait for pending ml tasks in docs tests #44123

davidkyle commented Jul 9, 2019 •

edited

Loading

elasticmachine commented Jul 9, 2019

nik9000 Jul 10, 2019

davidkyle Jul 10, 2019

nik9000 Jul 10, 2019

davidkyle Jul 10, 2019

davidkyle Jul 11, 2019

nik9000 Jul 12, 2019

nik9000 Jul 12, 2019

nik9000 Jul 12, 2019

nik9000 Jul 12, 2019

Wait for pending ml tasks in docs tests #44123

Wait for pending ml tasks in docs tests #44123

Conversation

davidkyle commented Jul 9, 2019 • edited Loading

elasticmachine commented Jul 9, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

davidkyle commented Jul 9, 2019 •

edited

Loading