-
Notifications
You must be signed in to change notification settings - Fork 277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Out of date archive test distribution may be causing plugin IT test failures #953
Comments
Here is the script that is producing those artifacts https://github.com/opensearch-project/opensearch-infra/blob/main/jenkins/jobs/OpenSearch_CI/Snapshots_CI/opensearch_snapshots_creation_1_0_0/opensearch-assemble.sh |
After looking into the system and the scripts, I don't see an indication of what is or isn't producing these specific artifacts. This might be related to how the distribution downloader is operating at runtime as opposed to the snapshot build is producing. For the time being lets keep this issue on the books in the event that the build job is not correct and needs to be updated |
The change to add -min to artifact name has been recently merged to gradle script in OpenSearch repo as part of this PR but it was not removed from the build script (which was already adding a -min to this artifact name) causing 2 min (-min-min) to be added to artifact name. The distribution downloader has been downloading the stale artifact (with one -min) for the integration testing. We are working on fixing this issue now. |
Resolved in https://github.com/opensearch-project/opensearch-infra/pull/131 (Private Repo) |
Describe the bug
Parallel issue on core which has been identified as a release blocker for 1.2: opensearch-project/OpenSearch#1473
Integration tests for the Index Management plugin which extend from OpenSearchIntegTestCase are failing during cleanup following the addition of shard indexing pressure stats in this PR, which are causing a serialization mismatch. The new fields are included in serialization when the version is on or after 1.2.0, and debug logging has shown that both sides of the stream are on version 1.2.0, but the fields are written and not read.
This failure can be replicated with an empty test extending from OpenSearchIntegTestCase, on at least Alerting, Index Management, and Anomaly Detection if the testDistribution is set to ARCHIVE and the integTest gradle task is run. These test failures have been isolated to Index Management as only this test setup is only used in Index Management.
It appears that when the test distribution is set to archive, this artifact is used: https://artifacts.opensearch.org/snapshots/core/opensearch/1.2.0-SNAPSHOT/opensearch-min-1.2.0-SNAPSHOT-linux-x64-latest.tar.gz
After unpacking the snapshot, you can see that the shard indexing pressure changes are not included in the code, and the state of the snapshot code would precisely cause these errors. Additionally, the equivalent snapshot is not yet available for 1.3.0, so there may be an issue with building these artifacts.
To reproduce
On a plugin, using this empty integration test with the test distribution set to ARCHIVE, the test always fails during cleanup:
Expected behavior
The test setup as described should not fail. This same test setup has been used in Index Management for a long time now, and changes to serialization in the NodeStats API should not cause it to fail.
Screenshots
No response
Host / Environment
I have been able to replicate this locally on my Amazon Linux 2 desktop, and it occurs every time on Index Management's CI run. Here is a failing run.
Additional context
No response
Relevant log output
The text was updated successfully, but these errors were encountered: