[CI] HuggingFaceServiceUpgradeIT testHFEmbeddings {upgradedNodes=1} failing #118197

Open
elasticsearchmachine opened this issue Dec 6, 2024 · 5 comments
Labels
medium-risk (An open issue or test failure that is a medium risk to future releases) · :ml (Machine learning) · Team:ML (Meta label for the ML team) · >test-failure (Triaged test failures from CI)

Comments

@elasticsearchmachine
Collaborator

elasticsearchmachine commented Dec 6, 2024

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:inference:qa:rolling-upgrade:v8.14.3#bwcTest" -Dtests.class="org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT" -Dtests.method="testHFEmbeddings {upgradedNodes=1}" -Dtests.seed=71BC5F32A9895774 -Dtests.bwc=true -Dtests.locale=zh-Hant -Dtests.timezone=Etc/GMT-10 -Druntime.java=23

Applicable branches:
8.x

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.IndexOutOfBoundsException: Index: 0, Size: 0

Issue Reasons:

  • [8.x] 2 consecutive failures in step 8.14.3_bwc
  • [8.x] 2 consecutive failures in step 8.12.2_bwc
  • [8.x] 18 failures in test testHFEmbeddings {upgradedNodes=1} (3.0% fail rate in 594 executions)
  • [8.x] 7 failures in step 8.14.3_bwc (53.8% fail rate in 13 executions)
  • [8.x] 6 failures in step 8.12.2_bwc (50.0% fail rate in 12 executions)
  • [8.x] 5 failures in step 8.13.4_bwc (45.5% fail rate in 11 executions)
  • [8.x] 10 failures in pipeline elasticsearch-periodic (83.3% fail rate in 12 executions)
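The percentages in the list above appear to be failures divided by executions, rounded to one decimal place. A minimal sketch for re-checking them, assuming that reading is correct; the `FailRate` class and its `percent` helper are hypothetical names for illustration and are not part of the triage automation:

```java
import java.util.Locale;

// Hypothetical helper for illustration only; not part of Elasticsearch or its CI tooling.
final class FailRate {
    private FailRate() {}

    /** Returns failures / executions as a percentage string with one decimal place. */
    static String percent(int failures, int executions) {
        if (executions <= 0) {
            throw new IllegalArgumentException("executions must be positive");
        }
        // Locale.ROOT keeps the decimal separator fixed regardless of the JVM's default locale.
        return String.format(Locale.ROOT, "%.1f", 100.0 * failures / executions);
    }

    public static void main(String[] args) {
        // {failures, executions} pairs copied from the Issue Reasons list.
        int[][] counts = { {18, 594}, {7, 13}, {6, 12}, {5, 11}, {10, 12} };
        for (int[] c : counts) {
            System.out.println(c[0] + " / " + c[1] + " = " + percent(c[0], c[1]) + "%");
        }
    }
}
```

Under that assumption, the per-step rates near 50% and the low overall test rate are consistent with a failure concentrated in a few BWC steps rather than spread across all executions.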

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

elasticsearchmachine added the :ml, >test-failure, Team:ML, and needs:risk (Requires assignment of a risk label: low, medium, blocker) labels Dec 6, 2024
@elasticsearchmachine
Collaborator Author

Pinging @elastic/ml-core (Team:ML)

maxhniebergall added the medium-risk label and removed the needs:risk label Dec 9, 2024
@maxhniebergall
Member

maxhniebergall commented Dec 9, 2024

related: #118127, #118163, #118162, #118156, #118204

The errors in these issues all show up in the same build and probably share a root cause.

@elasticsearchmachine
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 16 failures in test testHFEmbeddings {upgradedNodes=1} (2.6% fail rate in 626 executions)
  • [8.x] 5 failures in step 8.13.4_bwc (41.7% fail rate in 12 executions)
  • [8.x] 6 failures in step 8.14.3_bwc (50.0% fail rate in 12 executions)
  • [8.x] 5 failures in step 8.12.2_bwc (45.5% fail rate in 11 executions)
  • [8.x] 10 failures in pipeline elasticsearch-periodic (83.3% fail rate in 12 executions)

Build Scans:

elasticsearchmachine added a commit that referenced this issue Dec 10, 2024
@maxhniebergall
Member

Fixed by #118624

elasticsearchmachine pushed a commit that referenced this issue Dec 13, 2024
#118664

```
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testRerank {upgradedNodes=1}
  issue: #116973
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=1}
  issue: #116974
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=2}
  issue: #116975

- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=1}
  issue: #118156
- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testElser {upgradedNodes=1}
  issue: #118127
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=1}
  issue: #118162
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=2}
  issue: #118163
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=2}
  issue: #118204

- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testHFEmbeddings {upgradedNodes=1}
  issue: #118197
```
elasticsearchmachine added a commit that referenced this issue Dec 13, 2024
@elasticsearchmachine
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 2 consecutive failures in step 8.12.2_bwc
  • [8.x] 18 failures in test testHFEmbeddings {upgradedNodes=1} (3.0% fail rate in 610 executions)
  • [8.x] 7 failures in step 8.12.2_bwc (53.8% fail rate in 13 executions)
  • [8.x] 6 failures in step 8.14.3_bwc (50.0% fail rate in 12 executions)
  • [8.x] 5 failures in step 8.13.4_bwc (45.5% fail rate in 11 executions)
  • [8.x] 10 failures in pipeline elasticsearch-periodic (83.3% fail rate in 12 executions)

Build Scans:

maxhniebergall added a commit to maxhniebergall/elasticsearch that referenced this issue Dec 16, 2024
elastic#118664

maxhniebergall pushed a commit to maxhniebergall/elasticsearch that referenced this issue Dec 16, 2024