-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] AssertionError in OsProbe.readProcSelfCgroup #77833
Comments
Pinging @elastic/es-core-infra (Team:Core/Infra) |
More occurrences of this: https://gradle-enterprise.elastic.co/s/rb3ky43i6j34c |
Another occurrence here https://gradle-enterprise.elastic.co/s/jtr5tqzsg2eq2 |
I took a look at this issue and I believe it has to do with Oracle Linux 6. After a fresh VM install I checked the contents of /proc/self/cgroup and it was empty, it was there on the file system, but nothing in it. After a full shutdown and reboot the file was filled in properly with the basic set of entries, like :memory, :cpu, etc. I don't know if this an intermittent issue with the kernel shipped in Oracle Linux 6.10 (4.1.12) or a temporary problem on first boot after install, but the /proc/self/cgroup file can be empty it seems on this version. I haven't been able to reproduce this issue with newer kernel versions (I tried Ubuntu 20.04 and latest RHEL 8.4). The 3 reported failures that we've had so far were while running Oracle Linux 6. We've had the assert that fails for a while, but it seems we introduced a test for it the first time in #77128. I think the fix would be to remove the assert that's causing this failure and treat the empty cgroups file just as the non-existent file case. The second conditional for the assert |
We've removed support for OEL-6 in 8.0 (see #51480), so I don't think we should relax the assertion there, but in 7.x I guess it is necessary. |
OK makes sense, I didn't realize all of these failed on 7.x only. I can make the fix for 7.x only. |
Older versions of the Linux kernel, e.g. 4.1.12 which is found in OEL-6, can sometimes have empty cgroup file causing a test assertion. This change removes the assert and handles the empty file like a non-existent file. Closes #77833
This was fixed in #78659. I'm not sure why auto-close didn't work, I'll close this for now. |
Multiple tests failed with
process was found dead while waiting for ports files
. The underlying issue is shown in the test cluster log file: The node crashed when starting up because it didn't read any valid content from the/proc/self/cgroup
file. I wonder whether there is a subtle racing condition somwhere setting up the test.Build scan:
https://gradle-enterprise.elastic.co/s/lfrzuigm2obi4
https://gradle-enterprise.elastic.co/s/wruq4uitjwndq
Repro line:
N/A
Reproduces locally?:
Didn't try
Applicable branches:
7.x
Failure history:
N/A
Failure excerpt:
The text was updated successfully, but these errors were encountered: