-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ZTS: Test case failures #9769
ZTS: Test case failures #9769
Conversation
since the testcase failures where introduced by 9fb2771 and are currently still failing with this PR. dont you think that changing testcases are just hiding a problem? according to the logs the cause of the errors are a segfault in libzpool |
Codecov Report
@@ Coverage Diff @@
## master #9769 +/- ##
========================================
+ Coverage 80% 80% +<1%
========================================
Files 385 385
Lines 121470 121470
========================================
+ Hits 96756 96874 +118
+ Misses 24714 24596 -118
Continue to review full report at Codecov.
|
614e5a6
to
5d0788a
Compare
These ZTS failures don't coincide with a regression in the upstream code, but instead with an upgrade to the CI test environment which was made yesterday. Specifically the ec2 instance types were changed to types using newer generation cpus and NVMe based devices. This resulted in some existing tests consistently failing and some occasional failures becoming very likely. I've reverted the NVMe portion of that upgrade for now, and if that's not sufficient I'll revert back to the previous generation cpus until we update the failing tests.
Can you point me at the relevant logs. Looking at the recent testing results on the master branch I have't seen any evidence of this. |
in the zstd pull request you will find it but you can also find this segfault on the test round of the most recent commit on the zfs master |
* large_dnode_008_pos - Force a pool sync before invoking zdb to ensure the updated dnode blocks have been persistented to disk. * refreserv_raidz - Wait for the /dev/zvol links to be both created and removed, this is important because the same device volume names are being used repeatedly. * btree_test - Add missing .gitignore file for btree_test binary. Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
5d0788a
to
6346659
Compare
@BrainSlayer I dont get it... |
@Ornias1993 grep for segfault 22:54:37.67 [ 2139.712371] loop: module loaded |
@BrainSlayer
These segfaults are not what @behlendorf is fixing here, this just fixes the new test failures that are related to the buildbot failures. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code is clean and results are green.
LGTM
This was actually intentionally added recently by commit 523fc80, but I can definitely see how the segfault is misleading. The |
i understand. yes this was indeed missleading. i was looking for the cause of the segfault but was not able to find anything |
* large_dnode_008_pos - Force a pool sync before invoking zdb to ensure the updated dnode blocks have been persisted to disk. * refreserv_raidz - Wait for the /dev/zvol links to be both created and removed, this is important because the same device volume names are being used repeatedly. * btree_test - Add missing .gitignore file for btree_test binary. Reviewed-by: Kjeld Schouten <kjeld@schouten-lebbing.nl> Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes openzfs#9769
* large_dnode_008_pos - Force a pool sync before invoking zdb to ensure the updated dnode blocks have been persisted to disk. * refreserv_raidz - Wait for the /dev/zvol links to be both created and removed, this is important because the same device volume names are being used repeatedly. * btree_test - Add missing .gitignore file for btree_test binary. Reviewed-by: Kjeld Schouten <kjeld@schouten-lebbing.nl> Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes openzfs#9769
* large_dnode_008_pos - Force a pool sync before invoking zdb to ensure the updated dnode blocks have been persisted to disk. * refreserv_raidz - Wait for the /dev/zvol links to be both created and removed, this is important because the same device volume names are being used repeatedly. * btree_test - Add missing .gitignore file for btree_test binary. Reviewed-by: Kjeld Schouten <kjeld@schouten-lebbing.nl> Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes openzfs#9769
* large_dnode_008_pos - Force a pool sync before invoking zdb to ensure the updated dnode blocks have been persisted to disk. * refreserv_raidz - Wait for the /dev/zvol links to be both created and removed, this is important because the same device volume names are being used repeatedly. * btree_test - Add missing .gitignore file for btree_test binary. Reviewed-by: Kjeld Schouten <kjeld@schouten-lebbing.nl> Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes #9769
Motivation and Context
Resolve several ZTS failures now being regularly observed by the CI
after changing the test instance type.
Description
devices/devices_001_pos, devices/devices_002_neg - When only NVMeblock devices exist no valid major/minor are found. Update the
create_dev_file_linux function to handle this case.
large_dnode_008_pos - Force a pool sync before invoking zdb to
ensure the updated dnode blocks have been persistented to disk.
refreserv_raidz - Wait for the /dev/zvol links to be both created
and removed, this is important because the same device volume
names are being used repeatedly.
btree_test - Add missing .gitignore file for btree_test binary.
How Has This Been Tested?
Locally verified using the ZTS. Pending full CI results to confirm
these changes do resolve the issues in the CI environment.
Types of changes
Checklist:
Signed-off-by
.