-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Illumos 5056 - ZFS deadlock on db_mtx and dn_holds #3240
Conversation
@chrisrd Great work ! 👍 this needs the file policy.h in
not exactly sure what contents it needs to contain from https://github.com/illumos/illumos-gate/blob/master/usr/src/uts/common/sys/policy.h - but I went with the minimum please adapt in this pull-request: https://github.com/zfsonlinux/zfs/pull/3240.diff the file,
<-- this builds for me (without DEBUG) let's see if this survives the buildbots ... |
@chrisrd FYI: I'll push more changes, if needed, when anything might come up and I can deal with it currently pretty busy - but still: let's get this tested and working ASAP so that @dweeezil and @behlendorf can check whether this does any difference on #3225 |
@kernelOfTruth The @6dc1dd0 patch does contain the policy.h file in that path. I'm struggling to understand exactly what the build is doing at the point of failure, it looks like it's extracted stuff from a tar ball and is building in a separate directory, but the policy.h file is missing from the tar ball, e.g. from one of the build failure log files:
...Oh, I just got it! I need to add the file to the Makefile.in in the directory where I added the file, that's used to create the tar ball. I'll push an update soon... |
I know, still the naming scheme seems "off" - some files I checked, e.g. vnode.h , types.h start with _ LIBSPL _
anyway - great that you found the cause - let's see if the buildbot can proceed it didn't in #3241 ;) (or that Ubuntu 12.04 buildbot is supposed to do that) |
OK, let's see how that goes. I've adjusted the |
a62c0cb
to
2d17463
Compare
okay great, so now it builds 👍
so the modules seemingly aren't being imported ? or do working shell_7 test modules and shell_9 test ztest indicate that with shell_11 test import failed shell_10 test zconfig failed show that there are problems with the pool or its configuration other than with the modules ? ... alright, so the modues are being imported, I'm wondering why the buildbot doesn't show any error (red) on dmesg output
apparently it's failing a test_1 https://github.com/zfsonlinux/zfs/blob/master/scripts/zconfig.sh will take a look tomorrow - been a long day & most of this is new to me - never had to deal with these kind of issues 😉 some of you guys probably know by heart what could be the problem (removed some of the clutter to improve overview) edit:
include/sys/list.h
|
@chrisrd the |
@chrisrd I am convinced we should drop the special handling involving |
@nedbass Thanks - I'll look harder to see if I can follow the reasoning and drop the special handling. |
Added 2 new patches fixing issues introduced by Illumos 5056 and causing buildbot failures. |
wow - besides that everything's green, great job 👍 I'm curious if these changes help with the lockups on ZoL |
OK, new push for the style fail |
Note: @49ce3ad - |
@chrisrd you can now drop |
@behlendorf Thanks - One of the two buildbot failure (prior to push) looks like a test harness failure rather than related to the patch stack:
The other is #3075:
|
e273b87
to
3be7851
Compare
Redone my test experienced here : #3190 (comment) with the pull request #3240 (just before "Patch out crgetzomeid()" drop).
|
rejects due to openzfs#2129 in: module/zfs/dbuf.c module/zfs/zap.c module/zfs/dsl_dir.c module/zfs/dsl_dataset.c module/zfs/dnode.c
Crap! I didn't mean to close this. Again (#3239). I wanted the buildbots to work on the patch stack on top of the newly merged code, so I rebased my local branch and force pushed it. After 15m the new commits hadn't shown up in this pull request, so I thought I'd delete all the changes on my side, force push again, then reapply the patch stack and push once more. But the force push closed the pull request and the 'Reopen and comment' button is greyed out. How is this supposed to work?!?! Sigh. Looks like I have to do another pull request: #3254 |
Pull in Illumos 5056 with a largely mechanical application of Illumos patches.
Let's get the buildbots chewing on this patch stack...
Whilst the individual patches are likely worth applying in their own right at some point, it will be interesting to see if the stack helps with #3225.
@a22f7a0: the
freespill
stuff indbuf_free_range()
was more than mechanical and I'm not sure I have it right. This was related to @4254acb from @dweeezil and @58806b4 from @nedbass. Would you guys be able to take a look at that in particular?