-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Autoreplace not working #2449
Comments
See https://github.com/zfsonlinux/zfs/blob/master/cmd/zed/zed.d/io-spare.sh#L19-22 and the rest of the script for details (and also the checksum-spare.sh script). At the moment, the replace operation will only be performed in response to I/O or checksum errors and then, only if you've set the appropriate flags in the The zed and its scripts are new for 0.6.3 and are a work-in-progress at the moment. |
I understand what you are writing, so please consider this as a feature request instead, to get the situation where a disk becoming unavailable will be considered an error condition, triggering the use of a spare disk. |
oh, I can't wait to see this in 0.6.5! update: I have replaced my io-spare.sh with the version above referenced by @dweeezil and restarted the zed daemon, but it still does not initiate a resilver :( |
this appears to work on a testcase using latest released version - 0.6.4-1.1-2 on debian jessie - see comment #3478 (comment) however it fails with io-spare.sh exit code 4 under cciss storage driver. how can I make zed/io-spare detect errors when my disks are not named /dev/sdX but /dev/cciss/cXdY ? |
@mailinglists35 to make the The
|
@behlendorf thanks for the light; is this the right issue to follow to see when it is woking? #2012 |
@mailinglists35 that's related but unnecessary to fix this. The script already has the guid, we just need to extend the utilities and I don't think there an issue open for that. |
@behlendorf I'm confused, then where/what should I watch for to know when the utilities can do that? The problem that actually I need to solve is to have zfs automatically use a hot spare when i/o or checksum happen on an old hp P400 controller having each physical drive behind a corresponding logicaldrive under /dev/cciss/c0dX (the new "hpsa" driver replacing "cciss" which has normal scsi naming /dev/sdX does not support the p400 controller). |
I'm currently testing ZFS on Linux v0.6.3 on Debian Wheezy, using a LSI2008 based controller with the disks in JBOD mode. And I have configured the system as shown:
zpool status tank
pool: tank
state: ONLINE
scan: none requested
config:
errors: No known data errors
And then I pull out a disk to see what happens in an error condition.
zpool status tank
pool: tank
state: DEGRADED
status: One or more devices could not be used because the label is missing
or
invalid. Sufficient replicas exist for the pool to continue
functioning in a degraded state.
action: Replace the device using 'zpool replace'.
see: http://zfsonlinux.org/msg/ZFS-8000-4J
scan: none requested
config:
errors: No known data errors
zpool get autoreplace tank
NAME PROPERTY VALUE SOURCE
tank autoreplace on local
And now I wonder why the system has not started resilvering using the hot spare.......
The text was updated successfully, but these errors were encountered: