All I/O hangs in ZFS #8469
Comments
I too have been seeing this for a while, also with |
Did you have any luck getting this resolved? It makes me feel rather uneasy about our server. |
Bummer, because it does seem to be related to the amount of data written. I had this happen after just a week now :( |
Also having this issue. The system runs fine for maybe an hour, then hangs. Nothing crazy is going on in the system: just an rsync transfer that's limited to 12 MB/s, and a Docker container with ZoneMinder running 1 stream @ 1 Mbps. txg_sync freezes, as does every other process running on ZFS, and the only solution is to reboot. CPU and memory utilization both remain low. Both compression and dedup are enabled, ashift=12, 128 KB record size.
OS - Debian Stretch
NAME SIZE ALLOC FREE EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT
pool: zfspool
errors: No known data errors
ZFS Subsystem Report Wed Nov 13 11:30:46 2019
ARC Misc:
ARC Size: 98.85% 15.43 GiB
ARC Size Breakdown:
ARC Hash Breakdown:
ARC Total accesses: 497.41M
DMU Prefetch Efficiency: 105.46M
cat /proc/spl/kstat/zfs/dmu_tx
Nov 13 11:23:32 nas kernel: [59329.141166] INFO: task txg_sync:1228 blocked for more than 120 seconds. |
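For anyone comparing logs across machines, here is a minimal sketch (not from any of the reporters) that scans kernel log lines for hung-task warnings in the format quoted above and counts them per task name. The function names `find_hung_tasks` and `summarize` are hypothetical, and the regular expression assumes the warning always follows the "INFO: task NAME:PID blocked for more than N seconds" shape seen in this thread.

```python
import re
from collections import Counter

# Assumed format, taken from the log line quoted in this thread:
# "INFO: task txg_sync:1228 blocked for more than 120 seconds."
HUNG_TASK_RE = re.compile(
    r"INFO: task (?P<name>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(lines):
    """Return (task_name, pid, seconds) for every hung-task warning found."""
    hits = []
    for line in lines:
        match = HUNG_TASK_RE.search(line)
        if match:
            hits.append(
                (match.group("name"), int(match.group("pid")), int(match.group("secs")))
            )
    return hits


def summarize(lines):
    """Count hung-task warnings per task name (e.g. how often txg_sync appears)."""
    return Counter(name for name, _pid, _secs in find_hung_tasks(lines))
```

One way to use it would be to pass an open syslog file, or the captured output of `dmesg`, as `lines`. If `txg_sync` is among the first tasks reported, that would be consistent with the hangs described in this issue, though it does not by itself identify the cause.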
Happens consistently in my environment. Happened after transferring about 1TB of data via Syncthing. Twice, one after another. Then it happened during
|
@Anteru Any luck resolving or mitigating this? We see the same problem from time to time in production, Ubuntu zfs v0.7.12-1ubuntu, kernel 5.0.0. |
As a mitigation I try to reboot the server every two months, but since 20.04 I haven't seen it, so maybe that fixed it? I think that release upgrades ZFS to 0.8. |
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions. |
I saw what I think is the same issue last night. Running Debian 11, zfs version 2.0.3-9. I seemed to be able to browse the filesystem still, but the NFS server was not working, and KVM VMs that were running were not accepting connections. Needed to reboot the server to get back to normal. In syslog, multiple hung tasks reported, with the same stack trace. Example:
|
System information
Distribution Name | Ubuntu
Distribution Version | 18.04.2 LTS
Linux Kernel | 4.15.0
Architecture | x64
ZFS Version | 0.7.5-1ubuntu16.4
SPL Version | 0.7.5-1ubuntu1
Describe the problem you're observing
I/O freezes after running the server for quite some time. The server is running a bunch of virtual machines, and becomes unresponsive once this condition appears.
Describe how to reproduce the problem
No good repro case; it just happens after 90 to 120 days of uptime.
FWIW, the machine has two pools configured like this
Include any warning/errors/backtraces from the system logs
dmesg reports: