WIP: Zvol trim #16

lundman · 2021-04-14T04:38:54Z

Handle incoming SCSIOP_UNMAP and place it into the same WrkRtn as IO, to queue and handle cancel. Then fan out to call ScsiOpUnmap_impl() instead of IO calls.

Parse the struct with the unmap ranges, calling zvol_os_unmap() for each.

Find a way to test it, as ntfs didn't "just enable" it when I tested in VM :)

cstyle still needs to be applied

Unknown how other platforms get there

Not using vfork() and execve()

Amusingly, usleep() is more portable than poll() as a timing function.

These should probably rely on HAVE_UIO_H etc

Should probably rely on HAVE_UIO_H

We'll just leak for now. Also clear out the fileobjects

As we set S_IFBLK in stat_blk() so we can detect if something is a disk or file, and Windows stat.h S_IFMT isn't wide enough to contain it. We now use internal stat.h values only, and we will have to convert any calls to Windows _stat() into our values. However, we do not use _stat() with posix.c, but rather simulate them anyway.

But uses stat.st_mode to test for BLK or REG. Using the example of zfs_resolve_shortname("Harddisk1Partition2"); *************** if (arg[0] == '/') { } else { err = is_shorthand_path(arg, path, sizeof (path), &statbuf, &wholedisk); } } } /* * Determine whether this is a device or a file. */ if (wholedisk || S_ISBLK(statbuf.st_mode)) { ***************** Possibly the idea is that is_shorthand_path() is expected to call stat(), as it is passed the struct. The path will expand into "\\?\Harddisk1Partition2" in zfs_resolve_shortname() which can be opened, and stat()ed. However; error = zfs_resolve_shortname(arg, path, path_size); if (error == 0) { *wholedisk = zfs_dev_is_whole_disk(path); if (*wholedisk || (stat64(path, statbuf) == 0)) It correctly determins that wholedisk should be FALSE (it is just a partition) but because of that, the short-circuit "||" will not call stat64(), and the statbuf is used uninitialised in the S_ISBLK() test. At least on Windows, we always want to call stat64(), after the zfs_resolve_shortname() comes back (and it should be available at this point).

Take partition names, in the format "HarddiskXPartitionY". $ ./zpool.exe create DOOM Harddisk1Partition3 Expanded path to '\\?\Harddisk1Partition3' $ ./zpool.exe list NAME SIZE ALLOC FREE CKPOINT EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT DOOM 15G 99.5K 15.0G - - 0% 0% 1.00x ONLINE - $ ./zpool.exe status pool: DOOM state: ONLINE status: Mismatch between pool hostid and system hostid on imported pool. This pool was previously imported into a system with a different hostid, and then was verbatim imported into this system. action: Export this pool on all systems on which it is imported. Then import it to correct the mismatch. see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-EY config: NAME STATE READ WRITE CKSUM DOOM ONLINE 0 0 0 Harddisk1Partition3 ONLINE 0 0 0 Also, only Offline wholedisks - or we would offline the disk containing our partition.

Look it up again if called.

Unsure why it doesn't work as advertised. Have to hardcode path to correct library for now. FindOpenSSL picks MD versions and not MT. Path ends up all wrong with "optimizedC:" strings and so on.

This is a large commit, but it should have no functional changes, just cstyle corrections. * fix for cstyle. Had to guess some things. Move ident in made some places nicer. * Correct typos in cstyle Thanks for the mindless work Co-authored-by: Kajerik Lundman <kajerik@lundman.net>

How long as the test for vp==NULL VN_RELE been there, and how has it not died sooner.

Random has no need for fd in Windows. It was tempting to change pthread.h to return 0 instead of ENOTSUP, but it is more correct to signal we don't support it.

Since zdb will open "disks" with vdev_file in userland, it needs to understand the partition hack.

So we need to add some more API calls to handle it. There is still change needed in abd.c It might in fact be easier to not call aligned_malloc, and handle alignment internally.

Find a way to test it.

lundman · 2021-04-14T04:39:29Z

module/os/windows/zfs/zfs_windows_zvol_scsi.c

@@ -343,6 +343,11 @@ ScsiExecuteMain(
 			status = ScsiOpReportLuns(pHBAExt, pSrb);
 			break;

+		case SCSIOP_UNMAP:
+			DbgBreakPoint();


obvs remove before merge

lundman · 2021-07-02T09:41:49Z

PR messed up, merged

Under certain loads, the following panic is hit: panic: page fault KDB: stack backtrace: #0 0xffffffff805db025 at kdb_backtrace+0x65 #1 0xffffffff8058e86f at vpanic+0x17f #2 0xffffffff8058e6e3 at panic+0x43 #3 0xffffffff808adc15 at trap_fatal+0x385 #4 0xffffffff808adc6f at trap_pfault+0x4f #5 0xffffffff80886da8 at calltrap+0x8 #6 0xffffffff80669186 at vgonel+0x186 #7 0xffffffff80669841 at vgone+0x31 #8 0xffffffff8065806d at vfs_hash_insert+0x26d #9 0xffffffff81a39069 at sfs_vgetx+0x149 #10 0xffffffff81a39c54 at zfsctl_snapdir_lookup+0x1e4 #11 0xffffffff8065a28c at lookup+0x45c #12 0xffffffff806594b9 at namei+0x259 #13 0xffffffff80676a33 at kern_statat+0xf3 #14 0xffffffff8067712f at sys_fstatat+0x2f #15 0xffffffff808ae50c at amd64_syscall+0x10c #16 0xffffffff808876bb at fast_syscall_common+0xf8 The page fault occurs because vgonel() will call VOP_CLOSE() for active vnodes. For this reason, define vop_close for zfsctl_ops_snapshot. While here, define vop_open for consistency. After adding the necessary vop, the bug progresses to the following panic: panic: VERIFY3(vrecycle(vp) == 1) failed (0 == 1) cpuid = 17 KDB: stack backtrace: #0 0xffffffff805e29c5 at kdb_backtrace+0x65 #1 0xffffffff8059620f at vpanic+0x17f #2 0xffffffff81a27f4a at spl_panic+0x3a #3 0xffffffff81a3a4d0 at zfsctl_snapshot_inactive+0x40 #4 0xffffffff8066fdee at vinactivef+0xde #5 0xffffffff80670b8a at vgonel+0x1ea #6 0xffffffff806711e1 at vgone+0x31 #7 0xffffffff8065fa0d at vfs_hash_insert+0x26d #8 0xffffffff81a39069 at sfs_vgetx+0x149 #9 0xffffffff81a39c54 at zfsctl_snapdir_lookup+0x1e4 #10 0xffffffff80661c2c at lookup+0x45c #11 0xffffffff80660e59 at namei+0x259 #12 0xffffffff8067e3d3 at kern_statat+0xf3 #13 0xffffffff8067eacf at sys_fstatat+0x2f #14 0xffffffff808b5ecc at amd64_syscall+0x10c #15 0xffffffff8088f07b at fast_syscall_common+0xf8 This is caused by a race condition that can occur when allocating a new vnode and adding that vnode to the vfs hash. If the newly created vnode loses the race when being inserted into the vfs hash, it will not be recycled as its usecount is greater than zero, hitting the above assertion. Fix this by dropping the assertion. FreeBSD-issue: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=252700 Reviewed-by: Andriy Gapon <avg@FreeBSD.org> Reviewed-by: Mateusz Guzik <mjguzik@gmail.com> Reviewed-by: Alek Pinchuk <apinchuk@axcient.com> Reviewed-by: Ryan Moeller <ryan@iXsystems.com> Signed-off-by: Rob Wing <rob.wing@klarasystems.com> Co-authored-by: Rob Wing <rob.wing@klarasystems.com> Submitted-by: Klara, Inc. Sponsored-by: rsync.net Closes openzfs#14501

If a zvol has more than 15 partitions, the minor device number exhausts the slot count reserved for partitions next to the zvol itself. As a result, the minor number cannot be used to determine the partition number for the higher partition, and doing so results in wrong named symlinks being generated by udev. Since the partition number is encoded in the block device name anyway, let's just extract it from there instead. Reviewed-by: Tony Hutter <hutter2@llnl.gov> Reviewed by: Brian Behlendorf <behlendorf1@llnl.gov> Reviewed-by: Tino Reichardt <milky-zfs@mcmilk.de> Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com> Closes openzfs#15904 Closes openzfs#15970

lundman added 30 commits October 12, 2020 11:45

Windows: bring in SPL sourcefiles

3bbfff6

cstyle still needs to be applied

Windows: ZFS source file place holders

5bea50d

Windows: Add source files and CMake

33f79d9

Upstream: Add Windows support to OpenZFS

abd9f7e

Windows: Compile kernel driver, and load

caab128

Windows: bring in userland

342fb4a

Windows: port userland, and cmd/zpool

fe67d68

Upstream: add termios.h

fcaaf5a

Unknown how other platforms get there

Upstream: highbits64 already defined.

a9f0e3e

Upstream: Pass zfs_handle_t to unmount

9dab053

Upstream: this probably isn't the header for NORETURN

29411a4

Upstream: Add MSVC to comiler directives

1725010

Upstream: No OS xdr header

92c09d7

Upstream: dance around constructor

4b08049

Upstream: Additional header

2204109

Upstream: Add Windows VENDOR

c531c7a

Upstrema: Add Windows mount options

f6b05d4

Upstream: Windows will do own run_process

97f38b0

Not using vfork() and execve()

Upstream: Use usleep

fa5631c

Amusingly, usleep() is more portable than poll() as a timing function.

Upstream: Add TASKQ_EMPTY

37d3002

Upstream: Remove dlopen and global var for now

53b80e0

Windows: squash

cb9c129

Upstream: Add Windows properties

00cb9c5

Upstream: No OS libgen.h

2319fa2

These should probably rely on HAVE_UIO_H etc

Upstream: No OS uio.h

26e25ef

Should probably rely on HAVE_UIO_H

Windows: port cmd/zfs

05eacb8

Upstream: OS defines highbit64

6d1d6d7

Windows: bring in cmd/zdb cmd/zstream cmd/kstat

ae586f3

Windows: cmake files

fc24af1

Windows: source files are not optional

6662d15

lundman and others added 23 commits February 12, 2021 15:13

Windows: hack in export\umount

a96cc4a

We'll just leak for now. Also clear out the fileobjects

Windows: vcruntime.dll dependency removal

0bfe2d9

Windows: zpool create fixes

640e79b

Upstream: Add partition after zpool_label_disk

6667821

Windows: Allow harddiskXPartitionY shortname

110c6d9

Upstream: update_vdev_config_dev_strs() can change path

8eccd0b

Look it up again if called.

Windows: Handle partition zpool import

dcdf629

Windows: Fight CMake and OpenSSL

1fdda70

Unsure why it doesn't work as advertised. Have to hardcode path to correct library for now. FindOpenSSL picks MD versions and not MT. Path ends up all wrong with "optimizedC:" strings and so on.

Windows: Check for error code when calling zfs_dirlook()

2f7bbd3

Windows: re-arrange the release of stream dvp

944f0c7

Upstream: allow Windows zvols to have 1 longhold

a00d883

Windows: fix zvol export/destroy

35a9024

Upstream: strftime differences cause abort()

bd8d764

vnode_put(NULL) and missing mutex

ef8bfe7

How long as the test for vp==NULL VN_RELE been there, and how has it not died sooner.

Upstream: Avoid ASSERTs in zdb

9b8e054

Random has no need for fd in Windows. It was tempting to change pthread.h to return 0 instead of ENOTSUP, but it is more correct to signal we don't support it.

Upstream: ddi_stroul would not return end ptr

a41e650

Windows: userland vdev_file needs to handle partition hack

43ce553

Since zdb will open "disks" with vdev_file in userland, it needs to understand the partition hack.

Upstream: aligned_malloc must call aligned_free

7d7aab6

So we need to add some more API calls to handle it. There is still change needed in abd.c It might in fact be easier to not call aligned_malloc, and handle alignment internally.

WIP: Handle scsi unmap/discard requests to zvol_unmap

e04f197

Find a way to test it.

lundman commented Apr 14, 2021

View reviewed changes

lundman force-pushed the windows branch from f970eb7 to b4b6418 Compare July 1, 2021 11:28

lundman closed this Jul 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Zvol trim #16

WIP: Zvol trim #16

lundman commented Apr 14, 2021

lundman Apr 14, 2021

lundman commented Jul 2, 2021

WIP: Zvol trim #16

WIP: Zvol trim #16

Conversation

lundman commented Apr 14, 2021

lundman Apr 14, 2021

Choose a reason for hiding this comment

lundman commented Jul 2, 2021