Improve ntfs_device_size_get for file #45

kgermanov · 2022-07-01T11:21:12Z

For avoid random read we can use IO callback for get size from struct stat.
Issue: #46

For avoid random read we can use IO callback for get stat.

kgermanov · 2022-07-01T11:46:13Z

@szakacsits What do you think about this?

jpandre · 2022-07-01T14:45:38Z

AFAICT this does not work for a block device. You should check whether you are mounting a regular file.

kgermanov · 2022-07-04T09:15:50Z

@jpandre Usually block device should be handled by some ioctl codes. But anyway we should handle the situation, when it cannot, thx.

kgermanov · 2022-07-12T06:36:29Z

@jpandre Is it ok now?

unsound · 2022-07-12T06:42:20Z

@kgermanov While trusting 'stat' is usually fine, the binary search method is more reliable with regard to the actual underlying file characteristics and it doesn't require that many calls. In addition it ensures that the entire length of the file can be accessed (e.g. it's not corrupted in the file system).
Can you explain in what way the binary search method is problematic for you?

kgermanov · 2022-07-12T08:15:28Z

@unsound For 2TB disk it requred about log(2*1024*1024*1024*1024) = 41 calls. But most problem in that this calls are randomly over whole disk (Did not work buffered IO).
If underlying fs does not fast (for example on slow NFS) there is may be problem.
Corrupted file system can break read calls for any chunk, binary search does not provide guarantees.

unsound · 2022-07-12T09:15:07Z

Maybe it's just me but 41 I/O requests doesn't seem like much, even random ones, compared to what the driver would issue during normal filesystem operation. How much does your fix speed up mounting in this particular scenario?

The particular corruption I'm thinking about is when internal extents end before logical ones, and also some filter filesystems can choose to expose a device as a 0-byte (indeterminate) file, which if we apply this patch couldn't be opened.

kgermanov · 2022-07-12T11:10:53Z

@unsound My case was extremly corner: each read's call take 1s. In this situation impove from 1 min to 5 sec for 2TB file for mount.
We can do binary search if stat return size less than 512.

jpandre · 2022-07-12T11:44:37Z

AFAICT ntfs_device_size_get() is not used while mounting (mounting relies on data stored in the boot sector). It is only used by a few ntfsprogs : mkntfs, ntfsclone, ntfsfix, ntfslabel and ntfsresize.

unsound · 2022-07-12T11:53:09Z

@jpandre Ah, I could have sworn I've seen it used in the library code as a sanity check but I may have confused it with another project. Then the impact is much smaller than I thought initially.

Improve ntfs_device_size_get for file

1f896b1

For avoid random read we can use IO callback for get stat.

Check if it is regular file

48443a6

Check if lstat size more that zero

a302a19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve ntfs_device_size_get for file #45

Improve ntfs_device_size_get for file #45

kgermanov commented Jul 1, 2022 •

edited

Loading

kgermanov commented Jul 1, 2022

jpandre commented Jul 1, 2022

kgermanov commented Jul 4, 2022 •

edited

Loading

kgermanov commented Jul 12, 2022

unsound commented Jul 12, 2022

kgermanov commented Jul 12, 2022 •

edited

Loading

unsound commented Jul 12, 2022 •

edited

Loading

kgermanov commented Jul 12, 2022

jpandre commented Jul 12, 2022

unsound commented Jul 12, 2022

Improve ntfs_device_size_get for file #45

Are you sure you want to change the base?

Improve ntfs_device_size_get for file #45

Conversation

kgermanov commented Jul 1, 2022 • edited Loading

kgermanov commented Jul 1, 2022

jpandre commented Jul 1, 2022

kgermanov commented Jul 4, 2022 • edited Loading

kgermanov commented Jul 12, 2022

unsound commented Jul 12, 2022

kgermanov commented Jul 12, 2022 • edited Loading

unsound commented Jul 12, 2022 • edited Loading

kgermanov commented Jul 12, 2022

jpandre commented Jul 12, 2022

unsound commented Jul 12, 2022

kgermanov commented Jul 1, 2022 •

edited

Loading

kgermanov commented Jul 4, 2022 •

edited

Loading

kgermanov commented Jul 12, 2022 •

edited

Loading

unsound commented Jul 12, 2022 •

edited

Loading