feat: support reading of custom-length RNTuple floats and suppressed columns #1347

ariostas · 2024-12-06T15:40:19Z

This PR implements the reading of Real32Trunc and Real32Quant fields, which have a variable number of bits in the ranges 10-31 and 1-32, respectively.

It also adds support for suppressed columns, at least in simple cases.

ariostas · 2024-12-12T19:06:30Z

It ended up only taking a few lines to support suppressed columns. I'll improve the reading of floats and add some tests.

ariostas · 2024-12-13T19:47:11Z

This PR is ready for review, but it needs scikit-hep/scikit-hep-testdata#167 for the tests to pass.

Also, since there is still an issue with Dask, we'll just have to wait until that is resolved.

ariostas · 2024-12-19T18:31:22Z

@jpivarski I fixed the issue with Numpy 1 and I improved the tests. I still had to use np.isclose for some of the comparisons due to small differences, probably from ROOT rounding things a little differently.

jpivarski

It's fine for comparisons with ROOT to involve isclose with tight tolerances (around the scale of numerical precision). What I meant was that if you know you'll be truncating at the Nth digit, it's better to test against an expected value that's truncated at the Nth digit, with isclose picking up any small errors, rather than using isclose to check for differences with respect to the original value. My reasoning was just that the Nth digit might be a huge error and leaving a window wide open like that could fail to identify some errors.

As a whole, this PR looks great and I'd say it's ready to be merged. This is the PR that brings RNTuple-reading up to 100% coverage, right?

ariostas · 2024-12-19T19:49:13Z

This is the PR that brings RNTuple-reading up to 100% coverage, right?

Yes, as far as I can tell, after this PR we'll have 100% coverage of the current spec.

ariostas added 2 commits December 6, 2024 10:33

Started implementing reading of quantized and truncated floats

0820794

Added support for suppressed columns

0df3073

ariostas added 3 commits December 13, 2024 14:19

Added tests

0e69c15

Cleaner reading of floats with 1, 2, or 3 bytes

fb3a304

Only support little-endian systems

ebfe52c

ariostas changed the title ~~feat: support reading of custom-length RNTuple floats~~ feat: support reading of custom-length RNTuple floats and suppressed columns Dec 13, 2024

ariostas marked this pull request as ready for review December 13, 2024 19:46

ariostas added 4 commits December 19, 2024 11:18

Merge branch 'main' into ariostas/rntuple_floats

6d39cc5

Fixed bug with Numpy 1

1f8d1c8

Improved tests

6d8ddbd

Fixed tests for Numpy 2

a2b720c

ariostas requested a review from jpivarski December 19, 2024 18:31

jpivarski approved these changes Dec 19, 2024

View reviewed changes

ariostas merged commit 4ee1a2d into main Dec 19, 2024
26 checks passed

ariostas deleted the ariostas/rntuple_floats branch December 19, 2024 19:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support reading of custom-length RNTuple floats and suppressed columns #1347

feat: support reading of custom-length RNTuple floats and suppressed columns #1347

ariostas commented Dec 6, 2024 •

edited

Loading

ariostas commented Dec 12, 2024

ariostas commented Dec 13, 2024

ariostas commented Dec 19, 2024 •

edited

Loading

jpivarski left a comment

ariostas commented Dec 19, 2024

feat: support reading of custom-length RNTuple floats and suppressed columns #1347

feat: support reading of custom-length RNTuple floats and suppressed columns #1347

Conversation

ariostas commented Dec 6, 2024 • edited Loading

ariostas commented Dec 12, 2024

ariostas commented Dec 13, 2024

ariostas commented Dec 19, 2024 • edited Loading

jpivarski left a comment

Choose a reason for hiding this comment

ariostas commented Dec 19, 2024

ariostas commented Dec 6, 2024 •

edited

Loading

ariostas commented Dec 19, 2024 •

edited

Loading