Fix an issue in `Chunk#toBytes` #943

adamretter · 2017-10-05T15:38:26Z

Fixes an issue where `Chunk#toBytes` returns too many bytes when a `Chunk.Bytes` has a length less than that of its backing array.

This may not be the most elegant, efficient or even correct way to fix this, but it certainly causes the test case that I have added to pass, where it previously failed.

SystemFw · 2017-10-05T15:45:18Z

Hi @adamretter, thanks for sending this in. I'll investigate some more to see if we can keep this optimisation without compromising md5. Good catch!

SystemFw · 2017-10-05T15:59:19Z

I think we can keep this optimisation by giving the `Bytes` class both a `values` and a `rawValues`, where `values` would be `rawValues.take(size)`. That way you would still have access to the raw array should you need it (I haven't checked that we do, but I'm thinking of Chunks backed by a subsequence of an array), while fixing this bug and avoiding pointless copies. The same applies to the other Chunk.PrimitiveTypes classes.

@mpilquist WDYT?

EDIT: Alternatively, we could simply modify `md5` to do the right thing, but I'm not sure we wouldn't be bitten by a similar bug again in another scenario.

mpilquist · 2017-10-06T11:57:03Z

Hm, I think this is okay as-is. The `Bytes` class has a `values` array along with an `offset` and `length`. We could rename `values` to `underlying` or something similar to clarify that `values` is the source array, not the visible slice. If folks want the subsequence as an array, they can call `.toArray`.

Note that `hash` is also incorrectly ignoring the `offset` parameter.
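
To make that layout concrete, here is a minimal sketch in plain Scala. `BytesView` is a hypothetical stand-in and not fs2's actual `Chunk.Bytes`; it assumes only the three fields named above and a `toArray` that copies out the visible slice.

```scala
// Hypothetical stand-in for a chunk backed by part of a larger array.
// This is NOT fs2's Chunk.Bytes; it only mirrors the fields named above.
final case class BytesView(values: Array[Byte], offset: Int, length: Int) {
  // Copies only the visible slice; the backing array is left untouched.
  def toArray: Array[Byte] =
    java.util.Arrays.copyOfRange(values, offset, offset + length)
}

object BytesViewDemo {
  def main(args: Array[String]): Unit = {
    val backing = Array[Byte](1, 2, 3, 4, 5, 6)
    val view = BytesView(backing, offset = 2, length = 3)
    println(view.values.length)  // 6: the whole backing array
    println(view.toArray.toList) // List(3, 4, 5): only the slice
  }
}
```

Any code that reads `values` directly, without applying `offset` and `length`, sees all six bytes rather than the three that belong to the view.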

SystemFw · 2017-10-06T13:22:16Z

Right, so `Bytes` should stay the same, save for maybe renaming `values` to `underlying`.

Should we change `toBytes`, or just fix `hash` to take only the portion of the array going from `offset` to `offset + size`?

mpilquist · 2017-10-06T13:25:25Z

Yeah, let's fix `hash` to operate on the proper slice. It did that correctly in 0.9: https://github.com/functional-streams-for-scala/fs2/blob/series/0.9/core/jvm/src/main/scala/fs2/hash/hash.scala#L19

I think I know how this bug was introduced in 0.10. Early in the 0.10 work, `Chunk.Bytes` didn't have an offset or size. The `hash` object must have been ported at that time. Later in 0.10 development, we added back the offset/size feature for performance reasons but forgot to update `hash`.
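
A minimal sketch of the difference, using the JDK's `java.security.MessageDigest` directly. The helper names `digestWholeArray` and `digestSlice` and their parameters are hypothetical and are not taken from fs2's `hash` object; the point is only that `update(input, offset, len)` hashes exactly the visible slice, while `update(input)` hashes the entire backing array.

```scala
import java.security.MessageDigest

object DigestSliceDemo {
  // Buggy shape: ignores offset and length, so the whole backing array is hashed.
  def digestWholeArray(values: Array[Byte]): Array[Byte] = {
    val md = MessageDigest.getInstance("MD5")
    md.update(values)
    md.digest()
  }

  // Intended shape: hashes only the `length` bytes starting at `offset`.
  def digestSlice(values: Array[Byte], offset: Int, length: Int): Array[Byte] = {
    val md = MessageDigest.getInstance("MD5")
    md.update(values, offset, length)
    md.digest()
  }

  def main(args: Array[String]): Unit = {
    val backing = "xxhelloxx".getBytes("UTF-8") // the slice "hello" starts at index 2
    val expected = digestWholeArray("hello".getBytes("UTF-8"))
    println(digestSlice(backing, 2, 5).sameElements(expected)) // true
    println(digestWholeArray(backing).sameElements(expected))  // false
  }
}
```

Passing the offset and length to `update` also avoids copying the slice into a fresh array first.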

SystemFw · 2017-10-06T13:27:45Z

I'll send in a PR ASAP.

SystemFw · 2017-10-06T13:46:30Z

Thanks a lot @adamretter for spotting this! Closing :)

Commit 873a4bd: Fix an issue where too many bytes are returned from Chunk#toBytes, when a Chunk.Bytes has a length less than the backing array

SystemFw self-assigned this Oct 5, 2017

SystemFw mentioned this pull request Oct 6, 2017: Make sure MessageDigest respects Chunk size and offset #944 (Merged)

SystemFw closed this Oct 6, 2017
