rlp: minor optimizations for slice/array encoding #23467

fjl · 2021-08-25T13:16:20Z

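As the benchmark results below show, these changes make encoding of most consensus objects roughly 3-14% faster (legacy transaction encoding is unchanged within noise) and header decoding about 2% faster.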

name                             old time/op    new time/op    delta
EncodeRLP/legacy-header-8           384ns ± 1%     331ns ± 3%  -13.83%  (p=0.000 n=7+8)
EncodeRLP/london-header-8           411ns ± 1%     359ns ± 2%  -12.53%  (p=0.000 n=8+8)
EncodeRLP/receipt-for-storage-8     251ns ± 0%     239ns ± 0%   -4.97%  (p=0.000 n=8+8)
EncodeRLP/receipt-full-8            319ns ± 0%     300ns ± 0%   -5.89%  (p=0.000 n=8+7)
EncodeRLP/legacy-transaction-8      389ns ± 1%     387ns ± 1%     ~     (p=0.099 n=8+8)
EncodeRLP/access-transaction-8      607ns ± 0%     581ns ± 0%   -4.26%  (p=0.000 n=8+8)
EncodeRLP/1559-transaction-8        627ns ± 0%     606ns ± 1%   -3.44%  (p=0.000 n=8+8)
DecodeRLP/legacy-header-8           831ns ± 1%     813ns ± 1%   -2.20%  (p=0.000 n=8+8)
DecodeRLP/london-header-8           824ns ± 0%     804ns ± 1%   -2.44%  (p=0.000 n=8+7)

I created these changes before starting work on the code generator. Mostly submitting because it took non-trivial effort to find these optimizations, so I feel like the changes are valuable even if we end up not using the reflect-based encoder/decoder later.

Passing the length to byteArrayBytes makes it possible to inline the function. For arrays, the length is known at encoder construction time, so the call to v.Len() can be avoided.
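A minimal sketch of the idea, not the actual rlp package code: only the name byteArrayBytes comes from this PR, while its signature and body here, along with makeByteArrayReader and demoBytes, are assumptions made for illustration. The array length is read from the type once, when the closure is built, and handed to the helper on every call.

```go
package main

import (
	"fmt"
	"reflect"
)

// byteArrayBytes returns the contents of the byte array v as a slice.
// The caller supplies the length, so no v.Len() call is needed here.
// v must be addressable, otherwise Slice panics.
func byteArrayBytes(v reflect.Value, length int) []byte {
	return v.Slice(0, length).Bytes()
}

// makeByteArrayReader is a hypothetical constructor: it reads the array
// length from the type once and captures it in the returned closure.
func makeByteArrayReader(typ reflect.Type) func(reflect.Value) []byte {
	length := typ.Len() // known at construction time
	return func(v reflect.Value) []byte {
		return byteArrayBytes(v, length)
	}
}

// demoBytes runs the reader on a 4-byte array (demo helper).
func demoBytes(in [4]byte) []byte {
	read := makeByteArrayReader(reflect.TypeOf(in))
	// Elem of a pointer is addressable, which Slice requires for arrays.
	return read(reflect.ValueOf(&in).Elem())
}

func main() {
	fmt.Printf("%x\n", demoBytes([4]byte{0xde, 0xad, 0xbe, 0xef}))
}
```

Because the helper body is a single expression with no call to v.Len(), it is a plausible candidate for inlining by the compiler.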

For pointer encoding, it's actually cheaper to call Elem first instead of IsNil, because Elem performs fewer checks on the value. If the pointer is nil, the result of Elem is 'invalid'.
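The Elem-first pattern can be sketched with the standard reflect package. describePtr and describe are hypothetical helpers for this example and are not part of the rlp package; the point is only that a nil pointer is detected through IsValid on the result of Elem, without calling IsNil.

```go
package main

import (
	"fmt"
	"reflect"
)

// describePtr expects v to hold a pointer. It calls Elem directly and
// checks the result: Elem of a nil pointer yields the zero Value, for
// which IsValid reports false.
func describePtr(v reflect.Value) string {
	elem := v.Elem()
	if !elem.IsValid() {
		return "nil"
	}
	return fmt.Sprint(elem.Interface())
}

// describe wraps a *int for the demo (hypothetical helper).
func describe(p *int) string {
	return describePtr(reflect.ValueOf(p))
}

func main() {
	n := 42
	fmt.Println(describe(&n))
	fmt.Println(describe(nil))
}
```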

For empty slices/arrays, we can avoid storing a list header entry in the encoder buffer. The tail check is also no longer done at encoding time, because its result is already known at encoder construction time.
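Both points can be shown in a toy encoder. This is a sketch under simplifying assumptions, not the rlp package implementation: toyBuf, writer, writeItems, makeSliceWriter and encode are hypothetical names, items are limited to single bytes below 0x80, payloads must stay under 56 bytes, and the header is written inline rather than recorded and filled in later.

```go
package main

import (
	"fmt"
	"reflect"
)

// toyBuf is a simplified, hypothetical stand-in for an encoder buffer.
// headers counts how many list header entries were recorded.
type toyBuf struct {
	out     []byte
	headers int
}

type writer func(v reflect.Value, b *toyBuf)

// writeItems appends each element as a single byte. This sketch only
// handles values below 0x80, which RLP encodes as the byte itself.
func writeItems(v reflect.Value, b *toyBuf) {
	for i := 0; i < v.Len(); i++ {
		b.out = append(b.out, byte(v.Index(i).Uint()))
	}
}

// makeSliceWriter picks the writer once, at construction time, so the
// returned closure never re-checks the tail flag while encoding.
func makeSliceWriter(tail bool) writer {
	if tail {
		// Tail slices are written without a list header of their own.
		return writeItems
	}
	return func(v reflect.Value, b *toyBuf) {
		if v.Len() == 0 {
			// Empty list: emit 0xC0 directly, no header entry needed.
			b.out = append(b.out, 0xC0)
			return
		}
		b.headers++
		// Short-list header; valid only for payloads under 56 bytes.
		b.out = append(b.out, 0xC0+byte(v.Len()))
		writeItems(v, b)
	}
}

// encode is a demo helper returning the hex output and header count.
func encode(items []uint8, tail bool) (string, int) {
	var b toyBuf
	makeSliceWriter(tail)(reflect.ValueOf(items), &b)
	return fmt.Sprintf("%x", b.out), b.headers
}

func main() {
	fmt.Println(encode(nil, false))
	fmt.Println(encode([]uint8{1, 2, 3}, false))
	fmt.Println(encode([]uint8{1, 2, 3}, true))
}
```

In this sketch the empty slice produces the single byte 0xC0 and leaves the header count at zero, while the tail variant writes only the items.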

holiman

LGTM

* rlp: pass length to byteArrayBytes

  This makes it possible to inline byteArrayBytes. For arrays, the length is known at encoder construction time, so the call to v.Len() can be avoided.

* rlp: avoid IsNil for pointer encoding

  It's actually cheaper to use Elem first, because it performs fewer checks on the value. If the pointer was nil, the result of Elem is 'invalid'.

* rlp: minor optimizations for slice/array encoding

  For empty slices/arrays, we can avoid storing a list header entry in the encoder buffer. Also avoid doing the tail check at encoding time because it is already known at encoder construction time.

fjl added 4 commits August 25, 2021 14:40

rlp: add benchmark for encoding slice of structs

edfc605

rlp: pass length to byteArrayBytes

7c9a64e

This makes it possible to inline the function, and the length is known at encoder construction time.

rlp: avoid IsNil for pointer encoding

5094ef1

It's actually cheaper to use Elem first, because it performs fewer checks on the value. If the pointer was nil, the result of Elem is 'invalid'.

rlp: minor optimizations for slice/array encoding

8b57ade

For empty slices/arrays, we can avoid storing a list header entry in the encoder buffer. Also avoid doing the tail check at encoding time because it is already known at encoder construction time.

holiman approved these changes Aug 25, 2021


fjl merged commit 32c576b into ethereum:master Aug 25, 2021

fjl added this to the 1.10.9 milestone Aug 25, 2021

gzliudan mentioned this pull request May 15, 2024

upgrade package rlp to 2024-05-15 XinFinOrg/XDPoSChain#542

Merged

19 tasks
