src: make base64 decoding 10-15% faster #2193

bnoordhuis · 2015-07-16T14:49:14Z

Make the inner loop execute fewer compare-and-branch executions per
processed byte, resulting in a 10-15% speedup.

This coincidentally fixes an out-of-bounds read:

while (unbase64(*src) < 0 && src < srcEnd)

Should have read:

while (src < srcEnd && unbase64(*src) < 0)

But this commit removes the offending code altogether.

Fixes: #2166

R=@trevnorris?

CI: https://jenkins-iojs.nodesource.com/view/iojs/job/iojs+any-pr+multi/155/

thefourtheye · 2015-07-16T16:31:24Z

src/string_bytes.cc

@@ -150,62 +150,83 @@ static const int unbase64_table[] =
    -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1,
    -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1
  };
-#define unbase64(x) unbase64_table[(uint8_t)(x)]
+#define unbase64(x)                                                           \
+    static_cast<uint8_t>(unbase64_table[static_cast<uint8_t>(x)])


very minor nit. Is four spaces here, okay?

Should we really cast the result to uint8_t when the actual data is int8_t?

Yes. The alternative was to change the type of the list elements to uint8_t but then I'd also have to change all the -1 to 255 to squelch compiler warnings. It would make the diff a lot noisier for no reason; conversion from signed to unsigned is well-defined.

Is four spaces here, okay?

I don't think we really have a convention for that but I'll change it to two spaces before landing.

bnoordhuis · 2015-07-17T08:45:15Z

Incorporated feedback, PTAL.

@trevnorris Good suggestion about force-flattening the string first. I'm confident saying now that it's actually 50% faster. :-)

trevnorris · 2015-07-17T13:26:31Z

LGTM

ronkorving · 2015-07-21T07:10:52Z

@bnoordhuis then I guess you can rename this PR :) great job!

Make the inner loop execute fewer compare-and-branch executions per processed byte, resulting in a 50% or more speedup. This coincidentally fixes an out-of-bounds read: while (unbase64(*src) < 0 && src < srcEnd) Should have read: while (src < srcEnd && unbase64(*src) < 0) But this commit removes the offending code altogether. Fixes: nodejs#2166 PR-URL: nodejs#2193 Reviewed-By: Trevor Norris <trev.norris@gmail.com>

parallel/test-buffer called `Buffer.prototype.toString()` on a buffer with uninitialized memory. Call `Buffer.prototype.fill()` on it first. PR-URL: nodejs#2193 Reviewed-By: Trevor Norris <trev.norris@gmail.com>

bnoordhuis · 2015-07-25T17:15:05Z

Landed in 8fd3ce1 and ac70bc8 with hex values, thanks everyone.

I only added @trevnorris in the Reviewed-By because he was the only one to formally LGTM it.

YurySolovyov · 2015-07-25T20:45:41Z

Is it going to be in 2.5.0, 3.0 or both?

Fishrock123 · 2015-07-26T05:02:33Z

@YuriSolovyov 2.5.0+ (both) :)

mscdex added the c++ Issues and PRs that require attention from people who are familiar with C++. label Jul 16, 2015

thefourtheye reviewed Jul 16, 2015
View reviewed changes

bnoordhuis force-pushed the optimize-base64-decode branch from 7f8acb6 to a5df468 Compare July 17, 2015 08:42

indutny force-pushed the master branch from bffb204 to eb35968 Compare July 22, 2015 21:21

bnoordhuis added 2 commits July 25, 2015 19:07

test: fix valgrind uninitialized memory warning

ac70bc8

parallel/test-buffer called `Buffer.prototype.toString()` on a buffer with uninitialized memory. Call `Buffer.prototype.fill()` on it first. PR-URL: nodejs#2193 Reviewed-By: Trevor Norris <trev.norris@gmail.com>

bnoordhuis force-pushed the optimize-base64-decode branch from a5df468 to ac70bc8 Compare July 25, 2015 17:13

bnoordhuis closed this Jul 25, 2015

bnoordhuis deleted the optimize-base64-decode branch July 25, 2015 17:13

bnoordhuis merged commit ac70bc8 into nodejs:master Jul 25, 2015

This was referenced Jul 27, 2015

Release Proposal: v2.5.0 #2239

Merged

Release proposal: v3.0.0 #2221

Closed

aqrln mentioned this pull request Apr 4, 2017

buffer: optimize decoding wrapped base64 data #12146

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src: make base64 decoding 10-15% faster #2193

src: make base64 decoding 10-15% faster #2193

bnoordhuis commented Jul 16, 2015

thefourtheye Jul 16, 2015

thefourtheye Jul 16, 2015

bnoordhuis Jul 16, 2015

bnoordhuis Jul 17, 2015

bnoordhuis commented Jul 17, 2015

trevnorris commented Jul 17, 2015

ronkorving commented Jul 21, 2015

bnoordhuis commented Jul 25, 2015

YurySolovyov commented Jul 25, 2015

Fishrock123 commented Jul 26, 2015

src: make base64 decoding 10-15% faster #2193

src: make base64 decoding 10-15% faster #2193

Conversation

bnoordhuis commented Jul 16, 2015

thefourtheye Jul 16, 2015

Choose a reason for hiding this comment

thefourtheye Jul 16, 2015

Choose a reason for hiding this comment

bnoordhuis Jul 16, 2015

Choose a reason for hiding this comment

bnoordhuis Jul 17, 2015

Choose a reason for hiding this comment

bnoordhuis commented Jul 17, 2015

trevnorris commented Jul 17, 2015

ronkorving commented Jul 21, 2015

bnoordhuis commented Jul 25, 2015

YurySolovyov commented Jul 25, 2015

Fishrock123 commented Jul 26, 2015