process: improve nextTick performance #25461

mscdex · 2019-01-12T03:39:21Z

                                               confidence improvement accuracy (*)    (**)   (***)
 process/next-tick-breadth-args.js n=10000000        ***     17.11 %       ±2.80%  ±3.74%  ±4.89%
 process/next-tick-breadth.js n=10000000             ***     40.88 %       ±2.72%  ±3.62%  ±4.71%
 process/next-tick-depth-args.js n=7000000           ***     21.87 %       ±2.18%  ±2.92%  ±3.82%
 process/next-tick-depth.js n=7000000                ***     23.58 %       ±7.60% ±10.12% ±13.17%
 process/next-tick-exec-args.js n=4000000            ***     39.07 %       ±1.88%  ±2.51%  ±3.28%
 process/next-tick-exec.js n=4000000                 ***     44.92 %       ±1.67%  ±2.23%  ±2.91%

Checklist

make -j4 test (UNIX), or vcbuild test (Windows) passes
commit message follows commit guidelines

nodejs-github-bot · 2019-01-12T03:39:22Z

@mscdex sadly an error occured when I tried to trigger a build :(

mscdex · 2019-01-12T03:40:54Z

benchmark/process/next-tick-exec-args.js

@@ -1,7 +1,7 @@
 'use strict';
 const common = require('../common.js');
 const bench = common.createBenchmark(main, {
-  n: [5e6]
+  n: [4e6]


These were changed to match those found in the other breadth benchmarks and it also seems to provide more stable results.

lib/internal/process/next_tick.js

apapirovski

The fact that we’re emitting one object on init and then pushing another on the next tick queue goes against basic expectations of using async hooks and what the resource is supposed to represent.

lib/internal/process/next_tick.js

mscdex · 2019-01-21T17:56:00Z

Since there are some that believe this should be semver-major, ping @nodejs/tsc

mscdex · 2019-01-28T05:43:07Z

ping?

mcollina · 2019-01-28T08:30:02Z

In the context of exposing the current async resource rather than just exposing the asyncId, this change might require to be reverted/changed later. However that work is not settled yet, so we might want to land this anyway, as this code is not really ready for it yet.

As an example, we are pursuing this change: #25094 for that reason.

cc @nodejs/diagnostics

mscdex · 2019-02-05T03:51:16Z

ping @nodejs/tsc once more

Trott · 2019-02-05T04:16:58Z

There probably won't be a meeting this week, but I'm going to throw a tsc-agenda label on this to make sure it doesn't entirely fall off the TSC radar for a third time.

Trott · 2019-02-05T04:17:33Z

(Obviously, if resolution is achieved before the next TSC meeting, that's awesome and I'll be delighted to remove the label at that time.)

mcollina

Thanks for working on nextTick! I've got some questions.

I'm failing to understand why this changes improves performance, and if it may something related to our benchmarks.

Have you tried if moving just the emitInit call outside of the constructor generates the same result?
Have you verified that is using the same object for both emitInit and queue.push() is what is actually creating the performance improvements?

@mscdex have you experiences this improves things in a more realistic scenario including I/O?

Also cc @bmeurer who might provide some insights.

mscdex · 2019-02-08T03:53:22Z

Have you tried if moving just the emitInit call outside of the constructor generates the same result?
Have you verified that is using the same object for both emitInit and queue.push() is what is actually creating the performance improvements?

@mcollina I honestly don't remember now, I tried a lot of variations though at the time, and this was the only one that resulted in a positive improvement with no regressions.

bmeurer · 2019-02-08T13:28:58Z

The change looks reasonable to me (JavaScript wise), but I'm really not a good candidate to review this, since this is not my area of expertise.

mcollina · 2019-02-08T21:17:34Z

@bmeurer have you got a clue on why this is faster than the current one?

bmeurer · 2019-02-08T21:25:06Z

@mcollina Nope, sorry.

apapirovski · 2019-02-20T21:25:29Z

have you got a clue on why this is faster than the current one?

As far as I can tell, this should be mostly related to using symbols which are in my experience slower for both getting and setting properties (and for your version the fact that we use an if condition rather than just running the emitInit function). That said, happy to be corrected.

BridgeAR · 2019-03-05T23:51:00Z

Ping @mscdex
Please check #25461 (comment)

mscdex · 2019-08-24T17:29:32Z

I've made different changes now to avoid the issues with domain. The new benchmark results are:

                                               confidence improvement accuracy (*)    (**)   (***)
 process/next-tick-breadth-args.js n=10000000        ***     17.11 %       ±2.80%  ±3.74%  ±4.89%
 process/next-tick-breadth.js n=10000000             ***     40.88 %       ±2.72%  ±3.62%  ±4.71%
 process/next-tick-depth-args.js n=7000000           ***     21.87 %       ±2.18%  ±2.92%  ±3.82%
 process/next-tick-depth.js n=7000000                ***     23.58 %       ±7.60% ±10.12% ±13.17%
 process/next-tick-exec-args.js n=4000000            ***     39.07 %       ±1.88%  ±2.51%  ±3.28%
 process/next-tick-exec.js n=4000000                 ***     44.92 %       ±1.67%  ±2.23%  ±2.91%

mscdex · 2019-08-25T13:46:45Z

/cc @nodejs/collaborators

mcollina · 2019-08-25T15:37:29Z

@mscdex does it still fail the test in #25461 (comment)? Maybe we should add it to our suite.

mscdex · 2019-08-25T20:20:07Z

@mcollina It should not fail it because the same object is being used now. These changes are now more or less inlining the previous custom TickObject class constructor code.

mcollina · 2019-08-25T21:36:00Z

Would you mind adding that test to this PR? Code LGTM.

mscdex · 2019-08-26T01:20:29Z

@mcollina Honestly I think that's probably best left to a separate issue/PR about whether we should (explicitly) support modifying behavior like that from an async hook callback.

mcollina

LGTM

benjamingr · 2019-08-26T10:28:38Z

lib/internal/process/task_queues.js

-          callback(...tock.args);
+        } else {
+          const args = tock.args;
+          switch (args.length) {


@hashseed @bmeurer why is this still faster? I thought this optimization of specializing the argument length explicitly in the code is no longer required?

Not sure - how much of the improvement is accounted for by this alone (and not the other part of this CL)?

Glancing at e.g. next-tick-depth-args, it doesn't look like the arguments to the callbacks are ever used - maybe we optimize something there. We don't do analysis like that for spread calls

@psmarshall the reason I was concerned is that we removed this optimization from certain parts of the code before.

I think it's possible to construct a microbenchmark where one or the other is faster - I don't think you could measure the difference on a larger application that does a lot of work between ticks. With that in mind my preference is for the spread-call version but I also don't really have the time to do a detailed analysis of what's going on in this specific case so I'm fine either way.

Making `.incRef()` and `.decRef()` fail silently leads to better error messages when trying to access the underlying value (as opposed to crashing inside these methods). Refs: #25461 (comment) PR-URL: #29289 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl> Reviewed-By: Gus Caplan <me@gus.host>

PR-URL: #25461 Reviewed-By: Matteo Collina <matteo.collina@gmail.com> Reviewed-By: Anna Henningsen <anna@addaleax.net>

Making `.incRef()` and `.decRef()` fail silently leads to better error messages when trying to access the underlying value (as opposed to crashing inside these methods). Refs: #25461 (comment) PR-URL: #29289 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl> Reviewed-By: Gus Caplan <me@gus.host>

PR-URL: #25461 Reviewed-By: Matteo Collina <matteo.collina@gmail.com> Reviewed-By: Anna Henningsen <anna@addaleax.net>

Making `.incRef()` and `.decRef()` fail silently leads to better error messages when trying to access the underlying value (as opposed to crashing inside these methods). Refs: #25461 (comment) PR-URL: #29289 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Ben Noordhuis <info@bnoordhuis.nl> Reviewed-By: Gus Caplan <me@gus.host>

mscdex added process Issues and PRs related to the process subsystem. performance Issues and PRs related to the performance of Node.js. labels Jan 12, 2019

mscdex commented Jan 12, 2019

View reviewed changes

BridgeAR approved these changes Jan 12, 2019

View reviewed changes

Fishrock123 reviewed Jan 12, 2019

View reviewed changes

lib/internal/process/next_tick.js Outdated Show resolved Hide resolved

apapirovski previously requested changes Jan 13, 2019

View reviewed changes

lib/internal/process/next_tick.js Outdated Show resolved Hide resolved

richardlau mentioned this pull request Jan 15, 2019

sadly an error occured when I tried to trigger a build :( nodejs/github-bot#210

Closed

BridgeAR added the semver-major PRs that contain breaking changes and should be released in the next major version. label Jan 19, 2019

mscdex added the tsc-review label Feb 5, 2019

Trott added the tsc-agenda Issues and PRs to discuss during the meetings of the TSC. label Feb 5, 2019

mcollina reviewed Feb 8, 2019

View reviewed changes

This was referenced Feb 11, 2019

Node.js Foundation Technical Steering Committee (TSC) Meeting 2019-02-13 nodejs/TSC#664

Closed

Node.js Foundation Technical Steering Committee (TSC) Meeting 2019-02-20 nodejs/TSC#668

Closed

Trott removed the tsc-agenda Issues and PRs to discuss during the meetings of the TSC. label Feb 20, 2019

Trott removed the tsc-review label Mar 10, 2019

apapirovski mentioned this pull request Apr 22, 2019

process: improve nextTick performance #27347

Closed

4 tasks

mscdex force-pushed the process-nexttick-perf branch from 955cdc1 to 9287f10 Compare June 21, 2019 01:41

mscdex requested a review from BridgeAR August 24, 2019 17:28

mscdex removed semver-major PRs that contain breaking changes and should be released in the next major version. tsc-agenda Issues and PRs to discuss during the meetings of the TSC. labels Aug 24, 2019

addaleax approved these changes Aug 25, 2019

View reviewed changes

mcollina approved these changes Aug 26, 2019

View reviewed changes

benjamingr reviewed Aug 26, 2019

View reviewed changes

mscdex closed this Aug 28, 2019

mscdex force-pushed the process-nexttick-perf branch from db72db3 to 34961c7 Compare August 28, 2019 02:15

mscdex merged commit 34961c7 into nodejs:master Aug 28, 2019

mscdex deleted the process-nexttick-perf branch August 28, 2019 02:17

BridgeAR pushed a commit that referenced this pull request Sep 3, 2019

process: improve nextTick performance

527f118

PR-URL: #25461 Reviewed-By: Matteo Collina <matteo.collina@gmail.com> Reviewed-By: Anna Henningsen <anna@addaleax.net>

BridgeAR mentioned this pull request Sep 3, 2019

v12.10.0 proposal #29429

Merged

BridgeAR pushed a commit that referenced this pull request Sep 4, 2019

process: improve nextTick performance

f4f8827

PR-URL: #25461 Reviewed-By: Matteo Collina <matteo.collina@gmail.com> Reviewed-By: Anna Henningsen <anna@addaleax.net>

addaleax added dont-land-on-v10.x labels Sep 7, 2019

addaleax mentioned this pull request Sep 7, 2019

stream performance regression 12.6.0 vs 10.16.0 #28586

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

process: improve nextTick performance #25461

process: improve nextTick performance #25461

mscdex commented Jan 12, 2019 •

edited

Loading

nodejs-github-bot commented Jan 12, 2019

mscdex Jan 12, 2019

apapirovski left a comment

mscdex commented Jan 21, 2019

mscdex commented Jan 28, 2019

mcollina commented Jan 28, 2019

mscdex commented Feb 5, 2019

Trott commented Feb 5, 2019

Trott commented Feb 5, 2019

mcollina left a comment

mscdex commented Feb 8, 2019 •

edited

Loading

bmeurer commented Feb 8, 2019

mcollina commented Feb 8, 2019

bmeurer commented Feb 8, 2019

apapirovski commented Feb 20, 2019

BridgeAR commented Mar 5, 2019

mscdex commented Aug 24, 2019

mscdex commented Aug 25, 2019

mcollina commented Aug 25, 2019

mscdex commented Aug 25, 2019

mcollina commented Aug 25, 2019

mscdex commented Aug 26, 2019

mcollina left a comment

benjamingr Aug 26, 2019

psmarshall Aug 26, 2019

benjamingr Aug 28, 2019

psmarshall Aug 29, 2019

process: improve nextTick performance #25461

process: improve nextTick performance #25461

Conversation

mscdex commented Jan 12, 2019 • edited Loading

Checklist

nodejs-github-bot commented Jan 12, 2019

mscdex Jan 12, 2019

Choose a reason for hiding this comment

apapirovski left a comment

Choose a reason for hiding this comment

mscdex commented Jan 21, 2019

mscdex commented Jan 28, 2019

mcollina commented Jan 28, 2019

mscdex commented Feb 5, 2019

Trott commented Feb 5, 2019

Trott commented Feb 5, 2019

mcollina left a comment

Choose a reason for hiding this comment

mscdex commented Feb 8, 2019 • edited Loading

bmeurer commented Feb 8, 2019

mcollina commented Feb 8, 2019

bmeurer commented Feb 8, 2019

apapirovski commented Feb 20, 2019

BridgeAR commented Mar 5, 2019

mscdex commented Aug 24, 2019

mscdex commented Aug 25, 2019

mcollina commented Aug 25, 2019

mscdex commented Aug 25, 2019

mcollina commented Aug 25, 2019

mscdex commented Aug 26, 2019

mcollina left a comment

Choose a reason for hiding this comment

benjamingr Aug 26, 2019

Choose a reason for hiding this comment

psmarshall Aug 26, 2019

Choose a reason for hiding this comment

benjamingr Aug 28, 2019

Choose a reason for hiding this comment

psmarshall Aug 29, 2019

Choose a reason for hiding this comment

mscdex commented Jan 12, 2019 •

edited

Loading

mscdex commented Feb 8, 2019 •

edited

Loading