Speeding up v8 heap snapshots #702

jdapena · 2023-06-21T08:41:13Z

Add "Speeding up V8 heap snapshots" blog post.

* Add "Speeding up V8 heap snapshots" blog post.

src/blog/speeding-up-v8-heap-snapshots.md

robpalme · 2023-06-21T20:25:26Z

My email address has now been registered as CLA compliant via my employer. How do we make the CLA bot reassess?

Co-authored-by: Rob Palmer <rpalmer57@bloomberg.net>

jdapena · 2023-06-22T14:27:56Z

CLA is ready now.

I cannot approve running the workflow (nor assign anybody to review the patch).

robpalme · 2023-06-22T14:50:04Z

@syg are you able to run the workflow?

src/blog/speeding-up-v8-heap-snapshots.md

Several fixes in accuracy, especially related to NODE_OPTIONS processing, coming from the code review. Also some writing improvements. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

When we tried to make the post less Node.js specific, we went a bit too far, as the parts related to --heapsnapshot-near-heap-limit are indeed Node.js specific. Provided more clarity on that.

…ation Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

The level of detail in the source position caching fix explanation was quite poor, not even explaining why the source line position was not cached. Expand it after a proposal from Joyee Cheung.

In the previous version, we talked about "unoptimized development JS" and "optimized production JS". As Joyee Cheung pointed out, that could be misleading, because it could lead readers to think of JS runtime optimizations. This change removes the optimized/unoptimized JS wording there and instead details later how production source code is optimized using bundling and minification.

In the original description, we wrongly stated that the heap limit is set by V8 to 1400MB. It was actually an application-specific limit. Also clarify that, in the test case, hitting Out-Of-Memory would hint that there was a leak. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

jdapena · 2023-07-03T15:20:52Z

@syg < Apparently I cannot add you or any other reviewer to the ticket. Who should review blog posts?

joyeecheung

A bunch of wording suggestions and some remaining questions, but overall looks good, thanks

src/blog/speeding-up-v8-heap-snapshots.md

src/_img/speeding-up-v8-heap-snapshots/59293b17-6d52-4d9e-8737-5b23d038d50a.png

src/blog/speeding-up-v8-heap-snapshots.md

syg · 2023-07-10T22:39:22Z

src/blog/speeding-up-v8-heap-snapshots.md

+- Once we had strings with a hash key value in lower positions, then the storing of new numbers would offset all the positions.
+- Or even worse: if we would introduce a string with a hash key value that would be a low number, and we already found several hundreds or thousands of consecutive numbers, then that value would be moved several hundreds or thousands of positions.
+
+What did I do to fix it? As the problem comes mostly from numbers represented as strings that would fall in consecutive positions, I modified the hash function so we would rotate the resulting hash value 2 positions to the left. So, for each number, we would introduce 3 free positions. Why 2? Empirical testing across several work-sets showed this number was the best choice to minimize collisions.


Could you perhaps add code snippets demonstrating the improved hash directly? V8 blog audience should be used to bitwise manipulation code.
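
For illustration only, the rotation described in the quoted paragraph could be sketched as below. This is a toy model in Python, not V8's code: `toy_hash`, `spread_hash`, and `slot` are hypothetical names, the hash of a numeric string is assumed to be simply its numeric value, and the table is assumed to pick a slot by masking with a power-of-two capacity.

```python
MASK32 = 0xFFFFFFFF  # keep every value within 32 bits


def rotate_left_32(value: int, shift: int) -> int:
    """Rotate a 32-bit value left by `shift` bits."""
    value &= MASK32
    shift %= 32
    return ((value << shift) | (value >> (32 - shift))) & MASK32


def toy_hash(number: int) -> int:
    # Assumption for this sketch: a numeric string hashes to its value,
    # so consecutive numbers produce consecutive hashes.
    return number & MASK32


def spread_hash(number: int) -> int:
    # Rotating left by 2 multiplies small hashes by 4, which leaves
    # 3 unused values between the hashes of consecutive numbers.
    return rotate_left_32(toy_hash(number), 2)


def slot(hash_value: int, capacity: int) -> int:
    # Assumption: capacity is a power of two, so masking picks the slot.
    return hash_value & (capacity - 1)


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, slot(toy_hash(n), 64), slot(spread_hash(n), 64))
```

Under these assumptions, the numbers 1 to 4 occupy slots 1, 2, 3, 4 without the rotation and slots 4, 8, 12, 16 with it, so a colliding entry has free positions nearby instead of pushing a long run of consecutive entries.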

src/blog/speeding-up-v8-heap-snapshots.md

syg · 2023-07-10T22:57:24Z

src/blog/speeding-up-v8-heap-snapshots.md

+
+## Narrowing down the Problem
+
+Jason Williams started investigating the issue using some V8 parameters. As described in the previous post, V8 has some nice command line parameters that can help with that. These options were used to curate the heap snapshots , simplify the reproduction, and improve observability:


Re-reading this section after reading the entire post, I'm unclear on the point of listing the flags in detail. It seems almost incidental: here are the flags that we were using to capture heap snapshots, and where we observed surprising slowdowns. Could this section be shortened? Perhaps I'm missing the intention though?

Initially the problem was detected just by extracting regular heap snapshots from DevTools.

But using these command line arguments allowed us to get a detailed breakdown of what was happening, and also to reproduce the heap snapshot performance problem repeatedly, without requiring a remote DevTools connection.

So this is part of the investigation steps (increasing and simplifying reproducibility).

In general, the intent of the whole post is to explain not only the specific fixes, but also the whole investigation procedure that led to them.

Many insightful improvements to the writing of the heap snapshot optimizations blog post, which can be landed all together. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

"And old friend" reference was for the ETW fix that I presented in a different blog post that is not even linked. And it is redundant as later I explain that this issue is similar to another one fixed in ETW. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

The images in the v8 heap snapshot optimizations post were much too big, with lots of irrelevant empty space that made the contents harder to read. After cropping, the images are now more focused on the information that matters.

Co-authored-by: Shu-yu Guo <syg@chromium.org>

Manually apply suggestions that could not be applied directly from the GH UI.

@syg

…vestigation Following @syg's recommendation, avoid explicit references to engineers while explaining the procedure used to investigate the performance problem.

V8 team is not sponsored. Work is actually done by engineers of different companies. Rewrite the credits section accordingly.

jdapena · 2023-07-13T12:01:22Z

Provided a new version of the explanation of the hash algorithm changes, with value examples, and snippets of the original and the new hash functions.

src/blog/speeding-up-v8-heap-snapshots.md

Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

…previous post Originally the blog post was part of a series, until it was proposed as an independent post on v8.dev. Make the reference to the previous post independent of that fact by linking to the previous post.

…mplified writing Several changes to reduce redundancies and simplify reading the post:
- Remove step-by-step guide of using Windows Performance Recording, referring to upstream documentation now.
- Snippets are now almost C++ code instead of pseudocode.
- Some typos.
- Simplified line break caching explanation, and removed reference to equivalent ETW fix.

syg

Looks pretty good % remaining comments! Thanks for the many rounds of iterations.

syg · 2023-07-18T21:01:41Z

src/blog/speeding-up-v8-heap-snapshots.md

+
+Furthermore, we found the problem happened on both Windows and Linux. The problem was also not platform-specific.
+
+## Windows Performance Analyzer to the rescue


I'm not sure what this section adds to the article.

src/blog/speeding-up-v8-heap-snapshots.md

Co-authored-by: Shu-yu Guo <syg@chromium.org>

jdapena · 2023-07-19T15:07:46Z

Applied changes. Only pending running the workflow and merging.

I'd prefer to keep the Windows Performance Analyzer section, which is now really small, as it still gives readers a hint about the tool used for the investigation.

syg · 2023-07-19T17:57:42Z

I'd prefer to keep the Windows Performance Analyzer section, which is now really small, as it still gives readers a hint about the tool used for the investigation.

I still think something as short as "I used Windows Performance Toolkit to do the profiling" suffices. I feel like that workflow (1) isn't generally applicable and (2) is out of character for the V8 blog. V8 blog posts tend to be about highlighting improvements to the infrastructure, or VM guts, or language hacks, not play-by-play accounts of how we achieved the results.

jdapena · 2023-07-26T14:38:03Z

@syg < I finally removed the WPA section, and now just mention that it was the tool used (which is visible in the screenshots of the reports). I cannot apply the resulting patch myself now that it is approved.

BTW, thanks everybody for all the careful review. Post is now far better!

joyeecheung · 2023-07-26T17:02:11Z

src/blog/speeding-up-v8-heap-snapshots.md

+
+Normally, after V8 finishes calculating the offsets of line breaks in a script, it caches them in a newly allocated array attached to the script. Unfortunately, the snapshot implementation cannot modify the heap when traversing it, so the newly calculated line information cannot be cached.
+
+The solution? Before generating the heap snapshot, we now iterate over all the scripts in the V8 context to compute and cache the offsets of the line breaks. As this is not done when we traverse the heap for heap snapshot generation, it is still possible to modify the heap and store the source line positions as a cache.


Suggested change

The solution? Before generating the heap snapshot, we now iterate over all the scripts in the V8 context to compute and cache the offsets of the line breaks. As this is not done when we traverse the heap for heap snapshot generation, it is still possible to modify the heap and store the source line positions as a cache.

The solution? Before generating the heap snapshot, we now iterate over all the scripts in the V8 context to compute and cache the offsets of the line breaks. As this is done when we traverse the heap for heap snapshot generation, it is still possible to modify the heap and store the source line positions as a cache.

The original sentence was OK. When we traverse the heap for heap snapshot generation we cannot add entries to the heap. That's the reason we do that before.

Ah, yes, I think I misread 👍
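
The ordering discussed in this thread (fill the cache first, then traverse without touching the heap) can be sketched roughly as follows. This is a simplified Python model, not V8's implementation: `Script`, `compute_line_ends`, `precompute_line_ends`, and `line_number` are hypothetical names chosen for the example.

```python
from bisect import bisect_left
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Script:
    source: str
    # Cache of line-end offsets; None means "not computed yet".
    line_ends: Optional[List[int]] = None


def compute_line_ends(source: str) -> List[int]:
    """Return the offset of every newline, plus the end of the source."""
    ends = [i for i, ch in enumerate(source) if ch == "\n"]
    ends.append(len(source))
    return ends


def precompute_line_ends(scripts: List[Script]) -> None:
    # Phase 1: runs BEFORE the traversal, while writing is still allowed.
    for script in scripts:
        if script.line_ends is None:
            script.line_ends = compute_line_ends(script.source)


def line_number(script: Script, position: int) -> int:
    # Phase 2: read-only lookup during the traversal. It never writes,
    # so it relies on phase 1 having filled the cache.
    assert script.line_ends is not None, "cache must be filled beforehand"
    return bisect_left(script.line_ends, position)  # 0-based line index


if __name__ == "__main__":
    scripts = [Script("a\nbb\nccc")]
    precompute_line_ends(scripts)
    print(line_number(scripts[0], 5))
```

The point of the split is that the lookup in the second phase is a binary search over an existing array, rather than a rescan of the whole source for every position.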

syg

lgtm, thanks for the many rounds

syg · 2023-07-27T18:51:58Z

Any last concerns before I publish? @joyeecheung?

joyeecheung

LGTM, thanks

Speeding up v8 heap snapshots

7d91e99

* Add "Speeding up V8 heap snapshots" blog post.

robpalme reviewed Jun 21, 2023


Apply suggestions from Rob Palmer

485d04f

Co-authored-by: Rob Palmer <rpalmer57@bloomberg.net>

jdapena force-pushed the jdapena/speeding-up-v8-heap-snapshots branch from cf33dbe to 485d04f Compare June 22, 2023 14:25

Fix lint issues

3dbf61b

joyeecheung reviewed Jun 26, 2023


jdapena and others added 7 commits June 27, 2023 10:50

Apply suggestions from Joyee Cheung

b6860f4

Several fixes in accuracy, especially related to NODE_OPTIONS processing, coming from the code review. Also some writing improvements. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

More clarifications of Node.js specific parts

77c2744

When we tried to make the post less Node.js specific, we went a bit too far, as the parts related to --heapsnapshot-near-heap-limit are indeed Node.js specific. Provided more clarity on that.

Added Joyee Cheung to the initial paragraph list of contributors

9eac65c

More accurate description of --heapsnapshot-near-heap-limit implement…

53d996a

…ation Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

More details of source position caching fix

a0518ff

The level of detail in the source position caching fix explanation was quite poor, not even explaining why the source line position was not cached. Expand it after a proposal from Joyee Cheung.

joyeecheung reviewed Jul 10, 2023


syg reviewed Jul 10, 2023


jdapena and others added 8 commits July 12, 2023 11:34

Apply suggestions from Joyee code review

91525cd

Many insightful improvements to the writing of the heap snapshot optimizations blog post, which can be landed all together. Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

Crop images to improve readability

1689958

The images in the v8 heap snapshot optimizations post were much too big, with lots of irrelevant empty space that made the contents harder to read. After cropping, the images are now more focused on the information that matters.

Apply suggestions from @syg code review

3a1cdd9

Co-authored-by: Shu-yu Guo <syg@chromium.org>

Apply suggestions from @syg in pull request.

91dc5a8

Manually apply suggestions that could not be applied directly from the GH UI.

Added @syg to the list of contributors in first paragraph

3899164

Do not use specific names explaining the heap snapshot performance in…

de70d28

…vestigation Following @syg's recommendation, avoid explicit references to engineers while explaining the procedure used to investigate the performance problem.

Remove sponsorship reference in heap optimizations post

3811c74

V8 team is not sponsored. Work is actually done by engineers of different companies. Rewrite the credits section accordingly.

jdapena force-pushed the jdapena/speeding-up-v8-heap-snapshots branch from df811e8 to 3811c74 Compare July 12, 2023 09:34

Provided function snippet and value examples of new Hash function

c89f34f

joyeecheung reviewed Jul 13, 2023


jdapena and others added 4 commits July 18, 2023 13:32

speeding-up-heap-snapshots.md: apply suggestions from Joyee code review

239123a

Co-authored-by: Joyee Cheung <joyeec9h3@gmail.com>

speeding-up-heap-snapshots.md: remove unused image

979a434

syg approved these changes Jul 18, 2023


jdapena and others added 2 commits July 19, 2023 16:53

speeding-up-v8-heap-snapshots.md: apply suggestions from @syg

ca5bf19

Co-authored-by: Shu-yu Guo <syg@chromium.org>

speeding-up-v8-heap-snapshots.md: manually apply @syg proposed changes

bdd5b96

jdapena added 2 commits July 26, 2023 16:28

speeding-up-v8-heap-snapshots.md: update publication date

8f083a8

speeding-up-v8-heap-snapshots.md: drop WPA session

b065c43

jdapena force-pushed the jdapena/speeding-up-v8-heap-snapshots branch from d930645 to b065c43 Compare July 26, 2023 14:33

joyeecheung approved these changes Jul 26, 2023


syg approved these changes Jul 27, 2023


joyeecheung approved these changes Jul 27, 2023


syg merged commit a4956af into v8:main Jul 27, 2023



