-
-
Notifications
You must be signed in to change notification settings - Fork 30.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mark all objects reachable from roots as live before doing main cyclic GC pass #126491
Comments
Is that benchmark public, link isn't working for me? |
Fixed the link |
FYI, I made an incremental "mark alive" version of my idea. My PR is based off of 3.13 but it could be done on top of the "incremental gc" that's in the "main" branch as well. This approach avoids a problem with the current incremental GC: if you happen to pull in an object near the root of the object graph, the GC pause could be long because all reachable objects are pull in to the cyclic GC pass. |
…ollection (GH-126502) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Remove lazy dict tracking * Update docs * Clearer calculation of work to do.
… doing cycle collection (pythonGH-126502)" This reverts commit b0fcc2c.
…ollection (GH-127110) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Update docs * Clearer calculation of work to do.
@markshannon Out of curiosity:
|
… doing cycle collection (pythonGH-127110)" This reverts commit a8dd821.
Python doesn't have a precise set of roots. For example, an extension module can create objects but not have them connected to any roots that the CPython runtime knows about. |
…e doing cycle collection (pythonGH-127110)" This reverts commit 2923163.
… doing cycle collection (pythonGH-127110)" This reverts commit a8dd821.
* Faster marking of reachable objects * Changes calculation of work to do and work done. * Merges transitive closure calculations
…ythonGH-127519)" This reverts commit 023b7d2, which introduced a refleak.
I've reverted the last change, which introduced a refleak. |
…ycle collection (pythonGH-127110) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Update docs * Clearer calculation of work to do.
…127519) * Faster marking of reachable objects * Changes calculation of work to do and work done. * Merges transitive closure calculations
…faster marking (pythonGH-127519)" (pythonGH-127770) Revert "pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)" This reverts commit 023b7d2, which introduced a refleak.
…ycle collection (pythonGH-126502) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Remove lazy dict tracking * Update docs * Clearer calculation of work to do.
… doing cycle collection (pythonGH-126502)" (python#126983)
…ycle collection (pythonGH-127110) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Update docs * Clearer calculation of work to do.
Objects can only be cyclic garbage if they are not reachable.
So, if we can cheaply identify the majority of reachable objects before performing the (relatively slow) cycle detecting pass, we can save a lot of time.
Performing a transitive closure of all objects reachable from global roots (the sys and builtins modules as well as builtin class's dicts and sublasses) plus a transitive closure of all objects reachable from the stacks can eliminate >90% of all objects relatively cheaply.
Initial experiments show a ~3% speedup, with an almost 50% speedup of the most gc-heavy benchmark.
This idea has been proposed a few times.
@nascheme has definitely suggested it before. Perhaps he can add links to any prior discussion and/or experiments?
What makes this more feasible now is that the GC can see the evaluation stack of frames, thanks to #124392, so we would now expect that the vast majority of reachable objects can be cheaply marked, thus improving the efficiency of cycle detection considerably.
Linked PRs
The text was updated successfully, but these errors were encountered: