Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mark all objects reachable from roots as live before doing main cyclic GC pass #126491

Open
markshannon opened this issue Nov 6, 2024 · 6 comments
Labels
interpreter-core (Objects, Python, Grammar, and Parser dirs) performance Performance or resource usage type-feature A feature request or enhancement

Comments

@markshannon
Copy link
Member

markshannon commented Nov 6, 2024

Objects can only be cyclic garbage if they are not reachable.
So, if we can cheaply identify the majority of reachable objects before performing the (relatively slow) cycle detecting pass, we can save a lot of time.

Performing a transitive closure of all objects reachable from global roots (the sys and builtins modules as well as builtin class's dicts and sublasses) plus a transitive closure of all objects reachable from the stacks can eliminate >90% of all objects relatively cheaply.

Initial experiments show a ~3% speedup, with an almost 50% speedup of the most gc-heavy benchmark.

This idea has been proposed a few times.
@nascheme has definitely suggested it before. Perhaps he can add links to any prior discussion and/or experiments?

What makes this more feasible now is that the GC can see the evaluation stack of frames, thanks to #124392, so we would now expect that the vast majority of reachable objects can be cheaply marked, thus improving the efficiency of cycle detection considerably.

Linked PRs

@markshannon markshannon added performance Performance or resource usage interpreter-core (Objects, Python, Grammar, and Parser dirs) 3.14 new features, bugs and security fixes labels Nov 6, 2024
@kumaraditya303
Copy link
Contributor

Is that benchmark public, link isn't working for me?

@markshannon
Copy link
Member Author

Fixed the link

@nascheme
Copy link
Member

nascheme commented Nov 6, 2024

FYI, I made an incremental "mark alive" version of my idea. My PR is based off of 3.13 but it could be done on top of the "incremental gc" that's in the "main" branch as well. This approach avoids a problem with the current incremental GC: if you happen to pull in an object near the root of the object graph, the GC pause could be long because all reachable objects are pull in to the cyclic GC pass.

#126511

markshannon added a commit that referenced this issue Nov 18, 2024
…ollection (GH-126502)

* Mark almost all reachable objects before doing collection phase

* Add stats for objects marked

* Visit new frames before each increment

* Remove lazy dict tracking

* Update docs

* Clearer calculation of work to do.
hugovk added a commit to hugovk/cpython that referenced this issue Nov 18, 2024
markshannon added a commit that referenced this issue Dec 2, 2024
…ollection (GH-127110)

* Mark almost all reachable objects before doing collection phase

* Add stats for objects marked

* Visit new frames before each increment

* Update docs

* Clearer calculation of work to do.
@eendebakpt
Copy link
Contributor

@markshannon Out of curiosity:

  • Which kind of objects are not reachable by the transitive closure of the global roots + stack, but are reachable in some other way so they are not collected by the GC?
  • Are there any statistics on the ratio of number of objects (or number of cycles) that iskept by the GC to the number that is deallocated by the GC?

@picnixz picnixz added type-feature A feature request or enhancement and removed 3.14 new features, bugs and security fixes labels Dec 3, 2024
mpage added a commit to mpage/cpython that referenced this issue Dec 3, 2024
@nascheme
Copy link
Member

nascheme commented Dec 3, 2024

Python doesn't have a precise set of roots. For example, an extension module can create objects but not have them connected to any roots that the CPython runtime knows about.

mpage added a commit to mpage/cpython that referenced this issue Dec 4, 2024
mpage added a commit to mpage/cpython that referenced this issue Dec 4, 2024
markshannon added a commit that referenced this issue Dec 6, 2024
* Faster marking of reachable objects

* Changes calculation of work to do and work done.

* Merges transitive closure calculations
encukou added a commit to encukou/cpython that referenced this issue Dec 9, 2024
encukou added a commit that referenced this issue Dec 10, 2024
…ng (GH-127519)" (GH-127770)

Revert "GH-126491: Lower heap size limit with faster marking (GH-127519)"

This reverts commit 023b7d2, which introduced
a refleak.
@encukou
Copy link
Member

encukou commented Dec 10, 2024

I've reverted the last change, which introduced a refleak.

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this issue Jan 8, 2025
…ycle collection (pythonGH-127110)

* Mark almost all reachable objects before doing collection phase

* Add stats for objects marked

* Visit new frames before each increment

* Update docs

* Clearer calculation of work to do.
srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this issue Jan 8, 2025
…127519)

* Faster marking of reachable objects

* Changes calculation of work to do and work done.

* Merges transitive closure calculations
srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this issue Jan 8, 2025
…faster marking (pythonGH-127519)" (pythonGH-127770)

Revert "pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)"

This reverts commit 023b7d2, which introduced
a refleak.
ebonnal pushed a commit to ebonnal/cpython that referenced this issue Jan 12, 2025
…ycle collection (pythonGH-126502)

* Mark almost all reachable objects before doing collection phase

* Add stats for objects marked

* Visit new frames before each increment

* Remove lazy dict tracking

* Update docs

* Clearer calculation of work to do.
ebonnal pushed a commit to ebonnal/cpython that referenced this issue Jan 12, 2025
ebonnal pushed a commit to ebonnal/cpython that referenced this issue Jan 12, 2025
…ycle collection (pythonGH-127110)

* Mark almost all reachable objects before doing collection phase

* Add stats for objects marked

* Visit new frames before each increment

* Update docs

* Clearer calculation of work to do.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
interpreter-core (Objects, Python, Grammar, and Parser dirs) performance Performance or resource usage type-feature A feature request or enhancement
Projects
None yet
Development

No branches or pull requests

6 participants