gh-128807: Add marking phase for free-threaded cyclic GC #128808

nascheme · 2025-01-14T01:06:44Z

This is conceptually similar to the marking phase that was added to the non-free-threaded GC. Start with a set of known root objects, such as sysdict, and mark every object reachable from them (as revealed by each object's tp_traverse method) as "alive". Anything marked alive cannot be garbage, so it can be excluded from the regular cyclic GC process. For most programs, this saves a moderate amount of computation because the marking pass costs less per object than the regular cyclic GC pass.
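
As a rough illustration of the idea (not the C implementation in Python/gc_free_threading.c), here is a minimal Python sketch of marking from roots. The `Node` class, its `refs` list (standing in for what tp_traverse would visit) and `build_example()` are hypothetical names made up for this example.

```python
class Node:
    """Toy heap object; refs stands in for what tp_traverse would visit."""
    def __init__(self, name):
        self.name = name
        self.refs = []


def mark_alive(roots):
    """Return the names of all nodes reachable from the given roots."""
    alive = set()
    stack = list(roots)
    while stack:
        node = stack.pop()
        if node.name in alive:
            continue  # already marked, do not traverse again
        alive.add(node.name)
        stack.extend(node.refs)
    return alive


def build_example():
    sysdict, module, func = Node("sysdict"), Node("module"), Node("func")
    sysdict.refs.append(module)
    module.refs.append(func)
    func.refs.append(module)      # a cycle, but reachable from a root
    a, b = Node("a"), Node("b")
    a.refs.append(b)
    b.refs.append(a)              # a cycle that no root can reach
    return [sysdict], [sysdict, module, func, a, b]


roots, heap = build_example()
alive = mark_alive(roots)
# Only objects that were not marked need the regular cyclic GC pass.
candidates = [n.name for n in heap if n.name not in alive]
print(sorted(alive), candidates)
```

In this toy model only `a` and `b` are left for the more expensive cycle detection; everything reachable from `sysdict` is skipped.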

If gc.freeze() is used, this marking phase is unlikely to be a win, since the majority of objects are expected to be frozen. The marking phase is therefore disabled once freeze has been used.
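
For reference, this is the Python-level API in question. The sketch below only shows `gc.freeze()`, `gc.get_freeze_count()` and `gc.unfreeze()` from the standard `gc` module; the decision to skip the marking phase happens inside the collector and is not visible from Python. The helper name `frozen_after_freeze` is made up for the example.

```python
import gc


def frozen_after_freeze():
    """Freeze the current heap, report how many objects were frozen, then undo."""
    gc.freeze()                      # ignore all currently tracked objects in future collections
    count = gc.get_freeze_count()    # number of frozen objects
    gc.unfreeze()                    # make them collectable again
    return count


frozen = frozen_after_freeze()
print(f"objects frozen: {frozen}")
```

In a typical process nearly every tracked object ends up frozen at this point, which is why a mark-alive pass over the roots would have little left to exclude.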

See gh-126491 for the non-free-threaded version of this technique.

pyperformance results vs merge base. I suspect the slowdown on some benchmarks is not real. For example, regex_v8 should not be slower.

Here are the pyperformance results for a bare-metal AMD Ryzen machine. It does not show a slowdown on regex_v8, for example.

To better show the expected improvement, I ran a "sphinx build" benchmark, like in gh-124567. Results are:

| Metric | old | new |
|---|---|---|
| GC collections | 38 | 38 |
| long-lived objects | 586,986 | 585,730 |
| total run time | 3.24 s | 3.03 s |
| mark phase time | 0.00 s | 0.16 s |
| total gc time | 0.72 s | 0.57 s |
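
The numbers above came from temporary instrumentation that is not part of the final PR. As a rough way to get a comparable "GC collections" and "total gc time" figure from Python, one could hook `gc.callbacks`; this is a sketch under that assumption, with a made-up `GCTimer` class and a placeholder workload, and it cannot separate out the mark phase time.

```python
import gc
import time


class GCTimer:
    """Accumulate wall-clock time spent between GC 'start' and 'stop' events."""

    def __init__(self):
        self.collections = 0
        self.total = 0.0
        self._start = None

    def __call__(self, phase, info):
        if phase == "start":
            self._start = time.perf_counter()
        elif phase == "stop" and self._start is not None:
            self.total += time.perf_counter() - self._start
            self.collections += 1
            self._start = None


timer = GCTimer()
gc.callbacks.append(timer)
try:
    # Placeholder workload: create cyclic garbage, then force a collection.
    for _ in range(1000):
        a = []
        a.append(a)
    gc.collect()
finally:
    gc.callbacks.remove(timer)

print(f"{timer.collections} collections, {timer.total:.4f} s in GC")
```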

Issue: Add marking phase to free-threaded cyclic GC #128807

cpython-cla-bot · 2025-01-14T01:06:48Z

All commit authors signed the Contributor License Agreement.

colesbury

Looks great! A few comments below

colesbury

LGTM

wip: gc mark alive for nogil

a111bff

bedevere-app bot mentioned this pull request Jan 14, 2025

Add marking phase to free-threaded cyclic GC #128807

Closed

nascheme added the topic-free-threading and performance labels Jan 14, 2025

nascheme added 14 commits January 13, 2025 17:09

wip: mark more roots

200b69b

wip: log gc timing to temp file

713689a

wip: mark stack refs as alive

2635fe0

wip: include # of collected in debug log

72128e2

wip: move freeze_used to gc state

942f77d

wip: fix OOM error handling, add ifdef toggles

a3089e5

wip: fix reversion on mark_heap_visitor()

4f99de6

If the object is already marked as reachable, we shouldn't traverse it again.

wip: revert changes to gc_should_collect()

4f6e539

These are not strictly needed, simplify PR.

wip: removing timing code

b99f2df

wip: code cleanup, small bug fix

5f6ab4c

More code cleanup (better names, comments, simpler error handling). Fix a bug: the "alive" bit must be checked in mark_alive_stack_push() to avoid revisiting objects that were already visited.
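
To show why the check at push time matters, here is a small Python model (not the C code). `count_pushes` and the example graph are hypothetical; setting the bit inside `push` is an assumption of this sketch. Each node references the next one twice, so shared children are common.

```python
def count_pushes(graph, root, check_on_push):
    """Walk graph from root and return how many times a node gets pushed.

    graph maps a node to the list of nodes it references. With
    check_on_push=False the walk only terminates on acyclic graphs.
    """
    alive = set()
    stack = []
    pushes = 0

    def push(node):
        nonlocal pushes
        if check_on_push:
            if node in alive:
                return           # already visited: do not queue it again
            alive.add(node)      # set the "alive" bit when the node is queued
        stack.append(node)
        pushes += 1

    push(root)
    while stack:
        node = stack.pop()
        for child in graph[node]:
            push(child)
    return pushes


depth = 10
graph = {i: [i + 1, i + 1] for i in range(depth)}
graph[depth] = []

with_check = count_pushes(graph, 0, check_on_push=True)
without_check = count_pushes(graph, 0, check_on_push=False)
print(with_check, without_check)
```

With the check, each of the 11 nodes is pushed once; without it, the number of pushes doubles at every level.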

wip: untrack tuples in "mark alive" pass

bd46b5f

Make sure we still do this optimization. There is also a unit test that checks for this.
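
The optimization in question can be observed from Python with `gc.is_tracked()`. This is a sketch of the expected behavior on CPython as I understand it; untracking is an implementation detail, so treat the result as an assumption rather than a guarantee.

```python
import gc

atomic = tuple([1, 2, 3])      # built at run time, holds only ints
holds_list = tuple([1, []])    # holds a container, so it could be part of a cycle

gc.collect()

# A tuple of atomic values can never be in a reference cycle, so the
# collector can stop tracking it; the second tuple has to stay tracked.
print(gc.is_tracked(atomic), gc.is_tracked(holds_list))
```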

wip: enable stacks and extra roots

d25cb4a

wip: add helper gc_maybe_untrack()

347db45

Reduces duplicate code.

Add NEWS entry.

680f80f

nascheme force-pushed the nogil-gc-mark-alive branch from ccc5a11 to 680f80f January 14, 2025 01:10

nascheme added 2 commits January 13, 2025 17:22

wip: spelling fixes, minor code cleanup

0c173c9

Merge branch 'origin/main' into nogil-gc-mark-alive

6654424

nascheme marked this pull request as ready for review January 14, 2025 18:12

bedevere-app bot added the awaiting core review label Jan 14, 2025

colesbury reviewed Jan 14, 2025

nascheme added 3 commits January 14, 2025 14:45

wip: use -1 for error convention for functions

79ea47e

wip: improve error handling for OOM

0090365

Use the gc_abort_mark_alive() helper in case of OOM. In addition to freeing the stack, we need to ensure that no object is left with the alive bit set. This also adds missing error handling for the case where propagate_alive_bits() fails.
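
The cleanup described here can be modelled in Python as follows. This is only a sketch: `Obj`, `StackFull`, `max_stack` and the Python functions are hypothetical stand-ins, with a bounded stack simulating a failed allocation. The -1 return value follows the error convention mentioned in the earlier commit.

```python
class Obj:
    def __init__(self, name):
        self.name = name
        self.refs = []
        self.alive = False


class StackFull(Exception):
    """Stands in for a failed allocation while growing the mark stack."""


def abort_mark_alive(heap, stack):
    """Free the stack and make sure no object keeps a stale alive bit."""
    stack.clear()
    for obj in heap:
        obj.alive = False


def push(stack, obj, max_stack):
    if obj.alive:
        return
    if len(stack) >= max_stack:
        raise StackFull
    obj.alive = True
    stack.append(obj)


def mark_alive(heap, roots, max_stack):
    """Return 0 on success, -1 if marking had to be abandoned."""
    stack = []
    try:
        for root in roots:
            push(stack, root, max_stack)
        while stack:
            obj = stack.pop()
            for child in obj.refs:
                push(stack, child, max_stack)
    except StackFull:
        abort_mark_alive(heap, stack)
        return -1
    return 0


root = Obj("root")
children = [Obj(f"c{i}") for i in range(3)]
root.refs.extend(children)
heap = [root, *children]

ok = mark_alive(heap, [root], max_stack=8)
marked_ok = [o.name for o in heap if o.alive]
abort_mark_alive(heap, [])            # reset bits between the two runs
failed = mark_alive(heap, [root], max_stack=2)
marked_failed = [o.name for o in heap if o.alive]
print(ok, marked_ok, failed, marked_failed)
```

If the bits were left set after a failed pass, the regular collector would treat a partially marked heap as if the marking had completed, which is why the abort path clears them all.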

Add comment about gc.freeze() disabling marking.

6100691

kumaraditya303 reviewed Jan 15, 2025

colesbury approved these changes Jan 15, 2025

bedevere-app bot added awaiting merge and removed awaiting core review labels Jan 15, 2025

nascheme merged commit 080f444 into python:main Jan 15, 2025
45 checks passed

bedevere-app bot removed the awaiting merge label Jan 15, 2025
