Reduce allocations #855

kmod · 2015-08-21T08:35:51Z

Spent some time looking into it since GC time is showing up pretty high on the django benchmarks.

This PR reduces django_template3 bytes allocated by about 20%, leading to about a 3% improvement on that benchmark, and about 2% overall (geomean).

undingen · 2015-08-21T08:57:44Z

The speedup it pretty amazing ;-)
I too forgot to add the destructor to the set class. Do you want to add it or should I create a PR?

kmod · 2015-08-21T09:13:56Z

ok, I'll update the pr to fix set too :)

Forgot to do this when I switched away from an StlCompatAllocator, so we were leaking memory pretty badly.

- Check for the allocation of empty tuples and just return the singleton - Try to avoid creating the kwargs dict since it might end up being empty - Let unicode-creation special case apply to all argument types - Fix type(obj) to be fast again (got superceded by a different special case) - Do fewer allocations in int()

This is a temporary fix for the fact that "-1" is currently getting parsed as "-(1)", which will cause us to call '(1).__neg__()' with the associated overhead and allocation. It should be useful even after that gets fixed though.

Reduces boxing during the import process

For some reason CPython allocates an extra "item" for generic variable-sized objects, but it looks like it doesn't do that for tuples. We had been doing that, so let's try not doing that and saving 8 bytes per tuple.

kmod · 2015-08-21T10:40:16Z

Hmm spoke a bit too soon; after fixing some bugs the effects are about half what they were at first :/

Reduce allocations

kmod force-pushed the perf5 branch from 3420255 to 2d9e31c Compare August 21, 2015 08:42

kmod force-pushed the perf5 branch from 2d9e31c to 9ce1c3b Compare August 21, 2015 09:16

kmod added 6 commits August 21, 2015 09:16

Fix: free dict+set internal memory

978974b

Forgot to do this when I switched away from an StlCompatAllocator, so we were leaking memory pretty badly.

Make a couple things not gc-allocated

d7934a4

Add unaryop to our codegen type system

2e409bd

This is a temporary fix for the fact that "-1" is currently getting parsed as "-(1)", which will cause us to call '(1).__neg__()' with the associated overhead and allocation. It should be useful even after that gets fixed though.

Convert parts of the import system to use BoxedStrings

d50f760

Reduces boxing during the import process

Don't allocate an extra tuple element

2b9fd97

For some reason CPython allocates an extra "item" for generic variable-sized objects, but it looks like it doesn't do that for tuples. We had been doing that, so let's try not doing that and saving 8 bytes per tuple.

kmod force-pushed the perf5 branch from 9ce1c3b to 2b9fd97 Compare August 21, 2015 09:16

kmod added a commit that referenced this pull request Aug 21, 2015

Merge pull request #855 from kmod/perf5

432fcb5

Reduce allocations

kmod merged commit 432fcb5 into pyston:master Aug 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce allocations #855

Reduce allocations #855

kmod commented Aug 21, 2015

undingen commented Aug 21, 2015

kmod commented Aug 21, 2015

kmod commented Aug 21, 2015

Reduce allocations #855

Reduce allocations #855

Conversation

kmod commented Aug 21, 2015

undingen commented Aug 21, 2015

kmod commented Aug 21, 2015

kmod commented Aug 21, 2015