ENH: Optimize take_*; improve non-NA fill_value support #2819

stephenwlin · 2013-02-08T05:39:04Z

Optimizes/improves common.take_1d, common.take_2d, common.take_2d_multi, and common.take_fast in the following ways:

- For common cases, an intermediate buffer is no longer required to convert between two different dtypes (i.e. when upcasting is necessary): the data is taken and upcast in one step. These cases are optimized in Cython, and more can be added if desired.
- Better support for fill_value parameters other than NaN: previously, all upcasting assumed NaN as the fill_value; now the correct upcast is chosen based on the type of the fill_value. (E.g. integers stay integers with an integer fill_value if the value fits in the existing size without overflow, integers and floats upcast to complex with a complex fill_value, all numerics upcast to object with a boolean or string fill_value, etc.)
- ~~Added option of fill_value of None, which means that the corresponding entries in the output are left unchanged (only really makes sense when out is also provided)~~

The last option is used to optimize internals._merge_blocks (i.e. the function called internally to do block consolidation): instead of vstack-ing the arrays together in an intermediate buffer and then rearranging into the output with a separate operation, the output buffer is allocated uninitialized and filled directly from the inputs using one call to common.take_fast per input block, even when they are not contiguous. (However, the special case of contiguous and already properly ordered blocks is still handled with vstack, since no other operation is required after that.)

EDIT: took out fill_value of None option and reverted internals._merge_blocks: it wasn't actually helping. Upcasting within take functions is definitely faster though.
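
For readers following along, the two ideas above (choosing the output dtype from the fill_value, then taking and upcasting in a single pass) can be sketched in plain NumPy. This is only an illustration under stated assumptions: `promote_for_fill` and `take_1d_sketch` are hypothetical names, the promotion rules cover just the cases listed in the description, and `-1` in the indexer is assumed to mark a missing entry. It is not the actual `_maybe_promote` or Cython code from this PR.

```python
import numbers

import numpy as np


def promote_for_fill(dtype, fill_value):
    """Hypothetical helper: pick a dtype able to hold the data and fill_value."""
    dtype = np.dtype(dtype)
    if dtype.kind not in "iufc":
        # This sketch only handles numeric input; fall back to object otherwise.
        return np.dtype(object)
    if isinstance(fill_value, (bool, np.bool_, str)):
        # Boolean or string fill: numerics upcast to object.
        return np.dtype(object)
    if isinstance(fill_value, numbers.Integral):
        if dtype.kind in "iu":
            info = np.iinfo(dtype)
            if info.min <= fill_value <= info.max:
                return dtype  # fits without overflow: integers stay integers
            return np.dtype(object)  # widening is left out of this sketch
        return dtype  # float/complex data can already hold an integer fill
    if isinstance(fill_value, numbers.Real):
        # Float fill (including NaN): integers upcast to float64.
        return np.dtype(np.float64) if dtype.kind in "iu" else dtype
    if isinstance(fill_value, numbers.Complex):
        return dtype if dtype.kind == "c" else np.dtype(np.complex128)
    return np.dtype(object)


def take_1d_sketch(arr, indexer, fill_value=np.nan):
    """Take and upcast in one pass, writing straight into the output buffer."""
    out = np.empty(len(indexer), dtype=promote_for_fill(arr.dtype, fill_value))
    for i, idx in enumerate(indexer):
        # Assumption: -1 marks a missing position that receives fill_value.
        out[i] = fill_value if idx == -1 else arr[idx]
    return out


values = np.array([10, 20, 30], dtype=np.int32)
print(take_1d_sketch(values, [2, -1, 0], fill_value=0).dtype)
print(take_1d_sketch(values, [2, -1, 0]).dtype)
```

The point of the single loop is that no intermediate array in the source dtype is ever materialized; the Cython versions in the PR specialize this per dtype pair.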

The existing vb_suite tests were not really testing upcasting, so I added one for it. Here are the results vs. master:

Results:
                                            t_head t_baseline      ratio
name
frame_reindex_upcast                       26.7874    34.3973     0.7788

and against jrebacks/dtypes:

Results:
                                            t_head t_baseline      ratio
name
frame_reindex_upcast                       23.8220    28.0891     0.8481

jreback · 2013-02-08T06:10:06Z

I updated dtypes a couple of times today
looks like u pulled in at least the first one, but not the last
I added int16/int8 some float16 support in cython. (generate_code.py)

good news is it looks like changes are for the most part independent so u can prob merge easily
(I changed astype and convert in internals/BlockManager)

jreback · 2013-02-08T06:28:58Z

jreback@e5b41db

was after you updated - amazing we actually worked on a small overlap at the same time!

stephenwlin · 2013-02-08T06:37:20Z

@jreback, rebased and resolved conflicts

EDIT: actually, need to add int8 and int16 support to common...doing that now

jreback · 2013-02-08T06:43:28Z

I think that the new way that u did the take dictionaries in order to specify the cython functions - need to add for int8/int16

I think the merge took your versions (which is correct), but need to add in those other functions
tests prob all pass - but the takes will be using the generic take instead of the cython takes for int8/int16
maybe add a test for this?

jreback · 2013-02-08T06:44:09Z

hah! again typed the same thing at the same time!

jreback · 2013-02-08T06:46:23Z

?

how are u running test_perf to get the vbench ratios - I can never get it to work?

stephenwlin · 2013-02-08T06:49:07Z

ok, merged with int8/int16 support...

i just run ./test_perf.sh -b {BASE_SHA} ... not sure why it doesn't work for you? also, I installed vbench from https://github.com/pydata/vbench

jreback · 2013-02-08T06:50:54Z

thanks, I'll give it a try (I was putting too many options I think)

jreback · 2013-02-08T06:55:03Z

FYI - not sure if u r setup for Travis -

it picks up on a lot of dtype stuff and also shows how py3 reacts
(I actually installed py3.3 separately to test - it had some weird stuff)

stephenwlin · 2013-02-08T06:55:54Z

no, i'm not...how do i set that up?

jreback · 2013-02-08T07:01:30Z

http://about.travis-ci.org/docs/user/getting-started
step 2
it will auto build - I think u have to do something the first time though
then I click thru to the website to check progress and see how I fare on 3.3
and it's 32 bit
so u get really weird failures in some tests
e.g. comparing np.int_ against a specific dtype will fail (because it's np.int32 on this platform)
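
A small check makes that pitfall concrete. This is a minimal sketch assuming only NumPy; `is_signed_int` is a hypothetical helper, and the width printed depends on the platform and NumPy build.

```python
import numpy as np

# np.int_ is a platform-dependent alias, so its width differs between builds.
default_int = np.dtype(np.int_)
print(default_int.name, default_int.itemsize * 8, "bits")

arr = np.array([1, 2, 3], dtype=np.int_)


def is_signed_int(dtype):
    """Hypothetical helper: True for any signed integer dtype, whatever its width."""
    return np.dtype(dtype).kind == "i"


# Fragile: arr.dtype == np.int64 holds only where np.int_ happens to be 64-bit.
# More portable: compare against np.int_ itself, or check the dtype kind.
print(arr.dtype == np.dtype(np.int_), is_signed_int(arr.dtype))
```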

stephenwlin · 2013-02-08T07:10:11Z

thanks for the travis info, will take a look tomorrow (hopefully I didn't break too much in 3.3)

ghost · 2013-02-08T10:18:10Z

has someone been playing with commit --author? I had nothing to do with df87f8f.

stephenwlin · 2013-02-08T13:46:25Z

i squashed all the commits in #2708 and somehow it showed up as yours, not sure why. if I do it again i'll try to update it to jreback. (EDIT: ok, I went ahead and amended)

jreback · 2013-02-08T14:00:40Z

I think you should squash my PR as well...ultimately yours will be merged on top of master (after mine)

@y-p or is that right, since he is using my PR as a starting point (instead of master), I assume that updates/sync are all ok, but is there a difference in how you push/present?

stephenwlin · 2013-02-08T14:11:20Z

unless i'm mistaken about how things work, i think it's easier to keep it separate for now in case you update again: what i do to resolve is to reset back to master, merge your dtypes branch, squash everything into one commit, then cherry-pick the last version of my commit on top of that--it makes it easier to tell where the conflicts are that way.

after your PR is merged i will rebase and everything will be a single commit off the new master; should work fine, I think?

ghost · 2013-02-08T14:11:25Z

yep, exactly right. stephen keeps rebasing his commit on top of a squashed version of your rolling PR, and
when the time comes it'll get cherry picked after your PR is merged.

jreback · 2013-02-08T14:12:54Z

ahh...makes sense..thanks!

jreback · 2013-02-08T14:14:58Z

see you got travis running! great!

jreback · 2013-02-08T16:26:58Z

do you think we should add take support for uints? (and prob pad/fill and cases where I add ints now)?
its not hard....only thing is compile time gets longer....is this a big deal?

jreback · 2013-02-08T16:28:15Z

your generic takes handle this of course....question is should we include the cython specializations?

stephenwlin · 2013-02-08T16:34:01Z

i don't have an opinion, myself--i don't know enough about use cases. i'm fine updating the branch with whatever anyone else decides makes sense, though.

jreback · 2013-02-08T17:17:28Z

ok....minor push...removed support for float16...apparently cython really doesn't support these (and I was doing a conversion)...showed up in dff_int8 -> float16 (now goes to float32)...

also...you might want to add a test for _maybe_promote...something like:


dtypes = ['float16'.....,'int8'......'object']
for dtype in dtypes:
    for array_type in dtypes:
        # test whether you are getting the correct promotion......
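
A runnable variant of that loop might look like the sketch below. It uses `np.promote_types` as a stand-in, since the signature of the private `_maybe_promote` is not shown in this thread; the dtype list and the `check_promotions` name are illustrative assumptions, not the test that was actually added.

```python
import itertools

import numpy as np

# Illustrative dtype list (an assumption; the original list is elided).
dtypes = ["float32", "float64", "int8", "int16", "int32", "int64", "object"]


def check_promotions(dtype_names):
    """Check every dtype pair and return how many pairs were checked."""
    checked = 0
    for left, right in itertools.product(dtype_names, repeat=2):
        promoted = np.promote_types(left, right)
        # The promoted dtype must accept both inputs under safe casting rules.
        assert np.can_cast(left, promoted, casting="safe"), (left, right)
        assert np.can_cast(right, promoted, casting="safe"), (left, right)
        checked += 1
    return checked


print(check_promotions(dtypes))
```

A real test would call the pandas promotion function with each dtype and a set of fill values, then compare against a table of expected result dtypes.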

jreback · 2013-02-08T17:25:43Z

I rebased....hopefully didn't screw u up.....(and done unless doesn't build properly)

stephenwlin · 2013-02-08T18:07:51Z

test_2d_upcast_fill_nonna tests _maybe_promote indirectly...did you see that? if you think the coverage should be improved further let me know.

jreback · 2013-02-08T18:11:40Z

yes, of course.....that looks great.....its tricky to see diff's between the commits because of the dtypes changes.....but found it!

stephenwlin · 2013-02-09T02:52:36Z

Ahh, turns out _merge_blocks wasn't actually faster this way, so I reverted that part back...oh well. Upcasting is faster, though, for Cython-implemented cases, although current vbench tests don't really exercise it (will add a new test for it.)

stephenwlin · 2013-02-10T16:33:49Z

rebased off master after #2708 merged

wesm · 2013-02-10T20:51:34Z

pandas/src/generate_code.py

    if %(raise_on_na)s and _checknan(fill_value):
-        for i in range(n):
+        for i from 0 <= i < n:


FYI Cython optimizes range(n) so this won't have any effect

yeah, i know...but there was a mix of the two between the templates so I decided to make it consistent...

jreback · 2013-03-19T15:46:08Z

@stephenwlin if you have a chance....can you take a quick look at #3089

stephenwlin mentioned this pull request Feb 8, 2013

ENH: should boolean indexing preserve input dtypes where possible? #2794

Closed

stephenwlin added 2 commits February 10, 2013 11:32

TST: add vb_suite test for reindex with upcasting

119c2e1

ENH: Optimize take_*; improve non-NA fill_value support

c871050

wesm reviewed Feb 10, 2013

wesm merged commit c871050 into pandas-dev:master Feb 10, 2013

stephenwlin mentioned this pull request Feb 13, 2013

ENH: Consolidation and further optimization of take functions in common #2867

Merged

jreback mentioned this pull request Mar 19, 2013

PERF: regression from 0.10.1 #3089

Closed
