Grouby select multiple columns #6524

hayd · 2014-03-03T06:37:19Z

Is this supported?

g[['X', 'Y']]  # do groupby stuff with just these columns

http://stackoverflow.com/q/22139053/1240268

The text was updated successfully, but these errors were encountered:

hayd · 2014-03-03T06:40:35Z

~~This should probably raise NotImplemented ?~~

naught101 · 2014-03-03T09:37:01Z

@hayd, thanks for the follow up. For reference:

df = pandas.DataFrame({"Dummy":[1,2]*6, "X":[1,3,7]*4, 
                       "Y":[2,3,4]*4, "group":["A","B"]*6})
df[['X', 'Y']].head(1)
   X  Y
0  1  2
[1 rows x 2 columns]

df[:,['X', 'Y']].head(1)
TypeError: unhashable type: 'slice'

df.loc[:,['X', 'Y']].head(1)
   X  Y
0  1  2
[1 rows x 2 columns]

df.groupby('group')[['X', 'Y']].head(1)
         Dummy  X  Y group
group                     
A     0      1  1  2     A
B     1      2  3  3     B
[2 rows x 4 columns]

df.groupby('group').loc[:,['X', 'Y']].head(1)
AttributeError: Cannot access attribute 'loc' of 'DataFrameGroupBy' objects, try using the 'apply' method

jreback · 2014-03-03T13:31:31Z

This works, except in head/tail.
jreback@92e5c50
Current

In [1]: df = DataFrame([[1, 2], [1, 4], [5, 6]], columns=['A', 'B'])

In [2]: df
Out[2]: 
   A  B
0  1  2
1  1  4
2  5  6

[3 rows x 2 columns]

In [4]: df.groupby('A',as_index=True).head(1)
Out[4]: 
     A  B
A        
1 0  1  2
5 2  5  6

[2 rows x 2 columns]

In [5]: df.groupby('A',as_index=False).head(1)
Out[5]: 
   A  B
0  1  2
2  5  6

[2 rows x 2 columns]

master (with my change)

In [1]: df = DataFrame([[1, 2], [1, 4], [5, 6]], columns=['A', 'B'])

In [2]: df
Out[2]: 
   A  B
0  1  2
1  1  4
2  5  6

[3 rows x 2 columns]

In [3]: df.groupby('A',as_index=True).head(1)
Out[3]: 
     B
A     
1 0  2
5 2  6

[2 rows x 1 columns]

In [4]: df.groupby('A',as_index=False).head(1)
Out[4]: 
   B
0  2
2  6

[2 rows x 1 columns]

I think master is wrong here ?

TomAugspurger · 2014-03-03T14:18:39Z

Same issue? #5264

jreback · 2014-03-03T14:19:51Z

@TomAugspurger actually might be the same....let's close this on and consolidate.

The fix is pretty trivial, but a couple of tests are 'wrong' (that's why I put them up). So needs to be carefully gone over.

jreback · 2014-03-03T14:20:19Z

consolidating issue to #5264

hayd · 2014-03-03T18:06:28Z

@jreback I really want to kill this index behaviour of head/tail, it should act like filter. There is an issue somewhere, maybe should do it sooner rather than later #5755

hayd · 2014-03-03T18:10:05Z

@jreback master is wrong there!

hayd · 2014-03-03T18:16:17Z

will put up change (breaking) to make head/tail act like filter (regardless of as_index), the reason it's not is historical IMO (from when it was .apply(head) )... I don't think people ever want it ?

hayd added this to the Someday milestone Mar 3, 2014

hayd mentioned this issue Mar 3, 2014

groupby filter can't access all columns #6512

Closed

jreback closed this as completed Mar 3, 2014

TomAugspurger mentioned this issue Mar 3, 2014

BUG: groupby sub-selection ignored with some methods #5264

Closed

31 tasks

hayd mentioned this issue Mar 3, 2014

API change in groupby head and tail #6533

Merged

maurosilber mentioned this issue Dec 8, 2021

BUG: Inconsistent behaviour in DataFrameGroupBy when selecting a subset of columns #44821

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grouby select multiple columns #6524

Grouby select multiple columns #6524

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014

naught101 commented Mar 3, 2014

jreback commented Mar 3, 2014

TomAugspurger commented Mar 3, 2014

jreback commented Mar 3, 2014

jreback commented Mar 3, 2014

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014

Grouby select multiple columns #6524

Grouby select multiple columns #6524

Comments

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014

naught101 commented Mar 3, 2014

jreback commented Mar 3, 2014

TomAugspurger commented Mar 3, 2014

jreback commented Mar 3, 2014

jreback commented Mar 3, 2014

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014

hayd commented Mar 3, 2014