Fix AttributeError when groupby as_index=False on empty DataFrame #35324

salem3358 · 2020-07-17T13:46:10Z

closes BUG: AttributeError when doing groupby with as_index=False on Empty DataFrame #35246
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

simonjayhawkins · 2020-07-17T14:24:43Z

pandas/core/groupby/generic.py

                    # select everything except for the last level, which is the one
                    # containing the name of the function(s), see GH 32040
                    result.columns = result.columns.rename(
                        [self._selected_obj.columns.name] * result.columns.nlevels
                    ).droplevel(-1)

+                except ValueError as err:


if you leave this unchanged, does it work to put another try/except in the else

I was trying to avoid that because nesting exception handling increase complexity.

But you are right. There is a chance that the self._aggregate_multiple_funcs also raise AttributeError and this will change the behavior in that case.

Will update the code as your suggestion. Thanks!

I agree that nesting exception handling is not ideal, maybe an isinstance check for a Series instead.

between nesting exception handling and explicit type-checking (also nesting inside another exception handling), I think the first one is less evil :). what do you think?

from a static typing perspective, the isinstance is probably better. In this case, an AttributeError should be easily avoided. hasattr is not ideal from a static typing/mypy perspective either. However, leave it as a try/except for now and see what others think.

simonjayhawkins · 2020-07-17T14:37:08Z

pandas/tests/groupby/test_groupby.py

+    df = DataFrame(columns=["A", "B", "C"])
+    left = df.groupby(by="A", as_index=False)["B"].sum()
+    assert type(left) is DataFrame
+    assert left.to_dict() == {"A": {}, "B": {}}


can you change this test to be more like (and closer to the issue op)

df = pd.DataFrame(columns=["a", "b"]) grp = df.groupby(by="a", as_index=False)["b"] result = grp.sum() expected = pd.DataFrame(index=pd.Int64Index([], dtype="int64"), columns=["a", "b"]) expected["b"] = expected["b"].astype("float64") tm.assert_frame_equal(result, expected)

this is great. I was struggling with IndexType

simonjayhawkins · 2020-07-17T14:43:44Z

Thanks @salem3358 for the PR. Can you add a release note to 1.1.0. even if this doesn't make the RC, since this fixes a regression we will be backporting if necessary. cc @TomAugspurger

TomAugspurger · 2020-07-17T17:43:55Z

Added a releases note. Merging on green.

Fix AttributeError when groupby as_index=False on empty DataFrame

87003a2

salem3358 mentioned this pull request Jul 17, 2020

BUG: AttributeError when doing groupby with as_index=False on Empty DataFrame #35246

Closed

3 tasks

simonjayhawkins added Groupby Regression Functionality that used to work in a prior pandas version labels Jul 17, 2020

simonjayhawkins reviewed Jul 17, 2020

View reviewed changes

simonjayhawkins added this to the 1.1 milestone Jul 17, 2020

TomAugspurger added 2 commits July 17, 2020 12:40

Merge remote-tracking branch 'upstream/master' into fix_issue_35246

01b22cb

note

9bac93d

TomAugspurger mentioned this pull request Jul 17, 2020

RLS: 1.1 #34730

Closed

TomAugspurger merged commit ddc6442 into pandas-dev:master Jul 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix AttributeError when groupby as_index=False on empty DataFrame #35324

Fix AttributeError when groupby as_index=False on empty DataFrame #35324

salem3358 commented Jul 17, 2020 •

edited by simonjayhawkins

Loading

simonjayhawkins Jul 17, 2020

salem3358 Jul 17, 2020

simonjayhawkins Jul 17, 2020

salem3358 Jul 17, 2020

simonjayhawkins Jul 17, 2020

simonjayhawkins Jul 17, 2020

salem3358 Jul 17, 2020

simonjayhawkins commented Jul 17, 2020

TomAugspurger commented Jul 17, 2020

Fix AttributeError when groupby as_index=False on empty DataFrame #35324

Fix AttributeError when groupby as_index=False on empty DataFrame #35324

Conversation

salem3358 commented Jul 17, 2020 • edited by simonjayhawkins Loading

simonjayhawkins Jul 17, 2020

Choose a reason for hiding this comment

salem3358 Jul 17, 2020

Choose a reason for hiding this comment

simonjayhawkins Jul 17, 2020

Choose a reason for hiding this comment

salem3358 Jul 17, 2020

Choose a reason for hiding this comment

simonjayhawkins Jul 17, 2020

Choose a reason for hiding this comment

simonjayhawkins Jul 17, 2020

Choose a reason for hiding this comment

salem3358 Jul 17, 2020

Choose a reason for hiding this comment

simonjayhawkins commented Jul 17, 2020

TomAugspurger commented Jul 17, 2020

salem3358 commented Jul 17, 2020 •

edited by simonjayhawkins

Loading