ENH: Support merging DataFrames on a combination of columns and index levels #14355

jonmmease · 2016-10-05T14:00:26Z

.groupby ENH/API: clarify groupby by to handle columns/index names #5677
.sort_values ENH: Support sorting DataFrames by a combination of columns and index levels #14353, though this is directly in combat with .sort_index and non-explict
.merge (this issue)

Overview

@jorisvandenbossche
As a part of the Pandas 1.0 goal to "Make the index/column distinction less painful (#5677, #8162)" I propose that the df.merge method support merging DataFrames on a combination of columns and index levels.

This could be accomplished in the API by allowing the on, left_on, and right_on keywords to accept a combination of column names and index level names. Any index levels that are joined on would be preserved as index levels in the resulting merged DataFrame, while all other index levels would be removed.

This proposal is in the spirit of #5677 for df.groupby and #14353 for df.sort_values.

The text was updated successfully, but these errors were encountered:

jorisvandenbossche · 2016-10-10T12:46:43Z

+1 on this proposal. As I said in the related issue (#14353 (comment)), would be nice to have a general solution for this, but for now enabling this behaviour for specific functions/keywords is fine for me.

@jreback @wesm @TomAugspurger @shoyer @sinhrks @chris-b1 any concerns of feedback regarding this proposal, before work is done to implement it? (@jmmease you plan to tackle this if OK?)

jonmmease · 2016-10-10T14:17:36Z

@jorisvandenbossche Yes, if the direction is agreeable I plan to begin tackling this set of issues during the next month or two.
Thanks for the feedback!

TomAugspurger · 2016-10-10T15:24:06Z

I think I'm in favor of all the changes. Thanks for taking them on @jmmease!

shoyer · 2016-10-10T16:05:18Z

Yes, seems like a good idea to me.

shoyer · 2016-10-10T16:07:14Z

We do need to clarify how we will handle conflicting index/column names in a uniform way. For backwards compatibility, I think we need to always check column names before falling back to use index names.

jonmmease · 2016-10-12T23:22:26Z

@shoyer Agreed regarding conflict resolution. Thanks for the feedback!

jreback added Enhancement API Design Compat pandas objects compatability with Numpy or Python functions Master Tracker High level tracker for similar issues labels Oct 6, 2016

jorisvandenbossche added this to the Next Major Release milestone Oct 10, 2016

jonmmease mentioned this issue Oct 19, 2016

ENH: Allow the groupby by param to handle columns and index levels (GH5677) #14432

Merged

8 tasks

jonmmease mentioned this issue Sep 9, 2017

Support merging DataFrames on a combo of columns and index levels (GH 14355) #17484

Merged

5 tasks

jreback modified the milestones: Next Major Release, High Level Issue Tracking Sep 24, 2017

jreback closed this as completed in #17484 Dec 1, 2017

jreback modified the milestones: High Level Issue Tracking, 0.22.0 Dec 1, 2017

jonmmease mentioned this issue Dec 3, 2017

ENH: Support merging Dask DataFrames on a combination of columns and index levels dask/dask#2950

Closed

jorisvandenbossche mentioned this issue Aug 1, 2019

API: Meta-issue for making consistent API's to refer to column names and index names #27652

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Support merging DataFrames on a combination of columns and index levels #14355

ENH: Support merging DataFrames on a combination of columns and index levels #14355

jonmmease commented Oct 5, 2016 •

edited

Loading

jorisvandenbossche commented Oct 10, 2016

jonmmease commented Oct 10, 2016 •

edited

Loading

TomAugspurger commented Oct 10, 2016

shoyer commented Oct 10, 2016

shoyer commented Oct 10, 2016

jonmmease commented Oct 12, 2016

ENH: Support merging DataFrames on a combination of columns and index levels #14355

ENH: Support merging DataFrames on a combination of columns and index levels #14355

Comments

jonmmease commented Oct 5, 2016 • edited Loading

Overview

jorisvandenbossche commented Oct 10, 2016

jonmmease commented Oct 10, 2016 • edited Loading

TomAugspurger commented Oct 10, 2016

shoyer commented Oct 10, 2016

shoyer commented Oct 10, 2016

jonmmease commented Oct 12, 2016

jonmmease commented Oct 5, 2016 •

edited

Loading

jonmmease commented Oct 10, 2016 •

edited

Loading