Implement support for retaining Pandas index #6061

philippjfr · 2024-01-06T13:53:44Z

The Pandas support in HoloViews has always been severely hampered by the fact that we do not properly support indexes, i.e. if a user references an index column we are forced to call .reset_index() which is inefficient and breaks one core principle of HoloViews, which is that we simply provide thin wrappers around data. It also has significant performance implications both from a memory perspective and from a speed perspective since indexes can provide significant speedups when indexing or performing aggregations. One other major issue solved by supporting indexes is the problem of having multiple elements providing a view onto the same DataFrame, e.g. when you have an NdOverlay of curves each visualizing a different column.

Implements #2537

codecov-commenter · 2024-01-06T13:58:44Z

Codecov Report

Attention: Patch coverage is 20.06579% with 243 lines in your changes are missing coverage. Please review.

Project coverage is 26.96%. Comparing base (342d81c) to head (4546212).
Report is 15 commits behind head on main.

❗ Current head 4546212 differs from pull request most recent head 1476fac. Consider uploading reports for the commit 1476fac to get more accurate results

Files	Patch %	Lines
holoviews/tests/core/data/test_pandasinterface.py	21.17%	134 Missing ⚠️
holoviews/core/data/pandas.py	16.39%	102 Missing ⚠️
holoviews/core/data/ibis.py	41.66%	7 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #6061       +/-   ##
===========================================
- Coverage   88.68%   26.96%   -61.73%     
===========================================
  Files         316      318        +2     
  Lines       66072    67132     +1060     
===========================================
- Hits        58598    18104    -40494     
- Misses       7474    49028    +41554

Flag	Coverage Δ
ui-tests	`26.96% <20.06%> (+3.17%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

droumis · 2024-01-09T16:03:29Z

may partially resolve #6058

holoviews/core/data/pandas.py

philippjfr

Looks good, left a few suggestions and questions for you.

Co-authored-by: Philipp Rudiger <prudiger@anaconda.com>

philippjfr · 2024-04-17T15:55:55Z

Okay, I'm happy with this PR personally. We really need to merge this and test it extensively in all kinds of scenarios over the next few weeks.

holoviews/core/data/pandas.py

github-actions · 2024-10-23T10:28:50Z

This pull request has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

Implement support for retaining Pandas index

82a61e1

philippjfr mentioned this pull request Jan 6, 2024

[GOAL] Support viewing of medium, multi-channel timeseries data #6058

Closed

4 tasks

philippjfr added the type: enhancement Minor feature or improvement to an existing feature label Jan 6, 2024

philippjfr added 4 commits January 6, 2024 15:09

Column takes precedence over index

6915ad5

Small fix

a5a2772

Further fixes

96b69e3

Fix ibis

f8e1d48

droumis mentioned this pull request Jan 9, 2024

Add viewport downsample algorithm #6017

Merged

droumis assigned hoxbro Jan 23, 2024

hoxbro added 5 commits February 1, 2024 12:27

Merge branch 'main' into pandas_index_support

feaba77

Remove xfail for downsample test

5df94ad

Merge branch 'main' into pandas_index_support

9db1b27

Rename is_index to isindex to match with isscalar

b3398af

Small updates

e2ed038

hoxbro marked this pull request as draft February 8, 2024 20:24

hoxbro added 6 commits February 14, 2024 11:24

Update aggregate to work with MultiIndex

1adc533

Try commenting out reset_index

f4ad81f

Add select test and clean up

cf5cf3a

iloc with scalar values + fix

f522fdb

Update iloc to work with slice of indexes

4a2e368

Handle iloc scalar and slice

3cb074f

hoxbro mentioned this pull request Feb 15, 2024

Minor changes in preparation of the HoloViews Pandas index refactor holoviz/hvplot#1281

Merged

hoxbro added 2 commits February 15, 2024 16:40

Hvplot fixes

8e51d8a

Add test case for pandas range index

03af717

hoxbro force-pushed the pandas_index_support branch from daf36cf to 03af717 Compare February 15, 2024 16:52

hoxbro mentioned this pull request Feb 20, 2024

Add first-class support for polars #5939

Open

Add last tests and updates for iloc

ab1f2c2

hoxbro force-pushed the pandas_index_support branch from 5ec315a to ab1f2c2 Compare February 21, 2024 17:42

hoxbro marked this pull request as ready for review April 8, 2024 12:53

Clean up

526c188