Library Request: cuDF + RAPIDS #8318

AlexCatarino · 2024-09-11T19:26:52Z

cuDF (pronounced "KOO-dee-eff") is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data.

Test:

import cudf

tips_df = cudf.read_csv("https://github.com/plotly/datasets/raw/master/tips.csv")
tips_df["tip_percentage"] = tips_df["tip"] / tips_df["total_bill"] * 100

# display average tip by dining party size
print(tips_df.groupby("size").tip_percentage.mean())

Gives us:

No module named 'cudf'

EDIT: We need to install RAPIDS too.
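A quick way to confirm whether the packages are present before running the test above is to ask the import system directly. This is a minimal sketch using only the standard library; the helper name `missing_modules` is hypothetical, and the list passed to it is illustrative rather than a full RAPIDS manifest.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # "cudf" comes from the test above; extend the list as needed.
    missing = missing_modules(["cudf"])
    print("missing:", missing if missing else "none")
```

This only checks that the module can be located; it does not verify that a compatible GPU or driver is available.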

Checklist

I have completely filled out this template
I have confirmed that this issue exists on the current master branch
I have confirmed that this is not a duplicate issue by searching issues
I have provided detailed steps to reproduce the issue


beckernick · 2024-09-18T19:29:17Z

Hi! I came across this issue due to the cuDF reference. I work on cuDF and other RAPIDS projects at NVIDIA.

In addition to being a GPU DataFrame library, cuDF can provide zero-code-change GPU acceleration for pandas and (as of yesterday) Polars.

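# In a notebook; for Python scripts, run `python -m cudf.pandas script.py` instead
%load_ext cudf.pandas

import pandas as pd

df = pd.read_parquet(filepath)  # filepath: path to your Parquet file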

(df[["Registration State", "Violation Description"]]
 .value_counts()
 .groupby("Registration State")
 .head()
 .sort_index()
)

import polars as pl

ldf = pl.LazyFrame({"a": [1.242, 1.535]})

print(
    ldf.select(
        pl.col("a").round(1)
    ).collect(engine="gpu")
)

Would love to see these capabilities available for LEAN users. Happy to try to help answer any questions that might come up if you or anyone else explores this.

Martin-Molinero · 2024-12-10T18:08:48Z

We will be adding these libraries to the cloud-only default environment because of their large footprint (~8 GB).
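Since the libraries will only be present in the cloud environment, code that also runs locally may want to fall back to a CPU library when cuDF is absent. The sketch below shows one way to do that with the standard library; `first_available` is a hypothetical helper, and the assumption that pandas is an acceptable stand-in holds only for the parts of the API the two libraries share.

```python
import importlib


def first_available(names):
    """Import and return the first module in `names` that is installed."""
    for name in names:
        try:
            return importlib.import_module(name)
        except ImportError:
            # Not installed here; try the next candidate.
            continue
    raise ImportError(f"none of these modules could be imported: {list(names)}")


if __name__ == "__main__":
    try:
        # Prefer cuDF (cloud environment), otherwise fall back to pandas.
        xdf = first_available(["cudf", "pandas"])
        print("using", xdf.__name__)
    except ImportError as err:
        print(err)
```

Only `ImportError` is handled here; a machine with cuDF installed but no usable GPU may fail in other ways at import time.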

AlexCatarino added the library-request label Sep 11, 2024

AlexCatarino changed the title ~~Library Request: cuDF~~ Library Request: cuDF + RAPIDS Sep 25, 2024

Martin-Molinero mentioned this issue Dec 10, 2024 in Foundation update #8455 (merged, 11 tasks)

Martin-Molinero closed this as completed in #8455 Dec 10, 2024
