[dask] preserve chunks in results of multi-class pred_contrib predictions on sparse matrices #4438

jameslamb · 2021-07-04T17:04:34Z

Summary

As of #4378, DaskLGBMClassifier.predict(X, pred_contrib=True) returns a list of Dask Arrays if the model is a multiclass classification model and X is a scipy sparse array.

However, those Dask Arrays only have a single chunk. That code should be updated to preserve the original chunking from X.

Motivation

Preserving the chunking would improve the parallelism of any postprocessing of the prediction results using other Dask Array operations, which would reduce the risk of out-of-memory issues.

Description

See #4378 (comment) for a proposed solution, using dask.array.core.concatenate_lookup().

References

Created from #4378 (comment) and #4378 (comment).

This issue is only relevant once #4378 is merged.

The different output format for the multiclass + pred_contrib + sparse X case is described in detail in #3881.

The text was updated successfully, but these errors were encountered:

jameslamb · 2021-07-04T17:07:05Z

Per this project's process, I've added this to #2302, the issue where all feature requests are tracked. Anyone is welcome to contribute this feature. Please leave a comment here if you're interested in contributing and this issue can be re-opened.

jameslamb added feature request dask labels Jul 4, 2021

This was referenced Jul 4, 2021

[dask] Make output of feature contribution predictions for sparse matrices match those from sklearn estimators (fixes #3881) #4378

Merged

Feature Requests & Voting Hub #2302

Open

jameslamb closed this as completed Jul 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[dask] preserve chunks in results of multi-class pred_contrib predictions on sparse matrices #4438

[dask] preserve chunks in results of multi-class pred_contrib predictions on sparse matrices #4438

jameslamb commented Jul 4, 2021

jameslamb commented Jul 4, 2021

[dask] preserve chunks in results of multi-class pred_contrib predictions on sparse matrices #4438

[dask] preserve chunks in results of multi-class pred_contrib predictions on sparse matrices #4438

Comments

jameslamb commented Jul 4, 2021

Summary

Motivation

Description

References

jameslamb commented Jul 4, 2021