[FEA] Add option to accept sample weight vectors to fit methods #669

beckernick · 2019-06-11T14:54:48Z

Is your feature request related to a problem? Please describe.
In sklearn, estimator.fit can (almost always?) accept a sample_weight parameter (defaulting to None) that allows users to pass in a weights vector that determines how much weight each sample should receive (with length equal to the number of samples).

This would be a useful feature for cuML estimators, too. As an example, see the sklearn KMeans documentation

sample_weight : array-like, shape (n_samples,), optional
The weights for each observation in X. If None, all observations are assigned equal weight (default: None)

The text was updated successfully, but these errors were encountered:

JohnZed · 2019-06-27T20:44:54Z

Agreed this will be useful for most estimators. It will be an estimator-by-estimator process to add it, but we could start with linear models and get some commonality there. Not going to make it to 0.9 given current load there, but we'll keep it for a near future release.

JohnZed · 2019-08-08T19:55:27Z

Priority is for KMeans based on requests

Denisevi4 · 2019-09-18T15:26:01Z

Linear models pretty please?

JohnZed · 2019-09-19T06:39:24Z

Sorry, this didn't make it to the current release, but we'll add it to the list for an upcoming release.

JohnZed · 2020-02-03T16:33:38Z

Removing from 0.13 as we've added the k-means specific: #1625

beckernick · 2021-02-23T16:38:30Z

I think it may be worth re-opening this issue for tracking purposes.

A variety of issues exist requesting the ability to specify observation-level weights for various estimators and primitives. As the implementation may need to vary across estimators, it may make sense to keep these issues separate but linked together like an epic. Perhaps this issue can serve as that link, as it's the most broad and the oldest.

Estimators

Logistic Regression ([FEA] Support for Weights in Nearest Neighbors #3006 )
KMeans ([FEA] Support sample weights for KMeans #1625 ) (done single GPU)
SVM ([FEA] sample_weight for SVM #2222 ) (done single GPU)
KNN Classifier ([FEA] Support for Weights in Nearest Neighbors #3006 (comment))

Primitives

contingency_matrix ([FEA] Support for sample weights for contingency_matrix prim #2142).

Additionally, as these are implemented, it will also unblock using the respective estimators inside the sklearn AdaBoostClassifer meta-estimator API (#2401 (comment))

JohnZed · 2021-02-23T17:14:46Z

Long term definitely viable. We will evaluate in more detail whether it can make it into 0.19 and mark it as P1 or P0 if so.

…t) (#4867) Linking #669. This PR adds `sample_weight` parameter to the C++ Coordinate Descent solver, which is used by Lasso and ElasticNet. With some tests on C++ and Python level. I am also removing some cudaStream parameters when the raft handle can be used. Authors: - Micka (https://github.com/lowener) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: #4867

…t) (rapidsai#4867) Linking rapidsai#669. This PR adds `sample_weight` parameter to the C++ Coordinate Descent solver, which is used by Lasso and ElasticNet. With some tests on C++ and Python level. I am also removing some cudaStream parameters when the raft handle can be used. Authors: - Micka (https://github.com/lowener) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: rapidsai#4867

beckernick added feature request New feature or request ? - Needs Triage Need team to review and classify labels Jun 11, 2019

beckernick changed the title ~~[FEA] Add options to accept samples weight vectors to fit methods~~ [FEA] Add option to accept samples weight vectors to fit methods Jun 11, 2019

cjnolet added CUDA / C++ CUDA issue Cython / Python Cython or Python issue labels Jan 16, 2020

akkamesh mentioned this issue Apr 10, 2020

[REVIEW] Weighted k-means #2057

Merged

dantegd closed this as completed in #2057 May 6, 2020

beckernick reopened this Feb 23, 2021

beckernick changed the title ~~[FEA] Add option to accept samples weight vectors to fit methods~~ [FEA] Add option to accept sample weight vectors to fit methods Feb 23, 2021

JohnZed mentioned this issue Feb 25, 2021

[FEA] Support sample_weights in logistic regression #3559

Closed

lowener mentioned this issue Aug 17, 2022

Add sample_weight to Coordinate Descent solver (Lasso and ElasticNet) #4867

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Add option to accept sample weight vectors to fit methods #669

[FEA] Add option to accept sample weight vectors to fit methods #669

beckernick commented Jun 11, 2019 •

edited

Loading

JohnZed commented Jun 27, 2019

JohnZed commented Aug 8, 2019

Denisevi4 commented Sep 18, 2019 •

edited

Loading

JohnZed commented Sep 19, 2019

JohnZed commented Feb 3, 2020

beckernick commented Feb 23, 2021 •

edited

Loading

JohnZed commented Feb 23, 2021

[FEA] Add option to accept sample weight vectors to fit methods #669

[FEA] Add option to accept sample weight vectors to fit methods #669

Comments

beckernick commented Jun 11, 2019 • edited Loading

JohnZed commented Jun 27, 2019

JohnZed commented Aug 8, 2019

Denisevi4 commented Sep 18, 2019 • edited Loading

JohnZed commented Sep 19, 2019

JohnZed commented Feb 3, 2020

beckernick commented Feb 23, 2021 • edited Loading

JohnZed commented Feb 23, 2021

beckernick commented Jun 11, 2019 •

edited

Loading

Denisevi4 commented Sep 18, 2019 •

edited

Loading

beckernick commented Feb 23, 2021 •

edited

Loading