I have been thinking about feature_bagging and cross-referencing it with permutation importances and other feature-selection tools for decision tree ensemble models, and I have arrived at an idea for a possible LightGBM feature.
LightGBM could offer the option of performing feature bagging according to assigned inclusion probabilities for each feature, applied at each boosting iteration or each random forest tree.
It would work as a kind of proxy for the spike-and-slab technique. I do not know whether it would improve on the fully random option, since it would also produce trees with different numbers of features per iteration, but it would include the a priori best variables more often.
The main idea is to run LightGBM, compute permutation importances, set spike-and-slab inclusion probabilities based on those importances, and then run LightGBM again with those feature sampling prior probabilities.
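To illustrate the idea, here is a minimal sketch of turning permutation importances into per-feature inclusion probabilities. A least-squares fit stands in for a trained LightGBM model (to keep the example self-contained), and the `spike` floor is an assumed design choice, not an existing LightGBM parameter:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: only the first two of five features matter.
n, d = 500, 5
X = rng.normal(size=(n, d))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=n)

# Stand-in model: least-squares fit (a trained LightGBM model would go here).
coef, *_ = np.linalg.lstsq(X, y, rcond=None)

def mse(Xm):
    return np.mean((Xm @ coef - y) ** 2)

baseline = mse(X)

# Permutation importance: increase in error when one feature is shuffled.
importances = np.empty(d)
for j in range(d):
    Xp = X.copy()
    Xp[:, j] = rng.permutation(Xp[:, j])
    importances[j] = mse(Xp) - baseline

# Spike-and-slab style inclusion probabilities: normalize to [0, 1] and
# floor at a small "spike" probability so every feature keeps some chance
# of being sampled in a given iteration.
spike = 0.05
probs = np.clip(importances / importances.max(), spike, 1.0)
print(probs)
```

These `probs` would then be fed to the proposed per-feature sampling parameter on the second LightGBM run; in this sketch the two informative features end up with high inclusion probabilities while the noise features sit near the spike floor.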
Maybe I am asking for an already existing feature, such as a feature_weight parameter, but I have not managed to find it; excuse me if that is the case.
Regards.
Closed in favor of #2302. We decided to keep all feature requests in one place.
Contributions of this feature are welcome! Please re-open this issue (or post a comment if you are not the topic starter) if you are actively working on implementing it.