[BUG] OutlierRemover doesn't work with supervised learning #342
Comments
I suppose this could be fixed by using the imbalanced-learn Pipeline, or by implementing our own pipeline that allows this kind of action.
I like the idea and it's certainly a valid statement, but we'd like this library to remain primarily compatible with scikit-learn. I can imagine that it gets complicated for users too if they need to figure out which components require which pipeline backend.
I've given it some thought and I do think that a custom pipeline will be the only way to properly support this. That said, there might be something to be said for reusing the @Matgrb were you thinking about picking this issue up?
Hi @MBrouns,
Hm, I kind of like being able to do as much as possible inside a pipeline. It makes persistence much simpler and there's less chance that something will go wrong with data splits. I'll take a look at your approach though and see whether that fits!
Totally agree, that's why I was wondering what your current solution for outliers is.
X and y will have different sizes after outlier removal, because we can't filter y in the pipeline.
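To make the mismatch concrete, here is a minimal, dependency-free sketch. It is not this library's `OutlierRemover` and not an actual scikit-learn or imbalanced-learn class: `ZScoreOutlierRemover`, `MeanRegressor`, and `ResamplingPipeline` are hypothetical names invented for illustration, and the z-score rule is just an assumed outlier criterion. The `fit_resample(X, y)` method name follows the convention imbalanced-learn uses for samplers. The first part reproduces the size mismatch; the second shows how a pipeline that passes `y` through its resampling steps would avoid it.

```python
from statistics import mean, pstdev


class ZScoreOutlierRemover:
    """Hypothetical resampling step: drops outlying rows of X and the matching y."""

    def __init__(self, threshold=2.0):
        self.threshold = threshold

    def fit_resample(self, X, y):
        # Assumed criterion: keep values within `threshold` standard deviations of the mean.
        mu, sigma = mean(X), pstdev(X)
        keep = [abs(x - mu) <= self.threshold * sigma for x in X]
        X_kept = [x for x, k in zip(X, keep) if k]
        y_kept = [t for t, k in zip(y, keep) if k]
        return X_kept, y_kept


class MeanRegressor:
    """Toy final estimator that predicts the mean of the targets it was fit on."""

    def fit(self, X, y):
        if len(X) != len(y):
            raise ValueError(f"X has {len(X)} rows but y has {len(y)}")
        self.mean_ = mean(y)
        return self

    def predict(self, X):
        return [self.mean_ for _ in X]


class ResamplingPipeline:
    """Hypothetical pipeline whose intermediate steps may change the number of rows."""

    def __init__(self, resamplers, estimator):
        self.resamplers = resamplers
        self.estimator = estimator

    def fit(self, X, y):
        # Each resampler returns a filtered X *and* y, so the two stay aligned.
        for step in self.resamplers:
            X, y = step.fit_resample(X, y)
        self.estimator.fit(X, y)
        return self

    def predict(self, X):
        # Resamplers are skipped at prediction time, so every input row gets a prediction.
        return self.estimator.predict(X)


X = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 100.0]
y = [10.0, 12.0, 11.0, 13.0, 12.0, 11.0, 13.0, 500.0]

# Filtering X alone (what a plain transformer step can do) reproduces the mismatch.
X_only, _ = ZScoreOutlierRemover().fit_resample(X, y)
try:
    MeanRegressor().fit(X_only, y)
except ValueError as err:
    mismatch_error = str(err)

# Passing y through the resampling step keeps X and y the same length.
pipe = ResamplingPipeline([ZScoreOutlierRemover()], MeanRegressor()).fit(X, y)
predictions = pipe.predict(X)
```

The design choice in this sketch is that resampling happens only in `fit`, so `predict` still returns one value per input row; whether that is the right behaviour for this library is exactly the open question in the thread.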