Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataCleaner needs scaling #115

Closed
ardunn opened this issue Nov 1, 2018 · 1 comment
Closed

DataCleaner needs scaling #115

ardunn opened this issue Nov 1, 2018 · 1 comment
Assignees

Comments

@ardunn
Copy link
Contributor

ardunn commented Nov 1, 2018

Was having trouble implementing this so I am putting it off till later

Basically fitting (with scaling enabled) using DataCleaner should define a scaler_obj for the class. This scaler object can then be used to transform all numerical (excluding target and one-hot or label columns) columns on other dataframes while preserving the scaling from the fitted scaler.

In other words the scaler should not be refit during .transform, only .fit.

@ardunn ardunn self-assigned this Nov 1, 2018
@ardunn
Copy link
Contributor Author

ardunn commented Nov 7, 2018

It appears to me that tpot already handles all scaling and normalization needed based on models (including trying models with and without scaling/normalization). So as of right now doing scaling externally to tpot is not needed.

@ardunn ardunn closed this as completed Nov 7, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant