Stabilize PyMFE on big datasets #44

Merged: 3 commits, Jul 27, 2023

MorrisNein (Collaborator) commented Jul 27, 2023

  • Switch the categorical feature transformation to "one-hot" if pymfe raises RecursionError. This fixes pymfe for datasets with thousands of categorical features (e.g., OpenMLDataset(id_=40978, name=Internet-Advertisements)).
  • Allow dataset data without "categorical_indicator" to be passed to pymfe (for future stability).
  • Allow specifying kwargs for MFE.fit() and MFE.extract() in PymfeExtractor.extract() (just in case).
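
The three changes above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the function name `extract_with_fallback` and its parameters are hypothetical, and it assumes the extractor object exposes `fit(X, y, **kwargs)` and `extract(**kwargs)` and that `fit()` accepts `transform_cat` and `cat_cols` keyword arguments, as `pymfe.mfe.MFE` is understood to do.

```python
from typing import Any, Dict, Optional, Sequence


def extract_with_fallback(
    mfe: Any,
    X: Any,
    y: Any = None,
    categorical_indicator: Optional[Sequence[bool]] = None,
    fit_kwargs: Optional[Dict[str, Any]] = None,
    extract_kwargs: Optional[Dict[str, Any]] = None,
) -> Any:
    """Hypothetical helper: fit, retry with one-hot on RecursionError, then extract.

    `mfe` is assumed to behave like pymfe's MFE (fit/extract methods).
    """
    # Copy the caller's kwargs so they are never mutated.
    fit_kwargs = dict(fit_kwargs or {})
    extract_kwargs = dict(extract_kwargs or {})

    # "categorical_indicator" is optional: only when it is present do we
    # translate it into column indices (assumed `cat_cols` argument).
    if categorical_indicator is not None:
        cat_cols = [i for i, is_cat in enumerate(categorical_indicator) if is_cat]
        fit_kwargs.setdefault("cat_cols", cat_cols)

    try:
        mfe.fit(X, y, **fit_kwargs)
    except RecursionError:
        # Retry with one-hot encoding of categorical features
        # (assumed `transform_cat` argument), overriding any earlier choice.
        mfe.fit(X, y, **{**fit_kwargs, "transform_cat": "one-hot"})

    return mfe.extract(**extract_kwargs)
```

With pymfe installed, a call might look like `extract_with_fallback(MFE(), X, y, categorical_indicator=indicator)`. The retry only changes the encoding of categorical features, so any other `fit_kwargs` supplied by the caller are preserved on the second attempt.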

@MorrisNein requested a review from nicl-nno July 27, 2023 16:54
@MorrisNein merged commit c2f890b into main Jul 27, 2023
@MorrisNein deleted the stabilize_pymfe_on_big_datasets branch July 27, 2023 17:11
maypink pushed a commit that referenced this pull request Aug 15, 2023
* alter categorical features transformation to "one-hot" if pymfe raised RecursionError

* allow datasets data to not have "categorical_indicator" for pymfe

* allow to specify kwargs for MFE.fit() & MFE.extract() in PymfeExtractor.extract()