Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Analysis class needs to be beefed up with something actually useful #105

Closed
ardunn opened this issue Oct 24, 2018 · 2 comments
Closed

Analysis class needs to be beefed up with something actually useful #105

ardunn opened this issue Oct 24, 2018 · 2 comments

Comments

@ardunn
Copy link
Contributor

ardunn commented Oct 24, 2018

Given a PredictionPipeline object (or just a tpot model and feature list until I get the top level classes working better), analysis should give back a nice html (or other format?) containing:

  • identification of outliers
  • identification of the most important features
  • partial dependence plots w/ skater
  • LIME plots w/ skater
  • details of the features dropped/features retained
  • breakdown of the featurization time/fitting time/etc.
  • t-SNE plot based on features and labelled by material formula/phase

@Doppe1g4nger @ADA110 any other ideas on cool things to include here?

@albalu
Copy link
Contributor

albalu commented Oct 24, 2018

For outliers, there is single class SVM and isolation forest which are basically what they sound like but with one-class classification. In isolation forest you run random forest and the points that are identified by too few splits can be the outliers. This could be used as a default. I wanted to implement it but don't think will have time :| It is already implemented in sklearn but someone should just integrate it in the workflow

@Doppe1g4nger
Copy link
Contributor

Also we should add a tab of data giving a description of how the model that was selected works, at least in a shallow not-directed-towards-experts manner.

@ardunn ardunn added the ugrads label Dec 11, 2018
@ardunn ardunn changed the title analysis ideas - wip Analysis class needs to be beefed up with something actually useful Jan 26, 2019
@ardunn ardunn closed this as completed in cbe9437 Feb 8, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants