Analysis class needs to be beefed up with something actually useful #105

ardunn · 2018-10-24T21:02:01Z

Given a PredictionPipeline object (or just a tpot model and feature list until I get the top level classes working better), analysis should give back a nice html (or other format?) containing:

identification of outliers
identification of the most important features
partial dependence plots w/ skater
LIME plots w/ skater
details of the features dropped/features retained
breakdown of the featurization time/fitting time/etc.
t-SNE plot based on features and labelled by material formula/phase

@Doppe1g4nger @ADA110 any other ideas on cool things to include here?

albalu · 2018-10-24T21:12:09Z

For outliers, there is single class SVM and isolation forest which are basically what they sound like but with one-class classification. In isolation forest you run random forest and the points that are identified by too few splits can be the outliers. This could be used as a default. I wanted to implement it but don't think will have time :| It is already implemented in sklearn but someone should just integrate it in the workflow

Doppe1g4nger · 2018-10-24T22:22:54Z

Also we should add a tab of data giving a description of how the model that was selected works, at least in a shallow not-directed-towards-experts manner.

ardunn assigned Doppe1g4nger and ADA110 Oct 24, 2018

ardunn unassigned ADA110 Nov 2, 2018

ardunn added the ugrads label Dec 11, 2018

ardunn unassigned Doppe1g4nger Dec 11, 2018

ardunn changed the title ~~analysis ideas - wip~~ Analysis class needs to be beefed up with something actually useful Jan 26, 2019

ardunn closed this as completed in cbe9437 Feb 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analysis class needs to be beefed up with something actually useful #105

Analysis class needs to be beefed up with something actually useful #105

ardunn commented Oct 24, 2018 •

edited

Loading

albalu commented Oct 24, 2018

Doppe1g4nger commented Oct 24, 2018

Analysis class needs to be beefed up with something actually useful #105

Analysis class needs to be beefed up with something actually useful #105

Comments

ardunn commented Oct 24, 2018 • edited Loading

albalu commented Oct 24, 2018

Doppe1g4nger commented Oct 24, 2018

ardunn commented Oct 24, 2018 •

edited

Loading