Using a simple single train-test split evaluation, as shown in the ann_churn_classifier code,
is a good way to quickly iterate over different features and network arrangements until we
find a candidate architecture we feel confident about.
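As a rough sketch of what such a single-split evaluation can look like (synthetic data stands in for the churn dataset, and the two-hidden-layer architecture is only illustrative, not necessarily the one used in ann_churn_classifier):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Input

# Synthetic stand-in for the preprocessed churn features and labels.
X, y = make_classification(n_samples=2000, n_features=11, random_state=0)
X = StandardScaler().fit_transform(X)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

def build_ann():
    # Illustrative architecture: two small hidden layers, sigmoid output.
    model = Sequential([
        Input(shape=(X.shape[1],)),
        Dense(6, activation='relu'),
        Dense(6, activation='relu'),
        Dense(1, activation='sigmoid'),
    ])
    model.compile(optimizer='adam', loss='binary_crossentropy',
                  metrics=['accuracy'])
    return model

model = build_ann()
model.fit(X_train, y_train, batch_size=32, epochs=50, verbose=0)

# One accuracy number from one split: fast feedback, but high variance.
_, accuracy = model.evaluate(X_test, y_test, verbose=0)
print(f"Single-split test accuracy: {accuracy:.3f}")
```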
However, a single train-test split has a problem as a broader measure of performance: the training process of an ANN is non-deterministic and depends on which training data ends up in the split, so the accuracy results tend to have high variance.
As a result, once we want to seriously evaluate the performance of a candidate architecture, we should turn to more robust evaluation strategies like k-fold cross validation.
- The data is first split into a training set and a test set.
- K-fold cross validation is then run on the training set, as shown in the image above.
- If k = 5, the model is trained 5 + 1 = 6 times:
  - 5 times, each time leaving a different fold out as the validation fold; accuracy is calculated for each of these trainings.
  - A 6th time to train the final model on all of the training data.
- The list of 5 accuracies lets us compute the mean accuracy and its standard deviation / variance, which is a much more robust evaluation than a single train-test split.
- The test set is still reserved as strictly unseen data that can be used for a final evaluation of the final model (i.e. the 6th iteration, trained with all of the training data). A sketch of this workflow follows the list.
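Here is that workflow sketched out, reusing `build_ann` and the train/test split from the snippet above. Wrapping the Keras model with scikeras's `KerasClassifier` is an assumption made here so that it can be handed to scikit-learn's `cross_val_score`; the actual code may wrap it differently:

```python
from scikeras.wrappers import KerasClassifier  # assumed wrapper (pip install scikeras)
from sklearn.model_selection import cross_val_score

# Wrap the Keras build function so it behaves like a scikit-learn estimator.
classifier = KerasClassifier(model=build_ann, epochs=50, batch_size=32, verbose=0)

# k = 5: the model is trained 5 times, each time holding out a different fold
# of the training set and scoring accuracy on the held-out fold.
accuracies = cross_val_score(classifier, X_train, y_train, cv=5,
                             scoring='accuracy')
print(f"Mean accuracy: {accuracies.mean():.3f}  "
      f"Std dev: {accuracies.std():.3f}")

# The "6th" training: fit the final model on all of the training data, then
# do a single final evaluation on the untouched test set.
final_model = build_ann()
final_model.fit(X_train, y_train, batch_size=32, epochs=50, verbose=0)
_, test_accuracy = final_model.evaluate(X_test, y_test, verbose=0)
print(f"Final test accuracy: {test_accuracy:.3f}")
```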
Searching over hyperparameters and network arrangements is mostly done through grid search. In sklearn,
GridSearchCV runs k-fold cross validation internally for every combination of parameters in the grid.
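A hedged sketch of what that looks like, again reusing `build_ann` and the training split from above (the parameter grid here is purely illustrative, not the one used in the actual code):

```python
from scikeras.wrappers import KerasClassifier  # assumed wrapper, as above
from sklearn.model_selection import GridSearchCV

classifier = KerasClassifier(model=build_ann, verbose=0)

# Illustrative grid over training hyperparameters; GridSearchCV runs k-fold
# cross validation (cv=5) for every combination in the grid.
param_grid = {
    'batch_size': [16, 32],
    'epochs': [50, 100],
}
grid_search = GridSearchCV(estimator=classifier,
                           param_grid=param_grid,
                           scoring='accuracy',
                           cv=5)
grid_search.fit(X_train, y_train)

print(f"Best accuracy: {grid_search.best_score_:.3f}")
print(f"Best parameters: {grid_search.best_params_}")
```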
See the code for full details on how to do this.
See these concepts in action here