Traffic Sign Recognition

Writeup

Build a Traffic Sign Recognition Project

The goals / steps of this project are the following:

Load the data set (see below for links to the project data set)
Explore, summarize and visualize the data set
Design, train and test a model architecture
Use the model to make predictions on new images
Analyze the softmax probabilities of the new images
Summarize the results with a written report

Rubric Points

Here I will consider the rubric points individually and describe how I addressed each point in my implementation.

Data Set Summary & Exploration

1.Summary of data set In the code

In[4] contains the code used to generate the summary statistics of the traffic signs data set. The following is a brief summary of the data set:

The size of training set is 34799
The size of the validation set is 4410
The size of test set is 12630
The shape of a traffic sign image is 32x32x3, the 3 channels storing RGB information
The number of unique classes/labels in the data set is 43

2. Include an exploratory visualization of the dataset.

As an exploration of the data set, histograms of the training, validation and test set were plot to observe the distribution of the data. Out[6] contains a histogram of the training set, validation set and test set in the aforementioned order. An interesting observation is that the frequency distribution of the output classes seem to be rather similar in all three sets of data.

Design and Test a Model Architecture

1. Image Pre-processing

After some testing a range of values for normalization, I noticed that the CNN produced the best results when the image data was normalized between a range of 0.1 to 0.9. Here is a list of ranges attempted with the neural net:

-1 to 1
-0.5 to 0.5
0 to 1
0.05 to 0.95
0.1 to 0.9

From the different trial and errors, it was observed that (5) produced the optimal result with the given CNN architecture. This final configuration was obtained after cross-referencing with several other methods of normalization made available online.

2.Final model architecture

My final model consisted of the following layers:

Layer	Description
Input	32x32x3 Grayscale image
Convolution 5x5	1x1 stride, valid padding, outputs 28x28x6
RELU	RELU Activation function
Max pooling 2x2	2x2 stride, outputs 14x14x6
Convolution 5x5	1x1 stride, valid padding, outputs 10x10x16
RELU	RELU Activation Function
Max pooling 2x2	2x2 stride, outputs 5x5x16
Fully Connected Layer	5x5x16, outputs 256
Fully Connected Layer	outputs 120 with dropout of 0.5
Fully Connected Layer	outputs n_classes

The code for the neural net architecture may be found in code cell In[41].

3. Training parameters

To train the model, the following parameters were used:

Adam Optimizer: The gradient descent used to optimize the weights trained rather well, hence no changes were made to this from the base code.
Batch size of 128: The data set was too large and therefore had to be broken down into smaller sets. At the same time, we would not want the batch size to be too small as this would result in multiple iterations
80 epochs: With the dropout layer, the training took a larger number of epochs to converge.
Learning rate of 0.001: Based on tuning after several attempts between 0.001, 0.002, 0.005, and 0.01, it was found that the learning rate of 0.001 resulted in the best accuracy out of the three values.

4. Chosen Architecture and Approach

I used the LeNet architecture which was proposed by Yann Lecun, as it has been proven to be a highly accurate image classifier when trained over a large number of epochs with tuned hyperparameters.

However, the parameters in the fully connected layers were modified slightly from the neural net architecture provided in the deep learning lectures:

A dropout layer was implemented to retain the weights that were perceived to be relevant. While this network architecture took a larger number of epochs to converge, the addition of a single dropout layer at the second last layer improved the accuracy by approximately 2%.
The number of neurons in each fully connected hidden layer was increased to allow the neural net to detect a greater number of features from the images, before deciding whether these features were relevant in the final classification or not through the use of the dropout layer. Detecting more featres would allow a better regression fit in the final layer.

My final model results were:

training set accuracy of 100%
validation set accuracy of 94.7%
test set accuracy of 93.2%

It was observed that possibly due to the dropout layer, the convergence proved more to be a range of accuracies, rather than a single accuracy value, which was observed when the dropout layer was not implemented originally.

Test a Model on New Images

1. Choose five German traffic signs found on the web and provide them in the report. For each image, discuss what quality or qualities might be difficult to classify.

Here are five German traffic signs that I found on the web:

Some of the images chosen were seemed to be similar to several other shapes. For example, the double curve sign could be interpreted by the network as other signs instead due to its shape. Such an image selecton would therefore test the limits of the classifying network.

2. Discuss the model's predictions on these new traffic signs and compare the results to predicting on the test set. At a minimum, discuss what the predictions were, the accuracy on these new predictions, and compare the accuracy to the accuracy on the test set (OPTIONAL: Discuss the results in more detail as described in the "Stand Out Suggestions" part of the rubric).

Here are the results of the prediction:

Image	Prediction
60kmh	60kmh
70kmh	Priority Road
Doublecurve	Slippery Road
Bumpy Road	Bumpy Road
Roadworks	Roadworks

The model was able to correctly guess 3 of the 5 traffic signs, which gives an accuracy of 60%. This does not compare too favorably to the accuracy on the test set of 93.3%. One possible reason could be that the signs had similar outer shapes and inner colours to other signs For example, both the slippery road and double curve signs have red borders, white filling, and triangular shapes. Furthermore, both signs have curved features to similar angles. With such similar features, it may be possible for the neural network to pick up the same features, but classify as a slippery road instead.

3. Describe how certain the model is when predicting on each of the five new images by looking at the softmax probabilities for each prediction. Provide the top 5 softmax probabilities for each image along with the sign type of each probability. (OPTIONAL: as described in the "Stand Out Suggestions" part of the rubric, visualizations can also be provided such as bar charts)

The code for making predictions on my final model is located in code cell In[95] of the notebook.

The table below shows the top 5 softmax predictions based on probabilities output from the model for each image.

Image	Prediction (Probability)
60kmh	60kmh(1.00)
70kmh	Priority Road (1.00)
Double curve	Roadworks (1.00)
Bumpy Road	Bumpy Road (1.00)
Roadworks	Roadworks (1.00)

Name		Name	Last commit message	Last commit date
Latest commit History 125 Commits
examples		examples
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
LICENSE		LICENSE
README.md		README.md
Traffic_Sign_Classifier-1.html		Traffic_Sign_Classifier-1.html
Traffic_Sign_Classifier-1.ipynb		Traffic_Sign_Classifier-1.ipynb
checkpoint		checkpoint
lenet.data-00000-of-00001		lenet.data-00000-of-00001
lenet.index		lenet.index
lenet.meta		lenet.meta
signnames.csv		signnames.csv
visualize_cnn.png		visualize_cnn.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Traffic Sign Recognition

Writeup

Rubric Points

Here I will consider the rubric points individually and describe how I addressed each point in my implementation.

Data Set Summary & Exploration

1.Summary of data set In the code

2. Include an exploratory visualization of the dataset.

Design and Test a Model Architecture

1. Image Pre-processing

2.Final model architecture

3. Training parameters

4. Chosen Architecture and Approach

Test a Model on New Images

1. Choose five German traffic signs found on the web and provide them in the report. For each image, discuss what quality or qualities might be difficult to classify.

About

Releases

Packages

Languages

License

huiyulhy/CarND-Traffic-Sign-Classifier-Project

Folders and files

Latest commit

History

Repository files navigation

Traffic Sign Recognition

Writeup

Rubric Points

Here I will consider the rubric points individually and describe how I addressed each point in my implementation.

Data Set Summary & Exploration

1.Summary of data set In the code

2. Include an exploratory visualization of the dataset.

Design and Test a Model Architecture

1. Image Pre-processing

2.Final model architecture

3. Training parameters

4. Chosen Architecture and Approach

Test a Model on New Images

1. Choose five German traffic signs found on the web and provide them in the report. For each image, discuss what quality or qualities might be difficult to classify.

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages