This project contains the development of a vehicle image classification model based on convolutional neural network (CNN). The model is capable of classifying vehicle models based on images. It is the AI model used in the Flask API that is developed to facilitate the image recognition capabilities of the Price Finder mobile application.
- Achieved an accuracy of 97.26% when being evaluated on the test dataset across four classes.
- Use of a custom dataset to train and test the CNN model (No Kaggle datasets were used).
- Implementation of a selenium-based web scraper to gather images for the custom dataset.
- Automation of image preprocessing such as duplicate image removal and test-train-validation split using python.
- Accurate model evaluations using training and validation loss / accuracy graphs and confusion matrix using Matplotlib and NumPy.
- Modular project structure that separates the dataset collection and model development for ease in future scalability.
- Detailed in depth documentation on both dataset collection and model creation which covers the entire workflow structure from project set-up to saving the trained model.
- Model creation using Jupyter Notebook to provide ease in project reusability for further development as well as enhanced interactivity between testing and training compared to other forms of IDEs.
- Python - 3.8
- TensorFlow - 2.9
- NumPy - 1.20
- Scikit-learn - 1.1
- Matplotlib - 3.4
- Pillow - 9.2
- BeautifulSoup4 - 4.11
- Requests - 2.27
- Selenium - 4.4
- DifPy - 2.4
The dataset used in this project was sourced from google images with the use of a custom web scraper which is explained in detail during the Collect image data section.
For the current version of the model, the dataset consists of 4,138 images across 4 classes. The data is then split up into train, test and validation sets which each contains 4 sub directories of the 4 image classes.
The above process is mentioned in detail in the split dataset section.
Overview of the image data:
Class | Train | Test | Validate | Total |
---|---|---|---|---|
Alto 2015 | 1,252 | 139 | 140 | 1531 |
Hero Dash 2016 | 405 | 51 | 51 | 507 |
Toyota Aqua 2014 | 897 | 113 | 112 | 1,122 |
Wagon R Stingray 2018 | 782 | 98 | 98 | 978 |
Grand Total | - | - | - | 4,138 |
Few example images from each class:
Alto 2015 | Hero Dash 2016 | Toyota Aqua 2014 | Wagon R Stingray 2018 |
---|---|---|---|
📍 Note: The images used in the current project are not shared with the repository since i do not own the rights to the images downloaded through google images. However, that would not be an issue for anyone following this project since the webscraper script and instructions are given.
The project is based on two main environments, one for the image data collection and another for developing the model. The image data collection is handled through a python web scraper script and the model is developed on a jupyter notebook.
Clone the repository
git clone https://github.com/donheshanthaka/Price-Finder-Deep-Learning-Model.git
Prerequisites:
📌 Make sure that you chrome is updated, if not the browser will close automatically during the start of the script
Step 1:
- Create a python virtual environment inside the
web_scraper_for_image_collection
folder. - Navigate to the folder and open up a terminal and insert the following command.
python -m venv env
- Navigate to the activation path
cd env/Scripts
- Activate the virtual environment (Run either one, not both)
activate.bat //In CMD
Activate.ps1 //In Powershell
Step 2:
- Install the required dependencies
pip install -r requirements.txt
Step 1:
This script uses selenium webdriver module to scrape images from Google Images.
- Open the web_scraping_google_images.py script
- You have to edit two fields inside the script according to the preferred vehicle model.
folder_name
= The name of the folder where downloaded images are saved (add iteration value at the end of the name).
search_URL
= URL of the search query made using google images with the vehicle model
Example: (Vehicle model = Toyota Aqua 2014)
- Go to google images and search the vehicle: example
- Use that url for
search_URL
folder_name = "images\\aqua 01".
search_URL = "https://www.google.lk/search?q=Toyota+Aqua+2014&hl...."
- Run the script inside the virtual environment.
- A Chrome browser will open with the given url, Do not click on the images, instead scroll to the very bottom of the webpage.
- Then press enter on the python terminal where the script is running.
- The browser will scroll up automatically and start downloading each image.
- After downloading the images, the browser will close itself.
Repeate the above steps until you have collected sufficient amount of images per vehicle (4-5 times) and do the same for all the vehicle models you want the cnn model to be trained on.
📌 Hint: Use different keywords on every search (Toyota Aqua 2014 Front / Toyota Aqua 2014 Japan / Prius C 2014) > Prius C is another name used for Aqua in international markets.
The download_image_preprocessor.py
script is used to make image collections of each vehicle model by combining the subsets of each run such as (aqua 01, aqua 02 ...) as well as identify duplicate images downloaded when running the image collection script and delete them.
Step 01:
- Copy all the sub folders (aqua 01, aqua 02.. ) into a parent folder. E.g. ->
aqua collection
- Repeat the above step for all the vehicle image sets collected.
An example of the directory structure of images folder:
images <- top level folder
│
└───aqua collection <- the collection folder that was created using the above step
│ │
│ └───aqua 01
│ │ | 001.jpg
│ │ | 002.jpg
│ │ | ...
│ └───aqua 02
│ │ | 001.jpg
│ │ | 002.jpg
│ │ | ...
│ │
└───prius collection
│ │
│ └───prius 01
│ │ | 001.jpg
│ │ | 002.jpg
│ │ | ...
│ └───prius 02
│ │ | 001.jpg
│ │ | 002.jpg
│ │ | ...
│ │ ...
Step 02:
- Run the
download_image_preprocessor.py
script. - It will create a
duplicate_checked
folder inside every vehicle model collection folder which contain the processed images after deleting the duplicates, however it is Highly recommended to manually check that folder and delete unwanted images that are downloaded from the webscraper since it is quite normal for the google images to contain unrelated images with every search and the webscraper has no intelligence in identifying images before download, therefore it will download all the results.
The image dataset will be divided among three sections with the below mentioned split ratios.
- Train -> 50%
- Test -> 25%
- Validation -> 25%
Step 01:
- Rename the image dataset folders with proper vehicle names including the model year instead of 'collection'
Example:
aqua collection > Toyota Aqua 2014
prius collection > Toyota Prius 2016
Step 02:
- Run the
dataset_split.py
script.
The script will split the dataset and create the new folder structure inside the classification model
folder where the model will be developed.
Dataset directory structure of the images folder inside classification_model after the above step:
images <- top level folder
└───train <- training images
│ └───Alto 2015
│ │ │ 01005.jpg
│ │ │ 08445.jpg
│ │ │ ...
│ └───Toyota Aqua 2014
│ │ 01654.jpg
│ │ 05422.jpg
│ │ ...
│
└───test <- testing images
│ └───Alto 2015
│ │ │ 01545.jpg
│ │ │ 01985.jpg
│ │ │ ...
│ └───Toyota Aqua 2014
│ │ 09655.jpg
│ │ 01054.jpg
│ │ ...
│
└───validate <- validation images
│ └───Alto 2015
│ │ │ 96505.jpg
│ │ │ 01655.jpg
│ │ │ ...
│ └───Toyota Aqua 2014
│ │ 09866.jpg
│ │ 35444.jpg
│ │ ...
Prerequisites:
- Anaconda - Don't need to install if you are following the Nvidia guide, since you will be downloading the miniconda version there.
If you have an Nvidia GPU then follow this amazing guide mentioned below by Jeff Heaton both in video and in markdown for a full in-depth explanation of the setup process.
- Setting Up CUDA, CUDNN, Keras, and TensorFlow on Windows 11 for GPU Deep Learning (Video) - Does work for Windows 10 as well.
- Markdown format setup
Step 01:
- Install Anaconda
Step 02:
- Open Anaconda Prompt
- Navigate to the
classification_model
folder inside the project - Type the following command inside anaconda prompt to create a virtual environment for anaconda -> The environment will be created according to the specifications mentioned in the
environment.yml
file
conda env create
- Activate the environment with the following command
activate vehicle-image-cnn-env
Step 03:
- Open jupyter notebook
jupyter notebook
The model in this project is developed using transfer learning method known as feature extraction. A pre-trained model is used as the starting point to the current model.
Transfer Learning:
Transfer learning is the reuse of a pre-trained model on a new problem. It's currently very popular in deep learning because it can train deep neural networks with comparatively little data. This is very useful in the data science field since most real-world problems typically do not have millions of labeled data points to train such complex models. And it can be considered a valid statement for the current project as well since the data was collected through a web scraper.
Feature Extraction:
Feature extraction transfer learning is when the underlying patterns (also called weights) of a pretrained model has learned are taken and adjusted its output to be more suited to the current problem.
For the current project, a pre-trained model from Keras Applications library is used. Which is the EfficientNetB1 model.
For example, EfficientNetB0 has 236 layers, but the top layer outputs 1000 classes because it was pretrained on ImageNet. To adjust this to the current scenario, the original activation layer has been removed the and replaced with the amount of output classes according to the vehicle models gathered. The important part here is that only the top few layers become trainable, the rest remain frozen.
This way all the underlying patterns remain in the rest of the layers and the rest can be utilized according to the current scenario. This method of transfer learning is highly efficient when the data is similar to the data the model has been pretrained on (data = images).
EfficientNet:
EfficientNets rely on AutoML and compound scaling to achieve superior performance without compromising resource efficiency. The AutoML Mobile framework has helped develop a mobile-size baseline network, EfficientNet-B0, which is then improved by the compound scaling method to obtain EfficientNet-B1 to B7.
EfficientNets achieve state-of-the-art accuracy on ImageNet with an order of magnitude with better efficiency.
Building the feature extraction model:
Since the model is created using a jupyter notebook, all the relevant steps and decisions made are mentioned in the notebook itself, therefore it will not be repeated in this documentation. However, since the notebook is available in github it can be viewed without running the project through an anaconda instance.
The model develeopment notebook
For users who are following this project from the initial steps can start running the notebook and continue to work there if you have followed up to step 3 in setting up the anaconda virtual environment.
After successful build on the model and fitting for 5 epochs with train and validation datasets an accuracy of 97.26% was achieved when being evaluated on the test dataset.
Evaluating the model based on training and validation loss:
Evaluating the model based on training and validation accuracy:
Confusion Matrix:
It seems that the model do get confused with Toyota Aqua 2014
and Wagon R Stingray 2018
with Alto 2015
on certain occasions. It suggest that more diverse images of Toyota Aqua 2014
and Wagon R Stingray 2018
are needed for the model to be trained more efficiently.
Visualizing predictions on test images:
This model is deployed using a flask API in google cloud platform that facilitates the image recognition capabilities of a mobile application that I have developed.
Feel free to checkout those projects as well mentioned on the links below: