A python library built to empower developers to build applications and systems with self-contained Deep Learning and Computer Vision capabilities using simple
and few lines of code.
A DeepQuest AI project https://deepquestai.com . Developed and Maintained by Moses Olafenwa and John Olafenwa, brothers, creators of TorchFusion, Authors of Introduction to Deep Computer Vision and creators of DeepStack AI Server.
Built with simplicity in mind, ImageAI
supports a list of state-of-the-art Machine Learning algorithms for image prediction, custom image prediction, object detection, video detection, video object tracking
and image predictions trainings. ImageAI currently supports image prediction and training using 4 different Machine Learning algorithms
trained on the ImageNet-1000 dataset. ImageAI also supports object detection, video detection and object tracking using RetinaNet, YOLOv3 and TinyYOLOv3 trained on COCO dataset. Finally, ImageAI allows you to train custom models for your peforming recognition of new objects.
Eventually, ImageAI will provide support for a wider
and more specialized aspects of Computer Vision including and not limited to image
recognition in special environments and special fields.
New Release : ImageAI 2.0.3
What's new:
- Options to set video detection timeout in seconds
- Options to save trained custom model in full
- Support for running custom prediction with fully saved model without specifying network type
- Support for running custom prediction with any fully saved Keras model
- Support for converting custom trained models to Tensorflow (.pb) format
- Support for continuous training from previously saved custom model
- Support for transfer learning from pre-trained models for small datasets
- Only models with increased accuracy will be saved during training
- Support for loading and prediction with multiple custom models
- Support for exporting models to DeepStack format
▣ Installation
▣ Image Prediction
▣ Object Detection
▣ Video Object Detection, Tracking & Analysis
▣ Custom Model Training
▣ Custom Image Prediction
▣ Documentation
▣ Projects Built on ImageAI
▣ High Performance Implementation
▣ AI Practice Recommendations
▣ Contact Developers
▣ Contributors
▣ References
- Python 3.5.1 (and later versions) Download (Support for Python 2.7 coming soon)
- pip3 Install
- Tensorflow 1.4.0 (and later versions) Install or install via pip
pip3 install --upgrade tensorflow
- OpenCV Install or install via pip
pip3 install opencv-python
- Keras 2.x Install or install via pip
pip3 install kerasTo install ImageAI, run the python installation instruction below in the command line:
pip3 install https://github.com/OlafenwaMoses/ImageAI/releases/download/2.0.3/imageai-2.0.3-py3-none-any.whl
or download the Python Wheel
imageai-2.0.3-py3-none-any.whl and run the python installation instruction in the command line
to the path of the file like the one below:
pip3 install C:\User\MyUser\Downloads\imageai-2.0.3-py3-none-any.whl
convertible : 52.459555864334106 sports_car : 37.61284649372101 pickup : 3.1751200556755066 car_wheel : 1.817505806684494 minivan : 1.7487050965428352
ImageAI provides 4 different algorithms and model types to perform image prediction, trained on the ImageNet-1000 dataset. The 4 algorithms provided for image prediction include SqueezeNet, ResNet, InceptionV3 and DenseNet.
Click the link below to see the full sample codes, explanations and best practices guide.
ImageAI provides very convenient and powerful methods to perform object detection on images and extract each object from the image. The object detection class provides support for RetinaNet, YOLOv3 and TinyYOLOv3, with options to adjust for state of the art performance or real time processing.
Click the link below to see the full sample codes, explanations and best practices guide.
Video Object Detection & Analysis
Below is a snapshot of a video with objects detected.
Video Custom Object Detection (Object Tracking)
Below is a snapshot of a video with only person, bicycle and motorcyle detected.
Video Analysis Visualization
Below is a visualization of video analysis returned by **ImageAI** into a 'per_second' function.
ImageAI provides very convenient and powerful methods to perform object detection in videos and track specific object(s). The video object detection class provided only supports the current state-of-the-art RetinaNet, but with options to adjust for state of the art performance or real time processing. Click the link to see the full videos, sample codes, explanations and best practices guide.
ImageAI provides classes and methods for you to train a new model that can be used to perform prediction on your own custom objects.
You can train your custom models using SqueezeNet, ResNet50, InceptionV3 and DenseNet in 5 lines of code.
Click the link below to see the guide to preparing training images, sample training codes, explanations and best practices.
Prediction from a sample model trained on IdenProf, for predicting professionals
mechanic : 76.82620286941528 chef : 10.106072574853897 waiter : 4.036874696612358 police : 2.6663416996598244 pilot : 2.239348366856575
ImageAI provides classes and methods for you to run image prediction your own custom objects using your own model trained with ImageAI Model Training class.
You can use your custom models trained with SqueezeNet, ResNet50, InceptionV3 and DenseNet and the JSON file containing the mapping of the custom object names.
Click the link below to see the guide to sample training codes, explanations, and best practices guide.
We have provided full documentation for all ImageAI classes and functions in 2 major languages. Find links below:
>> Documentation - English Version https://imageai.readthedocs.io
>> Documentation - Chinese Version https://imageai-cn.readthedocs.io
>> Documentation - French Version https://imageai-fr.readthedocs.io
ImageAI uses the Tensorflow backbone for it's Computer Vision operations. Tensorflow supports both CPUs and GPUs ( Specifically NVIDIA GPUs. You can get one for your PC or get a PC that has one) for machine learning and artificial intelligence algorithms' implementations. To use Tensorflow that supports the use of GPUs, follow the link below :
FOR WINDOWS
https://www.tensorflow.org/install/install_windows
FOR macOS
https://www.tensorflow.org/install/install_mac
FOR UBUNTU
https://www.tensorflow.org/install/install_linux
We also welcome submissions of applications and systems built by you and powered by ImageAI for listings here. Should you want your ImageAI powered developments listed here, you can reach to us via our Contacts below.
For anyone interested in building AI systems and using them for business, economic, social and research purposes, it is critical that the person knows the likely positive, negative and unprecedented impacts the use of such technologies will have. They must also be aware of approaches and practices recommended by experienced industry experts to ensure every use of AI brings overall benefit to mankind. We therefore recommend to everyone that wishes to use ImageAI and other AI tools and resources to read Microsoft's January 2018 publication on AI titled "The Future Computed : Artificial Intelligence and its role in society ".
Kindly follow the link below to download the publication.
https://blogs.microsoft.com/blog/2018/01/17/future-computed-artificial-intelligence-role-society/
Moses Olafenwa
Email: guymodscientist@gmail.com
Website: https://moses.aicommons.science
Twitter: @OlafenwaMoses
Medium : @guymodscientist
Facebook : moses.olafenwa
John Olafenwa
Email: johnolafenwa@gmail.com
Website: https://john.aicommons.science
Twitter: @johnolafenwa
Medium : @johnolafenwa
Facebook : olafenwajohn
We give special thanks to Kang vcar for his incredible and excellent work in translating ImageAI's documentation to the Chinese language. Find below the contact details of those who have contributed immensely to the ImageAI project.
Kang vcar
Email: kangvcar@mail.com
Website: http://www.kangvcar.com
Twitter: @kangvcar
-
Somshubra Majumdar, DenseNet Implementation of the paper, Densely Connected Convolutional Networks in Keras
https://github.com/titu1994/DenseNet/ -
Broad Institute of MIT and Harvard, Keras package for deep residual networks
https://github.com/broadinstitute/keras-resnet -
Fizyr, Keras implementation of RetinaNet object detection
https://github.com/fizyr/keras-retinanet -
Francois Chollet, Keras code and weights files for popular deeplearning models
https://github.com/fchollet/deep-learning-models -
Forrest N. et al, SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
https://arxiv.org/abs/1602.07360 -
Kaiming H. et al, Deep Residual Learning for Image Recognition
https://arxiv.org/abs/1512.03385 -
Szegedy. et al, Rethinking the Inception Architecture for Computer Vision
https://arxiv.org/abs/1512.00567 -
Gao. et al, Densely Connected Convolutional Networks
https://arxiv.org/abs/1608.06993 -
Tsung-Yi. et al, Focal Loss for Dense Object Detection
https://arxiv.org/abs/1708.02002 -
O Russakovsky et al, ImageNet Large Scale Visual Recognition Challenge
https://arxiv.org/abs/1409.0575 -
TY Lin et al, Microsoft COCO: Common Objects in Context
https://arxiv.org/abs/1405.0312 -
Moses & John Olafenwa, A collection of images of identifiable professionals.
https://github.com/OlafenwaMoses/IdenProf -
Joseph Redmon and Ali Farhadi, YOLOv3: An Incremental Improvement.
https://arxiv.org/abs/1804.02767