Icon designed by Freepik from www.flaticon.es
Portrait segmentation in Keras, using divamgupta's segmentation models source code. Aiming to perform real time segmentation in mobile and desktop devices, using a common source code written in C++ with Tensorflow Lite.
RTCPP (MBP 2017 - 2,9 GHz Quad-Core Intel Core i7) | RTIOS (iPhone 12) |
---|---|
This model was built/trained using Keras, the model was refactored from divamgupta's segmentation models source code.
The dataset used is a frankestein made from the supervisely dataset. The final dataset can be found here.
-
data augmentation with albumentations
-
YACS to manage experiment configs (these can be found in here)
-
Telegram callback notifications
-
Creates an output folder and stores:
- Tensorboard logs
- Logger output
- Model Checkpoints
-
SLURM scripts to enqueue the training process in a computer with shared resources.
The model consists of a MobileNet backbone and a UNet head. The MobileNet is prepared using fchollet's pretrained weights.
The full-model weights, along with corresponding output information, can be found in here.
This folder contains the pretrained weights for keras and tflite to test the realtime applications. These need to be located in models/model.h5
or models/model.tflite
to be used by the applications.
As the idea is to mantain a model that can run in real-time in several devices, along with the model training, there is a C++ Application to run this.
The Core and all depending applications need Tensorflow Lite library to run. In order to achieve this, Tensorflow Lite must be compiled from source, therefore, the tensorflow dependency is set as a submodule (v1.13x). To compile, use:
./tensorflow/lite/tools/make/download_dependencies.sh
./tensorflow/lite/tools/make/build_ios_universal_lib.sh
to build for iOS./tensorflow/lite/tools/make/build_lib.sh
to build for the current arch (tried on macOS)
C++ Shareable Source Code to Perform the inference using the TFLite model and other transformations in the image (blur/replace the background). Refer to the source code for more info.
C++ Application to test the Core. Uses OpenCV to load frames from Camera input and passes them through the model.
This app was made using macOS, therefore the Makefile link Tensorflow Lite library agains the path tensorflow/lite/tools/make/gen/osx_x86_64/lib
, which is generated using the script tensorflow/lite/tools/make/build_lib.sh
, if you ran this script using other SO, please modify the Makefile to link against the correct lib.
Python application to test the performance of the raw Keras model (without TFLite conversion).
iOS application that uses the Core to perform background segmentation in iOS devices. This application was set to work with arch64
only.
It uses OpenCV, using the Cocoapods dependency manager. To set up, run pod install
in rtios
root folder. The model.tflite
file needs to be copied into rtios/rtios
folder as well.
- CORE error handling
- Improve FPS:
- Downscale to ~512/224 to perform the blur
- Perform mask prediction and blur in different threads
- Perform the thread handling in C++ (if possible)
- Generate a new synthetic dataset (maybe MaskRCNN) to get better background replacement results (generate dataset using selfies and videoconference scenarios)