image-recognizer

Recognize the color formats and resolutions of images from their raw binary data.

Description

image-recognizer recognizes the color formats and resolutions of images from their raw binary data. The application is written in Python and uses two Keras neural network models to detect the correct color format and resolution. You can also supply your own custom Keras models in place of the default ones.

How does it work

The simplified operating principle of image-recognizer is shown in the diagram below:

The application starts by loading the raw binary data of an image. This data is interpreted as a GRAY8 image, and the pixels are arranged into many possible resolutions (for example, a picture of one million pixels can be arranged as 1000x1000 or 2000x500). Each generated arrangement is resized to 256x256 and given as input to the resolution neural network, which picks the most "correct" looking resolutions. The application then generates image interpretations in all possible color formats, using the resolution picked in the previous step. Those interpretations are resized to 256x256, converted to grayscale and given as input to the color format neural network, which picks the most "correct" looking color format. As output, the application gives the most correct-looking color format and resolution, with a confidence level for each.
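The step of arranging pixels into possible resolutions can be illustrated with a minimal sketch. It assumes that a candidate resolution is any width and height whose product equals the pixel count; the function name `candidate_resolutions` and the `min_side` parameter are hypothetical and not taken from the application, which may filter candidates differently.

```python
from math import isqrt


def candidate_resolutions(n_pixels, min_side=1):
    """Return every (width, height) pair with width * height == n_pixels.

    Illustrative only: min_side is an assumed filter that drops extremely
    thin arrangements; the real application may use other criteria.
    """
    pairs = []
    for small in range(max(min_side, 1), isqrt(n_pixels) + 1):
        if n_pixels % small == 0:
            large = n_pixels // small
            pairs.append((large, small))      # landscape orientation
            if large != small:
                pairs.append((small, large))  # portrait orientation
    return pairs


# One million GRAY8 pixels, as in the example in the text.
candidates = candidate_resolutions(1_000_000, min_side=100)
print((1000, 1000) in candidates, (2000, 500) in candidates)
```

Both resolutions mentioned in the text appear among the candidates, along with many others that the resolution network would then have to rank.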

Neural network models

Both Keras models implement a convolutional neural network with a binary output. The model responsible for picking the best-looking resolution is trained to distinguish a correct-looking resolution (after the raw data is interpreted as a GRAY8 image) from an incorrect-looking one. The model responsible for picking the best-looking color format is trained to distinguish a correct-looking color format interpretation from an incorrect-looking one. Both models have a single output neuron with a value between 0 and 1. The closer this value is to 1, the more strongly the network considers that the input data "looks" correct.
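Because each model produces one score between 0 and 1 per input, choosing a winner amounts to scoring every candidate and keeping the highest. The sketch below shows that selection only; `pick_best` and the hard-coded scores are hypothetical stand-ins, and in practice the scores would come from a Keras model's prediction on the resized image.

```python
def pick_best(candidates, score_fn):
    """Return (candidate, score) for the candidate with the highest score.

    score_fn stands in for a model that maps one candidate to a value
    between 0 and 1; here it is an assumption, not the application's API.
    """
    scored = [(score_fn(c), c) for c in candidates]
    best_score, best = max(scored, key=lambda pair: pair[0])
    return best, best_score


# Made-up scores for three candidate resolutions (illustrative values).
fake_scores = {(1000, 1000): 0.93, (2000, 500): 0.12, (500, 2000): 0.08}
best, confidence = pick_best(fake_scores, fake_scores.get)
print(best, confidence)
```

The returned score plays the role of the confidence level reported alongside the chosen resolution or color format.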

The models were trained on generated datasets, and training took place on Google Colab.

The Google Colab Jupyter notebooks used for training are in the notebooks folder.

Datasets

Datasets are *.tf directories containing image tensors and labels.

Resolution dataset

This dataset was made using a custom C application that converted RGB24 pictures to different formats and saved them as raw data. Bad samples were generated by treating the data as having a different resolution than it really has, and the result was saved as a GRAY8 image tensor.
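To see why a wrong resolution produces a recognizably bad sample, consider the same GRAY8 bytes split into rows at two different widths. This is a minimal sketch in Python rather than the C application mentioned above; the helper `as_rows` and the tiny 4x2 image are hypothetical.

```python
def as_rows(raw, width):
    """Split GRAY8 bytes (one byte per pixel) into rows of `width` pixels."""
    if width <= 0 or len(raw) % width != 0:
        raise ValueError("width must evenly divide the pixel count")
    return [list(raw[i:i + width]) for i in range(0, len(raw), width)]


# A 4x2 image: dark left half, bright right half.
raw = bytes([0, 0, 255, 255,
             0, 0, 255, 255])

good = as_rows(raw, 4)  # true width: the vertical edge lines up
bad = as_rows(raw, 2)   # wrong width: the edge turns into stripes

print(good)
print(bad)
```

At the true width the vertical edge is preserved in every row, while at the wrong width the rows alternate between dark and bright, which is the kind of visual artifact a network can learn to reject.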

Color recognition dataset

This dataset was made using the ImageMagick tool. Pictures in different formats were converted to GRAY8 to preserve some bit-depth detail. Bad samples were made using raviewer: the raw data was opened as a different color format and saved as a PNG picture, which was then converted to GRAY8 and stored as an image tensor.

Usage

python app recognize <path_to_raw_file> <value_to_check>

where <value_to_check> can be one of: [color_format, img_width, img_height, color_format_confidence, resolution_confidence]
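When calling the tool from a script, it can help to validate the requested value before launching it. The sketch below only builds the argument list for the command shown above; `build_command` and the file name `frame.raw` are hypothetical, and the format of the tool's output is not assumed here.

```python
# Values accepted as <value_to_check>, copied from the usage text.
VALID_VALUES = (
    "color_format",
    "img_width",
    "img_height",
    "color_format_confidence",
    "resolution_confidence",
)


def build_command(raw_path, value):
    """Return the argument list for `python app recognize <path> <value>`."""
    if value not in VALID_VALUES:
        raise ValueError(f"unknown value to check: {value!r}")
    return ["python", "app", "recognize", raw_path, value]


# "frame.raw" is a placeholder path used only for illustration.
command = build_command("frame.raw", "img_width")
print(" ".join(command))
```

The resulting list can be passed to `subprocess.run` from the repository root, assuming the dependencies are installed.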

