detectron2_cpu_img

This is 'development' docker container:

detectron2 for CPU (https://github.com/facebookresearch/detectron2)
face recognition module (https://pypi.org/project/face-recognition/)
python3.8 and libs
torch 1.8.1
simple web service, which get image as input and returns detectron2 Segmentation result, exif info and etc (Python/Flask)
simple web page to test service

Simple web page with this docker in backend https://regim.online

Note

I don't find 'ready to use' solution for my task (detect objects/segments on image) so I had to build it

I don't know much about CV/image recognition/python/ docker, so there may be errors and duck code. Any help is welcome.

I don't have CUDA GPU, so I build CPU based service which is more slower.

Using this code to build search/catalog for my home photo archive.

Usage

Build and run docker image:

docker-compose up -d or docker-compose up -d --build (to re-build image)

Service (<your_host>:<your_port>/api/v1.1/imgrecognize/) will start with docker up. Some time (~40sec) will spend for segmentation model downloading at start up (once per model: volume for models cache configured in docker-compose file)

Open in browser http://<your_host>:<your_port>/ Select image and push it. Get result.

or

Post image any way you preffer, some thing like:

curl --request POST -F "file=@IMG.JPG" localhost:5000/api/v1.0/imgrecognize/

-- get json as result

You can post more than one image in request, but procesing may take too much time and connection will close with time-out

Add request params to URL if needed:

help - to return help info (False is default, any other value is eq True). No any jobs will be done.
exif - to return exif if exist (False is default, any other value is eq True)
resimg - to return result image with objects marked as base64 string (False is default, any other value is eq True)
autorotation - autorotate image using exif data (Orientation) (False is default, any other value is eq True)
rotation={value} - rotate to degrees before analisys. Works with/without autorotation
resize={value} - resize to px (max side of image) before analisys. If value not passed: 1000px is default.
geodata - to return reverse geodecoding using exif GPS data (False is default, any other value is eq True) by https://nominatim.openstreetmap.org
lang={language_code} - used for geodata and translate ('en' is default)
translate - to return objects and segments array (objectsAndSegments_{lang} object), translated to target language (False is default, any other value is eq True)
segmentation - in api/v1.1/imgrecognize image segmentation can be turned off.
facerecognition - to return faces data, works if segmetation find persons on image, or for any image if senmentation is off. curl --request POST -F "file=@IMG.JPG" localhost:5000/api/v1.0/imgrecognize/?exif=False&autorotation&rotation=90

Responce example

https://github.com/mrekin/detectron2_cpu_img/wiki/Responce

Docker-compose

Some variables can be passed throw docker-compose.yml file

      - SEGMENTATION_MODEL=COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml
      - FLASK_DEBUG=True
      - FLASK_HOST=0.0.0.0
      - FLASK_PORT=5000

Find more segmentation models at https://github.com/facebookresearch/detectron2/tree/master/configs/COCO-InstanceSegmentation

Performance

My HW instanse is

Xeon E3 1260L
DDR3 16Gb
HDD 3Tb WD Red

Software:

VM (Proxmox) with 6 cores (host) and 1.5Gb Ram
Ubuntu 20.04 inside VM
Docker 20.10.5

Results highly depends on image resolution: less resolution - faster analisys

Instance segmentation:

 time  `curl --request POST -F "file=@IMG_3448.JPG" 192.168.1.111:5000/api/v1.0/imgrecognize/ > /dev/null`
 
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 3452k  100 52765  100 3401k   4940   318k  0:00:10  0:00:10 --:--:-- 15201

real    0m10.698s
user    0m0.009s
sys     0m0.023s

Near future plans:

IN_PROGRESS: clean-up python code
add basic-auth for service
add main colors calculation (histogram?)
add posibility to use models not only from zoo
add posibility to return result image with segments
DONE: add some variables for request (like ?noexif or ?noobjects)
DONE: add posibility use not only segmentation models
DONE: add possibility to resize image (panoptic segmentation causes docker crash if input image too big, may be to low RAM, so resize image is good point to solve this and increase recognition speed

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
html		html
static/js		static/js
README.md		README.md
docker-compose.yml		docker-compose.yml
dockerfile		dockerfile
service.py		service.py
test.txt		test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

detectron2_cpu_img

Note

Usage

Responce example

Docker-compose

Performance

Near future plans:

About

Releases 4

Packages

Languages

mrekin/detectron2_cpu_img

Folders and files

Latest commit

History

Repository files navigation

detectron2_cpu_img

Note

Usage

Responce example

Docker-compose

Performance

Near future plans:

About

Topics

Resources

Stars

Watchers

Forks

Releases 4

Packages 0

Languages

Packages