
make more efficient #4

Open
bertsky opened this issue Jan 21, 2022 · 1 comment
bertsky commented Jan 21, 2022

We currently only use Detectron2's DefaultPredictor for inference:

```python
self.predictor = DefaultPredictor(cfg)
```

But the documentation says:

> This is meant for simple demo purposes, so it does the above steps automatically. This is not meant for benchmarks or running complicated inference logic. If you'd like to do anything more complicated, please refer to its source code as examples to build and use the model manually.

GPU utilization is visibly low during inference, so a multi-threaded implementation with data pipelining should boost performance considerably.
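Following the documentation's advice, here is a minimal sketch of building and using the model manually (mirroring what DefaultPredictor does internally, but feeding several pages to the model in one batched forward pass; `cfg` and `images` are assumed to exist as in the current code, `images` being BGR numpy arrays):

```python
import torch
from detectron2.modeling import build_model
from detectron2.checkpoint import DetectionCheckpointer
from detectron2.data.transforms import ResizeShortestEdge

# build and load the model once, as DefaultPredictor does internally
model = build_model(cfg)
model.eval()
DetectionCheckpointer(model).load(cfg.MODEL.WEIGHTS)
aug = ResizeShortestEdge([cfg.INPUT.MIN_SIZE_TEST, cfg.INPUT.MIN_SIZE_TEST],
                         cfg.INPUT.MAX_SIZE_TEST)

with torch.no_grad():
    inputs = []
    for image in images:  # BGR numpy arrays, as DefaultPredictor expects
        height, width = image.shape[:2]
        resized = aug.get_transform(image).apply_image(image)
        tensor = torch.as_tensor(resized.astype("float32").transpose(2, 0, 1))
        inputs.append({"image": tensor, "height": height, "width": width})
    # one batched forward pass over all pages instead of one call per page
    predictions = model(inputs)
```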

bertsky commented Feb 3, 2022

The first attempt in predict-async does not actually reduce wall time (it only saves a few CPU seconds). Perhaps we must first disentangle the page loop and make it a proper pipeline.
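For reference, a rough sketch of what such a pipelined page loop could look like (all names here are hypothetical: `preprocess` stands for per-page image loading/resizing on the CPU, `predict` for the model forward pass on the GPU):

```python
import queue
import threading

def pipelined_pages(pages, preprocess, predict, maxsize=4):
    """Overlap CPU-bound preprocessing with GPU-bound inference."""
    q = queue.Queue(maxsize=maxsize)  # bounded, so the producer cannot run away
    sentinel = object()

    def producer():
        for page in pages:
            q.put(preprocess(page))   # CPU work happens here...
        q.put(sentinel)

    threading.Thread(target=producer, daemon=True).start()
    while True:
        item = q.get()
        if item is sentinel:
            break
        yield predict(item)           # ...while the GPU works here
```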

However, 88617a2 (i.e. predicting and post-processing at a lower pixel density, capped at 150 DPI) already helps quite a bit.
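For illustration only (this is not the actual code from 88617a2), capping the pixel density before prediction could look like the following, with the inverse factor used to map predicted coordinates back onto the original image:

```python
import cv2

MAX_DPI = 150  # empirical ceiling; segmentation does not need more detail

def downscale_for_prediction(image, dpi):
    """Downscale `image` (a numpy array) to at most MAX_DPI; return it
    together with the factor for mapping predictions back."""
    if dpi <= MAX_DPI:
        return image, 1.0
    scale = MAX_DPI / dpi
    height, width = image.shape[:2]
    resized = cv2.resize(image, (int(width * scale), int(height * scale)),
                         interpolation=cv2.INTER_AREA)
    return resized, 1.0 / scale
```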
