GitHub - kenwaytis/OCR_modelscope: OCR container with RESTful API

About The Project

An OCR server that runs in a Docker container and provides a RESTful API. It supports Chinese and English. The model integrates line detection and text recognition and works out of the box. Model files and inference code come from ModelScope.

The line detection model is tested on the MTWI test set with the following results:

Backbone	Recall	Precision	F-score
ResNet18	68.1	84.9	75.6

A benchmark for the text recognition model is not yet available.

Usage

1. Environment requirements

Requires Docker Engine or Docker Desktop.
Since the container uses GPUs, the NVIDIA Container Toolkit (Nvidia Docker) must also be installed so that the container can access them. For installation instructions, see: Nvidia container-toolkit.

All other dependencies and models are included in the pre-built Docker image. You can also build the same image from scratch from the source code.

2. Installation

Clone the repo

git clone https://github.com/kenwaytis/OCR_modelscope.git

(Option) Start the server with the default Docker image

docker compose up

(Option) Build the image from scratch

3.1 Modify the docker-compose.yml file from

image: paidax/ocr_modelscope:0.6.3

to

image: namespace/ocr_modelscope:0.6.3

3.2 Start the server

docker compose up

3. API interface description

Because the server is built with FastAPI, you can view the automatically generated API documentation at localhost:9533/docs.
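The generated documentation is backed by an OpenAPI schema, which FastAPI serves at /openapi.json by default. As a minimal sketch, assuming this server keeps that default and listens on localhost:9533, the following helpers list the routes the schema declares. The names `list_routes` and `fetch_schema` are illustrative and not part of this repo.

```python
import json
import urllib.request

# HTTP methods that can appear as operation keys in an OpenAPI path item.
HTTP_METHODS = {"get", "put", "post", "delete", "options", "head", "patch", "trace"}


def list_routes(schema: dict) -> list[tuple[str, str]]:
    """Return sorted (METHOD, path) pairs from an OpenAPI schema dict."""
    routes = []
    for path, path_item in schema.get("paths", {}).items():
        for key in path_item:
            # Skip non-operation keys such as "parameters" or "summary".
            if key.lower() in HTTP_METHODS:
                routes.append((key.upper(), path))
    return sorted(routes)


def fetch_schema(base_url: str = "http://localhost:9533") -> dict:
    """Download the schema (assumes FastAPI's default /openapi.json is enabled)."""
    with urllib.request.urlopen(f"{base_url}/openapi.json", timeout=10) as resp:
        return json.load(resp)
```

With the server running, `list_routes(fetch_schema())` should include the POST route for /ocr_system if the schema endpoint has not been disabled.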
Description:

URL:

localhost:9533/ocr_system

Request method:

POST

JSON description:

field name	required or not	type	note
images	yes	list[str]	base64 encoded images

Request JSON example:

{
  "images":["img1", "img2", "img3"]
}
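The request above can be sketched as a small Python client using only the standard library. This is an illustration under stated assumptions, not code from the repo: the helper names `build_payload` and `post_ocr` are hypothetical, the server is assumed to be reachable at localhost:9533, and each image is assumed to be sent as plain base64 text without a data-URI prefix.

```python
import base64
import json
import urllib.request


def build_payload(images: list[bytes]) -> dict:
    """Base64-encode raw image bytes into the documented request body."""
    return {"images": [base64.b64encode(img).decode("ascii") for img in images]}


def post_ocr(images: list[bytes], url: str = "http://localhost:9533/ocr_system"):
    """POST the payload as JSON and return the decoded JSON response."""
    body = json.dumps(build_payload(images)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # Assumption: the server answers with a JSON body.
    with urllib.request.urlopen(request, timeout=60) as resp:
        return json.load(resp)
```

For example, `post_ocr([open("default.jpg", "rb").read()])` would send the sample image from the repo root. The response structure is not described in this README, so check the returned object or the /docs page before relying on specific fields.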

Acknowledgments

ModelScope

@article{tang2019seglink++,
  title={Seglink++: Detecting dense and arbitrary-shaped scene text by instance-aware component grouping},
  author={Tang, Jun and Yang, Zhibo and Wang, Yongpan and Zheng, Qi and Xu, Yongchao and Bai, Xiang},
  journal={Pattern recognition},
  volume={96},
  pages={106954},
  year={2019},
  publisher={Elsevier}
}
