Skip to content

How to handle an image with different kinds of texts (for example, there are printed texts and written words? #6019

Closed Answered by Evezerest
felix115 asked this question in Q&A
Discussion options

You must be logged in to vote

This is very common. Take an exam paper as an example. It has both printed texts and written words. Under this circumstance, you can adopt the solution of "1 detection model+ 1 N-class model+ N recognition models".

All types of texts share one detection model, and the N-class model which is a classifier in the extra training classifies the texts. If there are printed and written texts, the model is a two-class model; if there are N types of texts, the model is an N-class model. When it comes to recognition, every text type trains a recognition model. If there are printed and written texts in an image, two recognition models are needed, one for printed texts and the other for written texts…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Evezerest
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants