Replies: 2 comments
-
IPED does, not by default. Just set conf/ParsingTaskConfig.txt->processImagesInPdfs = true |
Beta Was this translation helpful? Give feedback.
0 replies
-
I didnt know that, thank you @lfcnassif ! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all, its been awhile since I dont use IPED, so i dont remember how pdf ocr works. But usually, with ocr tools, when I try to ocr a scanned pdf with embebbed text ( just a letter, or a digital signature, for example) all page is missed and ocr is not applied. With extractous you can set to ocr inline images and accomplish that.
If IPED doesnt do that, would be nice take a look.
Beta Was this translation helpful? Give feedback.
All reactions