Extract text from images (purchase orders) and process the files automatically to save key-in time.
Files used:
- PDF2IMG.py: Converts PDF files into images (.jpeg or .png format).
- Tesseract.py: Preprocesses the images and calculates the bounding boxes of the text.
Extracts the text in the images using Tesseract-OCR.
Any file processing after these steps depends on the use case.
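
The two steps above (PDF to image, then OCR with bounding boxes) could look roughly like the sketch below. It is not the code in PDF2IMG.py or Tesseract.py. It assumes the `pdf2image` and `pytesseract` packages are installed, and the function names, the `dpi` default, the grayscale preprocessing, and the `min_conf` threshold are all illustrative choices.

```python
from typing import Dict, Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, width, height) in pixels


def words_with_boxes(data: Dict[str, list], min_conf: float = 0.0) -> List[Tuple[str, Box]]:
    """Keep non-empty words from image_to_data-style output, with their boxes."""
    results = []
    for i, text in enumerate(data["text"]):
        word = text.strip()
        # Tesseract reports conf = -1 for rows that are not words.
        if word and float(data["conf"][i]) >= min_conf:
            box = (data["left"][i], data["top"][i], data["width"][i], data["height"][i])
            results.append((word, box))
    return results


def ocr_pdf(pdf_path: str, dpi: int = 200) -> Iterator[List[Tuple[str, Box]]]:
    """Convert each PDF page to an image, then OCR it page by page."""
    # Imported lazily so words_with_boxes works without these packages.
    from pdf2image import convert_from_path
    import pytesseract
    from pytesseract import Output

    pages = convert_from_path(pdf_path, dpi=dpi)  # list of PIL images
    for page in pages:
        gray = page.convert("L")  # assumed preprocessing: grayscale only
        data = pytesseract.image_to_data(gray, output_type=Output.DICT)
        yield words_with_boxes(data)
```

For example, `for page_words in ocr_pdf("purchase_order.pdf"): print(page_words)` would print the recognized words and their boxes for each page; the file name here is only a placeholder.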