chusheng0505/Tesseract-OCR-and-Automated-Files-Processing

Tesseract-OCR-and-Automated-Files-Processing

Extract text from images (purchase orders) and automatically process the resulting files to save key-in (data entry) time.

Files used:

  1. PDF2IMG.py: Converts PDF files into images (.jpeg or .png format).
  2. Tesseract.py: Preprocesses the images and computes the bounding boxes of the text,
    then extracts that text from the images using Tesseract-OCR.
    The file processing that follows is case-dependent.
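
The two steps above can be sketched roughly as follows. This is a minimal illustration, not the contents of PDF2IMG.py or Tesseract.py: it assumes the third-party packages pdf2image (with Poppler), pytesseract (with the Tesseract binary) and Pillow are installed, and the function names, the grayscale preprocessing, and the confidence threshold of 60 are all hypothetical choices.

```python
from pathlib import Path


def pdf_to_images(pdf_path, out_dir, fmt="png", dpi=200):
    """Render each PDF page to an image file and return the saved paths."""
    # Assumption: pdf2image and Poppler are installed.
    from pdf2image import convert_from_path

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, page in enumerate(convert_from_path(str(pdf_path), dpi=dpi), start=1):
        # fmt should be "png" or "jpeg"; Pillow infers the format from the suffix.
        path = out_dir / f"{Path(pdf_path).stem}_page{i}.{fmt}"
        page.save(path)
        paths.append(path)
    return paths


def words_with_boxes(data, min_conf=60.0):
    """Turn a pytesseract image_to_data dict into (text, (left, top, width, height)) pairs."""
    results = []
    for i, text in enumerate(data["text"]):
        # Confidence can be a string or a number depending on the pytesseract version.
        conf = float(data["conf"][i])
        if text.strip() and conf >= min_conf:
            box = (data["left"][i], data["top"][i], data["width"][i], data["height"][i])
            results.append((text.strip(), box))
    return results


def ocr_image(image_path, min_conf=60.0):
    """Run Tesseract on one image and return the recognised words with their boxes."""
    # Assumption: pytesseract, Pillow and the Tesseract binary are installed.
    import pytesseract
    from PIL import Image

    # Grayscale conversion stands in for whatever preprocessing a given case needs.
    image = Image.open(image_path).convert("L")
    data = pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)
    return words_with_boxes(data, min_conf)
```

A typical run would call `pdf_to_images("order.pdf", "pages")` and then `ocr_image(path)` for each returned path; the file name here is only a placeholder. What happens to the extracted words afterwards depends on the layout of each purchase order.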
