Skip to content

A Java application for recognizing Arabic typed-in scripts written in an image and then writing the contents to a text file. It utilizes Tesseract, which is an optical character recognition (OCR) engine.

Notifications You must be signed in to change notification settings

welbashier/arabic-ocr-tesseract

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

arabic-ocr-tesseract

A Java application for recognizing Arabic typed-in scripts written in an image and writing the contents to a file as text.

Initial step(s):

Copy the folder Tesseract/ to your local C:/temp folder

Notes:

  • Tiff images sometimes causes an issue with TIFFImageReader and fails to recognize the contents; this is a Tesseract problem
  • The result is not accurate 100% and it needs to be trained

About

A Java application for recognizing Arabic typed-in scripts written in an image and then writing the contents to a text file. It utilizes Tesseract, which is an optical character recognition (OCR) engine.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages