Skip to content

Simple but convenient GUI for Tesseract OCR written in Python, based on Qt6.

Notifications You must be signed in to change notification settings

knochenhans/ocrreader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OCR Reader

Simple but convenient GUI for Tesseract OCR written in Python, based on Qt6.

OCR Reader’s main scope isn’t batch OCR but making it as painless as possible to prepare (and eventually perform) OCR for documents with complicated layouts that need manual steps and often post-processing. One of the goals is to create export documents as closely aligned to the original as well as possible as allowed by the target medium (for example text reflowing for EPUB and HTML formats).

This is still in a very early stage of development and contains lots of bugs.

Some of the features currently implemented:

  • box editor featuring different box types (text, raw text, image)
  • Tesseract’s confidence values within text boxes as well as box text editor
  • box sequence can be manually reordered (import for export formats like EPUB or HTML)
  • automatic hyphen elimination using dictionary (for German)
  • basic font-size approximation
  • basic layout-detection
  • basic auto-alignment for images
  • document-wide header and footer to exclude elements like page numbers from being exported without manually deleting them

screenshot

Controls

General

  • Esc - Abort current action (Todo)

Box Editor

  • F1 - Select mode (Click to select, Ctrl + Click to select multiple, Click + Drag to move)

  • F2 - Drawing mode

  • F3 - Hand mode for scrolling (Todo)

  • F4 - Place header

  • F5 - Place footer

  • F6 - Activate renumbering mode (Click on first box if no box is selected, then click second box to mark it as the next box in sequence)

  • I - Set current box type to image

  • T - Set current box type to text

  • Ctrl + A - Select all boxes

When one or more boxes are selected

  • Alt + A - Auto align current box(es)
  • Alt + D - Disable current box(es) from being exported
  • Alt + I - Set current box(es) type to image
  • Alt + R - Recognize text (will automatically split and create new boxes based on Tesseract’s estimations)
  • Alt + T - Set current box(es) type to text

About

Simple but convenient GUI for Tesseract OCR written in Python, based on Qt6.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published