Extract text from plaintext, .docx, .odt and .rtf files. Pure go.
-
Updated
Nov 25, 2023 - Go
Extract text from plaintext, .docx, .odt and .rtf files. Pure go.
Read pdf files on javascript
Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-to-read-text-from-an-image-in-csharp-net/
Fast and memory-efficient Python PDF Parser based on xpdf sources
Batch-convert pdf to text, extract data from pdf in python
A Python asyncio wrapper for Tesseract-OCR.
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features
A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.
Deprecated - A fast API service for retrieving day to day stats about Coronavirus(COVID-19, SARS-CoV-2) outbreak in Kerala(India).
A mirror of https://git.tecosaur.net/tec/pdftotext.el
A PDF to text converter for Scriptable App (iOS) working offline
Python library and Web service based on Poppler Pdftotext utility and Tesseract OCR for extracting text from PDF documents
"PDF To Audio" is a Python tool that transforms PDF documents into audio files using OCR and Text-to-Speech technology. Ideal for accessibility and auditory learning, it supports multiple languages, parallel processing, and smart rate limit handling.
A simple RESTFul API service for poppler
Meu projeto do curso CS50: Um analisador de pdfs que processa as notas dos aprovados pelo Acesso Enem e organiza tudo. Agora em C++
This project for converting books from PDF to Proper JSON objects by separating title and content. After you take your output, you can insert your JSON file in the database easily.
Converts an image to a CSV. This exists because Chorus 3.0 is bat-shit and only show images for vital metadata.
Add a description, image, and links to the pdftotext topic page so that developers can more easily learn about it.
To associate your repository with the pdftotext topic, visit your repo's landing page and select "manage topics."