pdf-documents

Star

Here are 7 public repositories matching this topic...

py-pdf / pypdf

Star

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

python pdf help-wanted pdf-documents pypdf2 pdf-manipulation pdf-parsing pdf-parser

Updated Jan 1, 2025
Python

pymupdf / PyMuPDF

Star

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

python pdf font data-science ocr tesseract epub mupdf text-processing pdf-documents extract-data table-extraction text-shaping xps pymupdf

Updated Dec 24, 2024
Python

pypdfium2-team / pypdfium2

Star

Python bindings to PDFium

python pdf pdf-documents rasterisation pdf-to-image pdfium

Updated Dec 30, 2024
Python

michelcrypt4d4mus / pdfalyzer

Star

Analyze PDFs. With colors. And Yara.

pdf malware-analysis pdf-documents pdf-format pdf-parser malicious-pdf-files

Updated Dec 14, 2024
Python

Anish-M-code / pdftotext

Star

A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.

pdf ocr tesseract-ocr pdf-documents hacktoberfest pdftotext ocr-recognition ocr-text-reader ocr-python pdftools hacktoberfest-accepted poppler-utils hacktoberfest2022

Updated Oct 27, 2024
Python

erikkastelec / PDFScraper

Star

CLI program for searching inside text and tables in PDF documents and displaying results in HTML.

ocr pdf-documents pdfminer camelot ocr-analysis

Updated Feb 7, 2024
Python

timothy-bartlett / PyMuPDF

Star

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

python pdf font data-science ocr mupdf text-processing pdf-documents extract-data table-extraction text-shaping xps pymupdf

Updated Aug 22, 2024
Python

Improve this page

Add a description, image, and links to the pdf-documents topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pdf-documents topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf-documents

Here are 7 public repositories matching this topic...

py-pdf / pypdf

pymupdf / PyMuPDF

pypdfium2-team / pypdfium2

michelcrypt4d4mus / pdfalyzer

Anish-M-code / pdftotext

erikkastelec / PDFScraper

timothy-bartlett / PyMuPDF

Improve this page

Add this topic to your repo