Skip to content

JHW5981/split_pdfs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PDF SPLITOR

Python implementation of spliting pdfs

Requirements:

  • python >= 3.8
  • fitz
Install

Clone repo and install requirements.txt in a Python>=3.8.0 environment, including fitz>=0.0.1.

git clone https://github.com/sjtu-jhw/split_pdfs.git  # clone
cd split_pdfs
pip install -r requirements.txt  # install
Usage

Just need to specify the input PDF path and the path where you are prepared to save the split results.

For instance, if the PDF is located in the root directory with the name Template.pdf, and you want to save the split results in a folder named splits also in root, you can simply run a one-liner in the terminal to obtain your split results.

python split.py --pdf_path "./Template.pdf" --output_dir "./splits/"
Results

In the splits folder, you will find a series of PDFs, each representing one page of the original PDF. The naming format is {page_number}-{RL[min]}-{Calc.MW}. If no results are parsed, the corresponding value is NaN.

About

Repository for PDF split

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages