PDF to Audio Converter

This project provides a Python script that converts PDF documents into audio files. The script extracts text from a PDF, cleans the text, converts it to speech, and saves it as an audio file in MP3 format.

Features

Extracts text from large PDF files.
Cleans unnecessary characters from the extracted text.
Converts text to speech using pyttsx3.
Splits the text into chunks and processes them to avoid memory overload.
Combines the generated audio chunks into a single MP3 file.

Requirements

Python Dependencies

This project uses Poetry for dependency management. The required Python libraries are:

PyPDF2
pyttsx3
pydub

System Dependencies

macOS (using Homebrew)

You need to install ffmpeg to handle audio processing. This can be installed via Homebrew:

brew install ffmpeg

Windows

For Windows, you will need to download and install FFmpeg manually:

Download the latest build of FFmpeg from the official website.
Extract the files and add the bin directory to your system's PATH.

Linux

On Linux, FFmpeg can usually be installed via the package manager.

For Debian-based distributions (like Ubuntu):

sudo apt-get update
sudo apt-get install ffmpeg

For Red Hat-based distributions (like Fedora or CentOS):

sudo dnf install ffmpeg

Installation

Clone the repository:
- git clone https://github.com/sairoko12/pdf-2-audio.git
- cd pdf-2-audio
Install Python dependencies using Poetry:
- Ensure you have Poetry installed. If not, you can install it via pip:
  - pip install poetry
- Then, install the project dependencies:
  - poetry install
Install System Dependencies:
- Follow the instructions in the “System Dependencies” section to install FFmpeg on your system.

Usage

To convert a PDF file to an audio file, run the following command:

poetry run python script.py

Settings

You should modify the next parameters of audio output:

rate
- The rate parameter in pyttsx3 controls the speed of speech, measured in words per minute (WPM). Lower values make the speech slower, while higher values increase the speed. Adjusting this allows you to tailor the speech pace to your needs.
volume
- The volume parameter in pyttsx3 controls the loudness of the speech. It is a float value between 0.0 (mute) and 1.0 (maximum volume). Adjusting this allows you to set the desired loudness level for the speech output.
language of speech
- Available languages (applies only for macos)

# Example for frech usage with fast speech
convert_pdf_to_audio(
    pdf_file='path/to/pdf_file.pdf',
    output_file_name='path/to/outpu_audio_file.mp3',
    language='fr-FR',
    rate=250,
)

This will convert the specified PDF file to an MP3 audio file.

Cleaning up

Temporary audio files generated during the conversion process are automatically removed after the final audio file is created.

Contributing

If you encounter any issues or have suggestions for improvements, please feel free to submit an issue or a pull request.

License

This project is licensed under the MIT License. See the MIT License for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
script.py		script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF to Audio Converter

Features

Requirements

Python Dependencies

System Dependencies

Installation

Usage

Settings

Cleaning up

Contributing

License

About

Releases

Packages

Languages

sairoko12/pdf-2-audio

Folders and files

Latest commit

History

Repository files navigation

PDF to Audio Converter

Features

Requirements

Python Dependencies

System Dependencies

Installation

Usage

Settings

Cleaning up

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages