pdf-access

pdf-access makes pdf documents more accessible to screen readers and other assistive technologies.

It uses a toml configuration file to specify a plan, match certain documents, and apply a list of actions to remediate a document.

Here is an example of a toml file that will unlock and remove text that is preventing a screen reader from reading the documents authored by Mom.

Other documents will be trimmed down to a single page compressed.

#----------------- Sources -----------------

[sources.my_pdfs]
in_path = "./originals"
out_path = "./accessible"

#----------------- Plans -------------------

[plans.unlock-compress]
actions = ["clear_encoding_differences"]
# match documents from Mom
metadata_search = { "author" = "Mom" }
passwords = ["c@11-y0ur-m0+h3r", "w3@r-c13@n-und3rw34r"]
post_process = ["gs-compress"]

[plans.compress-and-trim]
actions = ["single-page"]
# match everything else
metadata_search = {}
post_process = ["gs-compress"]

#----------------- Actions -----------------

[actions.clear_encoding_differences]
name = "Clear encoding differences"
function = "clear-encoding-differences"

[actions.single-page]
name = "Keep one page"
function = "keep-pages"
args.pages = [0]

To run the plan, you would use the following command:

pdf-access config.toml

The files in the ./originals directory would be processed and the results would be placed in the ./accessible directory.

Installation

pip install git+https://github.com/felddy/pdf-access.git

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

License

This project is in the worldwide public domain.

This project is in the public domain within the United States, and copyright and related rights in the work worldwide are waived through the CC0 1.0 Universal public domain dedication.

All contributions to this project will be released under the CC0 dedication. By submitting a pull request, you are agreeing to comply with this waiver of copyright interest.

Name		Name	Last commit message	Last commit date
Latest commit History 880 Commits
.github		.github
src/pdf_access		src/pdf_access
test-pdfs		test-pdfs
tests		tests
.ansible-lint		.ansible-lint
.bandit.yml		.bandit.yml
.coveragerc		.coveragerc
.flake8		.flake8
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.mdl_config.yaml		.mdl_config.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.yamllint		.yamllint
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
bump_version.sh		bump_version.sh
mypy.ini		mypy.ini
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
setup-env		setup-env
setup.py		setup.py
tag.sh		tag.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pdf-access

Installation

Contributing

License

About

Releases

Sponsor this project

Packages

Contributors 11

Languages

License

felddy/pdf-access

Folders and files

Latest commit

History

Repository files navigation

pdf-access

Installation

Contributing

License

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 11

Languages

Packages