Skip to content
This repository has been archived by the owner on Nov 9, 2020. It is now read-only.

PDF Files

PunKeel edited this page Mar 29, 2017 · 1 revision

How are PDF documents sanitized?

PDF is an open format, with a dedicated documentation - and not a short one. In a sense, PDF looks like HTML: powerful markup language than can embed scripts, with a vendor-dependant support of features.

The only way for us to clean up a PDF document is to loop over the elements and remove what we know to be JavaScript (or, more precisely, Additional Actions).

This might break Acro Forms, if any.

Clone this wiki locally