This is a research repository on the direction of PDF documents and extracting various PDF metadata for information.
The research can be found here.
- Install NodeJS 12+.
- Install all the 3rd party libraries
npm install
. - Write out the environment variables in a
.env
file (follow.sample.env
). - Run
npm run download
to download all the PDF files locally. - Run
npm run parse
to parse the metadata and output the summary locally.