Skip to content

PDF data extraction parsers that get published onto npm. Standalone, but run in conjunction with the openlawnz-pipeline.

License

Notifications You must be signed in to change notification settings

openlawnz/openlawnz-parsers

Repository files navigation

openlawnz-parsers

This package is used in the openlawnz-pipeline during pdf conversion.

It is standalone so that it can be versioned and others can easily work on it.

Commands

npm install
npm build
npm build:watch
npm run test
npm run test:coverage
npm run lint

Input

Input is a JSON file being the output of either:

  • PDF.js text output; or
  • Azure Cognitive Services OCR

See /testData/initFromConversion for an example input file

About

PDF data extraction parsers that get published onto npm. Standalone, but run in conjunction with the openlawnz-pipeline.

Topics

Resources

License

Stars

Watchers

Forks