Don't default to OCRing everything #10

simonw · 2022-06-30T00:17:28Z

OCRing everything is a dangerous default: if someone runs this against a bucket containing tens of thousands of PDFs, it could cost them a lot of money.

I'm going to switch it to work like this:

s3-ocr start name-of-bucket path/to/one.pdf path/to/two.pdf

If you fail to provide any paths it will show an error message.

To OCR everything, use:

s3-ocr start name-of-bucket --all
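
A minimal sketch of that selection rule, to make the intended behavior concrete. This is not the actual s3-ocr code: `select_keys`, `UsageError` and the `list_bucket` callback are hypothetical names, and filtering to `.pdf` keys and letting `--all` win when paths are also given are both assumptions.

```python
from typing import Callable, Iterable, List


class UsageError(Exception):
    """Raised when neither explicit paths nor --all is given (hypothetical)."""


def select_keys(
    paths: Iterable[str],
    all_keys: bool,
    list_bucket: Callable[[], Iterable[str]],
) -> List[str]:
    """Decide which keys to OCR.

    paths       -- keys passed on the command line
    all_keys    -- True when --all was passed
    list_bucket -- callback returning every key in the bucket (hypothetical)
    """
    if all_keys:
        # Assumption: --all takes precedence and only PDF keys are wanted.
        return [key for key in list_bucket() if key.lower().endswith(".pdf")]
    paths = list(paths)
    if not paths:
        # No implicit "OCR the whole bucket" default any more.
        raise UsageError("Specify one or more paths, or use --all")
    return paths
```

The bucket is only listed when `--all` is passed, so the expensive path is always an explicit opt-in.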

simonw · 2022-06-30T00:17:50Z

This will replace the work I did in:

simonw added the enhancement New feature or request label Jun 30, 2022

simonw closed this as completed in 0444883 Jun 30, 2022

simonw added a commit that referenced this issue Jun 30, 2022

Use s3 fixture, refs #10

702f0f3