Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Don't default to OCRing everything #10

Closed
simonw opened this issue Jun 30, 2022 · 1 comment
Closed

Don't default to OCRing everything #10

simonw opened this issue Jun 30, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@simonw
Copy link
Owner

simonw commented Jun 30, 2022

OCRing everything is a dangerous default: if someone runs this against a bucket with 10,000s of PDFs it could cost them a lot of money.

I'm going to switch it to working like this:

s3-ocr start name-of-bucket path/to/one.pdf path/to/two.pdf

If you fail to provide any paths it will show an error message.

To OCR everything, use:

s3-ocr start name-of-bucket --all
@simonw simonw added the enhancement New feature or request label Jun 30, 2022
@simonw
Copy link
Owner Author

simonw commented Jun 30, 2022

This will replace the work I did in:

@simonw simonw closed this as completed in 0444883 Jun 30, 2022
simonw added a commit that referenced this issue Jun 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant