-
-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a live demo #16
Comments
I added a new |
I need some PDF files! Internet Archive has a bunch of interesting ones that are out of copyright and that demonstrate hand-writing. |
I'm going to grab some of the PDFs from https://archive.org/search.php?query=creator%3A%22Harry+Houdini+Collection+%28Library+of+Congress%29+DLC%22 - "Harry Houdini Collection (Library of Congress) DLC" |
I grabbed those PDFs and uploaded them to the bucket like this:
|
I started OCR like this:
|
I ran this to keep an eye on how it was going (I should upgrade
|
After a while it was done:
Then I ran this:
|
I'm not going to bother with GitHub Actions for this - I'm going to generate and deploy the demo from my laptop. I may automate this with GitHub Actions at a later date. |
Deployed to Vercel:
I added a custom domain to it too (since It is now live at https://s3-ocr-demo.datasette.io/ |
The demo can run in GitHub Actions, against a demo S3 bucket created for the purpose.
I'll deploy the resulting database file using Datasette, at
s3-ocr-demo.datasette.io
.The text was updated successfully, but these errors were encountered: