From 0cc244dab1c961ab41eaeb3eaad946753bd7044f Mon Sep 17 00:00:00 2001 From: Simon Willison Date: Thu, 30 Jun 2022 13:52:42 -0700 Subject: [PATCH] Link to live demo, closes #16 --- README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/README.md b/README.md index 88680c7..acba429 100644 --- a/README.md +++ b/README.md @@ -13,6 +13,10 @@ Install this tool using `pip`: pip install s3-ocr +## Demo + +You can see the results of running this tool against three PDFs from the Internet Archive ([one](https://archive.org/details/unmaskingrobert00houdgoog), [two](https://archive.org/details/practicalmagicia00harr), [three](https://archive.org/details/latestmagicbeing00hoff)) in [this example table](https://s3-ocr-demo.datasette.io/pages/pages?_facet=path#facet-path) hosted using [Datasette](https://datasette.io/). + ## Starting OCR against PDFs in a bucket The `start` command takes a list of keys and submits them to [Textract](https://aws.amazon.com/textract/) for OCR processing.