Create copy-all button #3888

Closed
benwbrum opened this issue Dec 22, 2023 · 1 comment · Fixed by #3894

Comments

@benwbrum
Owner

We should add a feature that copies HTR text in bulk into the transcription screen, similar to how Autolink applies links in bulk without saving.

Implementation ideas:

  • Strip ALTO to plaintext on the server side, using the same code we use for loading OCR text from files or IIIF manifests.
  • Either render the plaintext in the form when the page loads or fetch it from a new controller action.
  • Append the plaintext into the editor.
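
As a rough illustration of the first step, here is a minimal Python sketch of stripping ALTO to plaintext. It is not the project's existing OCR-loading code: the function names `alto_to_plaintext` and `local_name` and the `SAMPLE` document are hypothetical, and the sketch assumes one output line per `TextLine`, a blank line between `TextBlock` elements, and that `SP` and `HYP` elements can be ignored.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Drop any '{namespace}' prefix so the same code handles different ALTO versions."""
    return tag.rsplit("}", 1)[-1]


def alto_to_plaintext(alto_xml: str) -> str:
    """Return one text line per ALTO TextLine, with a blank line between TextBlocks."""
    root = ET.fromstring(alto_xml)
    blocks = []
    for block in root.iter():
        if local_name(block.tag) != "TextBlock":
            continue
        lines = []
        for line in block.iter():
            if local_name(line.tag) != "TextLine":
                continue
            # Only String elements carry words; SP and HYP are skipped (assumption).
            words = [
                el.get("CONTENT", "")
                for el in line
                if local_name(el.tag) == "String"
            ]
            lines.append(" ".join(words))
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks)


# Hypothetical sample input, for illustration only.
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v4#">
  <Layout><Page><PrintSpace>
    <TextBlock>
      <TextLine><String CONTENT="Dear"/><SP/><String CONTENT="Sir,"/></TextLine>
      <TextLine><String CONTENT="I"/><SP/><String CONTENT="write"/></TextLine>
    </TextBlock>
    <TextBlock>
      <TextLine><String CONTENT="Yours"/></TextLine>
    </TextBlock>
  </PrintSpace></Page></Layout>
</alto>"""

print(alto_to_plaintext(SAMPLE))
```

The resulting string is what would either be rendered into the form or returned by the new controller action, then appended to the editor contents on the client.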

We can also experiment with a cleanup pass (either dictionary-based or ChatGPT-based) that deletes or replaces the readings it flags as conflicts, making it easier for users to find bad readings.
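
One possible shape for the dictionary-based variant, sketched in Python under stated assumptions: the function name `flag_unknown_words`, the `[?]` placeholder, and the tiny word list are all hypothetical, and a real cleanup would need a period-appropriate dictionary and some handling of names and abbreviations.

```python
import re


def flag_unknown_words(text: str, known_words: set[str], placeholder: str = "[?]") -> str:
    """Replace every word not found in known_words with a visible placeholder."""

    def check(match: re.Match) -> str:
        word = match.group(0)
        # Case-insensitive lookup; punctuation and digits are never matched.
        return word if word.lower() in known_words else placeholder

    # [^\W\d_]+ matches runs of letters, including accented ones.
    return re.sub(r"[^\W\d_]+", check, text)


# Hypothetical word list, for illustration only.
KNOWN = {"dear", "sir", "i", "today"}
print(flag_unknown_words("Dear Sir, I wrlte today", KNOWN))
```

Replacing the suspect word with a placeholder rather than silently deleting it leaves a marker in the editor that the transcriber can search for.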

@benwbrum
Owner Author

For user testing, we should explore the following variations:

  • Copy all button vs. initial population (à la current OCR correction flow)
  • Plaintext from ALTO vs. diff/normalized plaintext with placeholders marking readings to fix.

benwbrum added a commit that referenced this issue Dec 30, 2023
benwbrum added a commit that referenced this issue Jan 4, 2024
benwbrum added a commit that referenced this issue Jan 4, 2024
benwbrum added a commit that referenced this issue Jan 4, 2024
benwbrum added a commit that referenced this issue Jan 5, 2024