Merge Transkribus PAGE with OCR-D-Page #16

Open
M3ssman opened this issue Mar 3, 2022 · 2 comments
Labels
enhancement New feature or request

Comments

Collaborator

M3ssman commented Mar 3, 2022

Description

I'd like to have a mechanism that, besides transforming corrected Transkribus PAGE 2013, can also merge in information from the OCR-D-PAGE 2019 during the transformation.

Motivation

Our Transkribus-Import does its nasty transformations, but kindly stores the original OCR-D-PAGE in a subdirectory, because it thinks it is of PAGE 2010 origin (well, that is another story ...). But due to the XSLT, nearly all metadata is dropped, with only a few items kept.

To preserve the provenance data on processors and their parameters, it would be really helpful to re-integrate it at re-conversion time, if the data is available.
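To make the idea concrete, here is a rough sketch of what such a merge step could look like. It is only an illustration under several assumptions: both documents are already in the PAGE 2019 namespace at merge time, the provenance lives in `MetadataItem` children of `Metadata`, and the function name `merge_metadata_items` is hypothetical rather than part of any existing tool.

```python
import copy
import xml.etree.ElementTree as ET

# Assumption: both inputs use the PAGE 2019 namespace at merge time.
PAGE_2019 = "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15"
NS = {"pc": PAGE_2019}
ET.register_namespace("pc", PAGE_2019)


def merge_metadata_items(original_xml: str, converted_xml: str) -> str:
    """Copy MetadataItem elements from the stored original OCR-D-PAGE
    into the Metadata of the re-converted document (hypothetical helper)."""
    original = ET.fromstring(original_xml)
    converted = ET.fromstring(converted_xml)

    source_meta = original.find("pc:Metadata", NS)
    target_meta = converted.find("pc:Metadata", NS)
    if source_meta is None or target_meta is None:
        # Nothing to merge from or into: return the converted document as-is.
        return converted_xml

    # MetadataItem comes last inside Metadata in the 2019 schema,
    # so appending keeps the element order valid.
    for item in source_meta.findall("pc:MetadataItem", NS):
        target_meta.append(copy.deepcopy(item))

    return ET.tostring(converted, encoding="unicode")
```

If the original file is not available, the converted document would simply pass through unchanged, which matches the "if the data is available" condition above.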

M3ssman added the enhancement label Mar 3, 2022
Collaborator Author

M3ssman commented Mar 4, 2022

Here are some test materials. (To view the corresponding image, please go to urn:nbn:de:gbv:3:1-113523-p0442-2.)

urn+nbn+de+gbv+3+1-113523-p0442-2_ger.zip

Owner

kba commented Mar 4, 2022

If that's alright with you, let's discuss in detail in the next open tech call or have a call before depending on how pressing this is for you.
