Skip to content

Latest commit

 

History

History
57 lines (44 loc) · 7.1 KB

2019-12-13 ALTO Board Meeting Minutes.md

File metadata and controls

57 lines (44 loc) · 7.1 KB

2019-12-13 ALTO Board Meeting Minutes

  1. Welcome [All]
  2. Find and tell a non-offensive, maybe self-deprecating joke before the meeting begins and/or after it ends. [All]
  3. Review recent schema issues:
  4. Spring Face-to-Face Meeting:
  5. Upcoming Board Expirations:
    • December 31, 2019: Raju and Stefan
    • December 31, 2020: Art, Ashok, Evelien, Jukka, and Ralph
    • December 31, 2021: Frederick, Nate, Matthew, and Ahmed Samir
  6. Other business. [All]

Attending members

  • Ahmed Samir
  • Ashok Popat
  • Art Rhyno
  • Christian Clausner
  • Ciprian Dinu
  • Frederick Zarndt
  • Jukka Kervinen
  • Nate Trail
  • Raju Buddharaju
  • Stefan Pletschacher

Minutes

wrt agenda item 3. Review recent schema issues:

  • Change BASELINE to accommodate a list of points in addition to a single point - as per the 2019-09-27 meeting, the issue has been resolved and Art will send a message to the group to confirm that this schema change is open for voting.

  • ALTO support for encoding OCR segmentation ambiguity and Confidence value calculation (CC - WC - PC) - annotation extension - Cip described a linkage he made between these two issues in Ashok's summary document, in that a lattice structure might provide more information for an ALTO processor to compare scores from multiple sources. Ashok noted that confidence metrics are usually rooted in specific and varied algorithms, and there is a need to define something that is simple and general enough to be applied in all cases. He described Google's approach has involved attaching output tokens to the character level and estimating the edit-distance cost to ground-truth. One of the ideas captured in the summary document is to use a heat map attached to the underlying image so that there is an indication of regions where text has not been identified. Christian asked whether linking the issues adds too much complexity given that the original question was how to specify confidence values. Frederick pointed out that standardizing the computation of confidence was rejected in the past because it would be seen as dictating to OCR providers, and that there was general agreement that the goal is to record the factors that led to the confidence scoring. Ashok suggested that one path forward is to leverage the unfolding lattice discussion. As a representation of an n-best list that is weighted, the lattice could provide confidence data that can be computed and help surface use cases, particulary for handwriting recognition. To this end, Art will arrange for a single-topic meeting on OCR segmentation ambiguity in January and invite Gundram Leifert (Transkribus) and Robert Sachunsky (OCR-D). This will also allow further exploration of the use of lattices and the confmat representation for segmentation, and could align with Ashok's availability in Zürich during the week of January 20, when he might be able to travel to Innsbruck or some other location to physically join the discussion with some of the participants.

  • ALTO & IIIF integration - the recent IIIF Text Granularity Extension might have some overlap with this issue. Art will link the extension to the github discussion.

  • Should FONTSIZE be required? - Christian identified the problem with requiring FONTSIZE in TextStyle as part of his work on ALTO - PAGE mapping . This requirement precludes other style attributes, such as FONTCOLOR, from being encoded if the font size can not be determined. Jukka confirmed that the requirement has been in place since 2009, and that this aspect may have been missed. The issue will be updated with syntax to change the FONTSIZE attribute to optional in the schema so that any concerns can be brought forward.

  • Restrict float attribute values where possible to allow for better xml-validation - Jukka agreed to be champion for this issue. Christian suggested flagging this for the next major release since it could break existing ALTO files, or to document valid float ranges. After some discussion, it was agreed that negative values were valid for rotations.

wrt agenda item 4. Spring Face-to-Face Meeting:

Frederick suggested another strong option for the Spring face-to-face meeting could be the 2020 IFLA International News Media Conference at Universidad Nacional Autónoma de México in Mexico City. There is no limit on f2f meetings if more than one Board member is attending an event, so it is possible that there could be two in the Spring. Stefan, Christian, Clemens and Ashok may be attending ICPR in Milan, Italy in September 2020, which would make it a good candidate for the Fall face-to-face meeting.

wrt agenda item 5. Upcoming Board Expirations:

Raju and Stefan have kindly agreed to continue to serve on the ALTO Board. Ashok anticipates that he will exchange his position on the Board at some point in the next year with someone else from Google. There will be an effort to identify a new Chair before the end of 2020 to allow some overlap and give some time for the transition. Art reiterated what a postive experience it has been for him and thanked Frederick for his mentorship.

The next full Board meeting is tentatively scheduled for Friday, February 14, 2020. The single-topic meeting will hopefully be some time in January if scheduling works out.