layoutxlm

Star

Here are 3 public repositories matching this topic...

microsoft / unilm

Star

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Updated Dec 15, 2024
Python

mindspore-lab / mindocr

Star

A toolbox of ocr models and algorithms based on MindSpore

ocr deep-learning text-recognition text-detection layout-analysis crnn dbnet table-recognition mindspore key-information-extraction layoutxlm ocr-large-model tablemaster vary-toy

Updated Dec 17, 2024
Python

The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, where we are actively working. This repository is actively maintained, and new features are continuously being added.

biases synthetic-dataset-generation layoutlm synthetic-dataset layoutxlm token-classification layoutlmv3 layoutlmv2 llms-benchmarking

Updated Sep 6, 2024
Python

Improve this page

Add a description, image, and links to the layoutxlm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the layoutxlm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

layoutxlm

Here are 3 public repositories matching this topic...

microsoft / unilm

mindspore-lab / mindocr

nachoDRT / MERIT-Dataset

Improve this page

Add this topic to your repo