This repository contains a comprehensive collection of resources related to OCR (Optical Character Recognition) and Document AI, such as papers, datasets, and APIs.
2025.01.05
Include papers that have been published in 2023 and 2024.
TODO
- HCIILAB Scene-Text-Detection. https://github.com/HCIILAB/Scene-Text-Detection
- HCIILAB Scene-Text-Recognition. https://github.com/HCIILAB/Scene-Text-Recognition
- HCIILAB Scene-Text-End2end. https://github.com/HCIILAB/Scene-Text-End2end
- A general list of resources to image text localization and recognition. https://github.com/whitelok/image-text-localization-recognition
- A curated list of resources dedicated to scene text localization and recognition. https://github.com/chongyangtao/Awesome-Scene-Text-Recognition
- A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods. https://github.com/hwalsuklee/awesome-deep-text-detection-recognition
- Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized. https://github.com/Jyouhou/SceneTextPapers
- Links to awesome OCR projects. https://github.com/kba/awesome-ocr
- A curated list of promising OCR resources. https://github.com/wanghaisheng/awesome-ocr
- Z. Chen, W. Wang, et al. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling. In ArXiv, 2024.