Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
-
Updated
Nov 28, 2024 - Python
Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
GlotCC Dataset and Pipline -- NeurIPS 2024
Meta_doc : basic document management (DM) system, with a search by metadata. Access to documents through a classification plan. The metadata categories as well as the types of relationships between documents are part of the Dublin Core repository. The ressource URL or attached documents URL are inside the field "text".
A translation frontend for the Global African Storybook Project
Multi-Encoder for ahk, text parsing, python, c++, java, and other languages like arabic, chinese, russian, or advanced text parsing for certain languages
Add a description, image, and links to the multlingual topic page so that developers can more easily learn about it.
To associate your repository with the multlingual topic, visit your repo's landing page and select "manage topics."