Skip to content

v2021.03.04.1

Compare
Choose a tag to compare
@j0ma j0ma released this 04 Mar 23:15
· 91 commits to main since this release
55afb25

This is the first complete release of the ParaNames corpus, containing parallel entity names in over 400 languages.
This release also contains source code and instructions for re-creating the corpus from a raw Wikidata JSON dump.

The specific Wikidata JSON dump this release was generated from is wikidata-20220110-all.json.

For notes about the data format and caveats, see the README for this version.

Collaborators

@j0ma
@ConstantineLignos