Known Corpuses

Here we provide links to known text corpuses for specific African languages.

Parallel

English-to-Target

| Language | Source |
| --- | --- |
| Zulu | Autshumato Corpus |
| Setswana | Autshumato Corpus |
| Xitsonga | Autshumato Corpus |
| Northern-Sotho | Autshumato Corpus |
| Afrikaans | Autshumato Corpus |
| Xhosa | Navy Corpus |

Monolingual

| Language | Source |
| --- | --- |
| Zulu | Zulu Wikipedia |
| Zulu | NCHLT isiZulu Text Corpus |
| Zulu | University of Leipzig Zulu Corpora |
| Zulu | isiZulu National Corpus (currently not available) |
| Zulu | African Speech Technology |
| Zulu | Zulu Bible (to be scraped) |
| Zulu | Zulu Quran (to be scraped) |