Releases: senisioi/enntt-release
Dialectal Varieties Experiments
v2.1 fix url
ACL 2016 release
LREC 2016 experiments dataset
Europarl corpus of native, non-native and translated texts - ENNTT
Also available: http://nlp.unibuc.ro/resources.html
This is a monolingual English corpus of native, non-native, and (human) translated texts extracted from the proceedings of the European Parliament. The translated texts, which come from different source languages, are a subset of the Haifa Corpus of Translationese. We preserved the same annotation style and included an ID and the EU member state that each member of the European Parliament represents. We hope this dataset will facilitate a unified comparative study of translations and language produced by highly fluent non-native speakers, two closely related phenomena that have so far been studied only in isolation.
Raw dataset here.