snli_translated

German version of the Stanford Natural Language Inference (SNLI) data set, machine-translated using DeepL.

The training set has been downsampled to 100 000 examples, while the development and test sets were kept at their original sizes of 10 000 examples each. The format is similar to that of the original SNLI data set, which can be found at https://nlp.stanford.edu/projects/snli/. The constituency parses were created using the Stanford CoreNLP parser.
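
As a rough illustration of working with the data, the sketch below reads one split and counts examples per label. It assumes that each file is in JSON Lines format (one JSON object per line) and that the label is stored under `gold_label`, as in the original SNLI release; neither assumption is confirmed by this README, and the helper names `read_jsonl` and `label_counts` are hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def read_jsonl(path):
    """Yield one dict per non-empty line of a JSON Lines file."""
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def label_counts(path, label_key="gold_label"):
    """Count examples per label.

    label_key defaults to the field name used by the original SNLI files;
    that this translated version uses the same name is an assumption.
    """
    return Counter(record.get(label_key) for record in read_jsonl(path))
```

For example, `label_counts("snli_1.0_train_translated.jsonl")` would return a `Counter` keyed by label if the assumptions above hold. Records without the assumed key are counted under `None`, which makes a mismatch in field names easy to spot.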

If you use this data set in your research, please cite the following paper:

@inproceedings{cidm2019,
author = {Sifa, Rafet and Pielka, Maren and Ramamurthy, Rajkumar and Ladi, Anna and Hillebrand, Lars and Bauckhage, Christian},
year = {2019},
title = {Towards Contradiction Detection in German: A Translation-driven Approach},
booktitle={Proc. of IEEE SSCI 2019},
}

Files in this repository:

README.md
snli_1.0_test_translated.jsonl
snli_1.0_train_translated.jsonl
snli_1.0_val_translated.jsonl
