If you are interested in the dataset from the paper
Katrin Ortmann and Stefanie Dipper (2019). Variation between Different Discourse Types: Literate vs. Oral. In Proceedings of the NAACL-Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), pp. 64–79. Minneapolis, MN.
please contact us at ortmann@linguistics.rub.de or dipper@linguistics.rub.de.