Skip to content

Open NLU and NLG datasets created within the Latvian Language Technology Initiative

Notifications You must be signed in to change notification settings

LUMII-AILab/VTI-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VTI-Data

NLU and NLG datasets developed within the Latvian Language Technology Initiative

  1. Alpaca Latvian dataset

    ALPACA-LV is a machine translated Alpaca instruction dataset for Latvian.

  2. COPA

    COPA is a machine translated COPA benchmark dataset for Latvian.

  3. MMLU

    MMLU is a machine translated MMLU benchmark dataset for Latvian. The sociology_postedited.json file contains a post-edited collection of the first 100 tasks in the sociology subject.

  4. LV-exams

    Multiple-choice questions (MCQ) from Latvian Centralized High School Exams.

About

Open NLU and NLG datasets created within the Latvian Language Technology Initiative

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published