Skip to content

aisingapore/llm-benchmarks

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 

Repository files navigation

llm-benchmarks

Benchmarking LLM for Malay tasks, HuggingFace space at https://huggingface.co/spaces/mesolitica/malay-llm-leaderboard

📈 We evaluate models based on 3 datasets,

  1. BM-PT3 Paper 1, contains 54 questions, https://github.com/mesolitica/malaysian-dataset/tree/master/llm-benchmark/BM-pt3
  2. Tatabahasa, contains 349 questions, https://github.com/mesolitica/malaysian-dataset/tree/master/llm-benchmark/tatabahasabm.tripod.com

Contributions

  1. Claude Tatabahasa contributed by https://www.linkedin.com/in/fahim-surani

About

Benchmarking LLM for Malay tasks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.0%
  • Python 1.0%