adamlin120/TCEval

 
 

TCEval v2

Install

cd lm-evaluation-harness_mr-revised
pip3 install -e ".[vllm]"
pip3 install -U vllm
cd ..
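
After the install steps above, a quick sanity check can confirm that the packages are visible to the Python interpreter. This is a minimal sketch and not part of the repository: the helper name `check_installed` is hypothetical, and it assumes the editable install exposes the harness under the upstream module name `lm_eval`.

```python
import importlib.util


def check_installed(modules):
    """Return {module_name: bool} telling whether each top-level module can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    # Assumed module names: `lm_eval` (the harness) and `vllm`.
    for name, ok in check_installed(["lm_eval", "vllm"]).items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If either module is reported as missing, rerun the install commands from inside the harness directory.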

Evaluate Local Models (MMLU, TMMLU+, and Penguin_Table)

Please refer to the examples.

Evaluate API Models (MMLU, TMMLU+, and Penguin_Table)

Please see scripts/cal_likelihood_by_api.py.
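
The script itself is not reproduced here. As a generic illustration of likelihood-based multiple-choice scoring (the approach the script name suggests, and the one lm-evaluation-harness uses for multiple-choice tasks), the sketch below picks the option with the highest log-likelihood and computes accuracy. The function names and the sample numbers are hypothetical and are not taken from `scripts/cal_likelihood_by_api.py`.

```python
def pick_answer(option_logprobs):
    """Return the option label whose continuation has the highest log-likelihood."""
    return max(option_logprobs, key=option_logprobs.get)


def accuracy(predictions, gold):
    """Fraction of questions where the predicted label equals the gold label."""
    if not gold or len(predictions) != len(gold):
        raise ValueError("predictions and gold must be non-empty and of equal length")
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)


# Made-up log-likelihoods for two four-option questions (illustration only).
scores = [
    {"A": -4.2, "B": -1.3, "C": -3.8, "D": -5.0},
    {"A": -0.9, "B": -2.7, "C": -2.1, "D": -3.3},
]
preds = [pick_answer(s) for s in scores]
print(preds, accuracy(preds, ["B", "C"]))
```

With an API model, the per-option log-likelihoods would come from the provider's responses; how the repository's script obtains them is not shown in this README.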

Evaluate MTBench-tw

Please refer to the linked MTBench-tw instructions.
