Welcome to Anthropic's comprehensive prompt evaluations course. Across nine lessons, you will learn everything you need to know to implement evaluations successfully in your workflows with the Anthropic API. We recommend that you start from the beginning with the Evaluations 101 lesson, as each lesson builds on key concepts taught in previous ones.
- Evaluations 101
- Writing human-graded evals with Anthropic's Workbench
- Writing simple code-graded evals
- Writing a classification eval
- Promptfoo for evals: an introduction
- Writing classification evals with promptfoo
- Custom graders with promptfoo
- Model-graded evals with promptfoo
- Custom model-graded evals with promptfoo