Prompt evaluations

Welcome to Anthropic's comprehensive prompt evaluations course. Across nine lessons, you will learn everything you need to know to implement evaluations successfully in your workflows with the Anthropic API. We recommend that you start from the beginning with the Evaluations 101 lesson, as each lesson builds on key concepts taught in previous ones.

Evaluations 101
Writing human-graded evals with Anthropic's Workbench
Writing simple code-graded evals
Writing a classification eval
Promptfoo for evals: an introduction
Writing classification evals with promptfoo
Custom graders with promptfoo
Model-graded evals with promptfoo
Custom model-graded evals with promptfoo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Prompt evaluations

Table of contents

Files

README.md

Latest commit

History

README.md

File metadata and controls

Prompt evaluations

Table of contents