From c6dcd165b051b344152969c400210133d21a87f8 Mon Sep 17 00:00:00 2001 From: Mahmoud Mabrouk Date: Sun, 22 Sep 2024 13:35:59 +0200 Subject: [PATCH] docs(app): changelog-22092024 changelog --- docs/blog/main.mdx | 22 ++++++++++++++++++++++ 1 file changed, 22 insertions(+) diff --git a/docs/blog/main.mdx b/docs/blog/main.mdx index 8326cf6ed4..1960246c37 100644 --- a/docs/blog/main.mdx +++ b/docs/blog/main.mdx @@ -2,7 +2,29 @@ title: "Changelog" --- +```mdx-code-block +import { Stream } from '@cloudflare/stream-react'; +``` +
+### Evaluator Testing Playground and a New Evaluation View + +_22 September 2024_ + +**v0.25.0** + + +
+Many users faced challenges configuring evaluators in the web UI. +Some evaluators, such as `LLM as a judge`, `custom code`, or RAG evaluators, can +be tricky to set up correctly on the first try. Until now, users needed to set +up an evaluator, run an evaluation, check the errors, then repeat the cycle. + +To address this, we've introduced a new evaluator test/debug playground. This feature allows you to test the evaluator live on real data, helping you fine-tune the configuration before committing to it and using it for evaluations. + +Additionally, we have improved and redesigned the evaluation view. Both automatic and human evaluations are now within the same view but in different tabs. We're moving towards unifying all evaluator results and consolidating them in one view, allowing you to quickly get an overview of what's working. + +--- ### UI Redesign and Configuration Management and Overview View