From 444cac44a782896e59586162976d22642e936a27 Mon Sep 17 00:00:00 2001 From: Mahmoud Mabrouk Date: Sun, 22 Sep 2024 13:35:59 +0200 Subject: [PATCH] docs(app): changelog-22092024 changelog --- docs/blog/main.mdx | 22 ++++++++++++++++++++++ 1 file changed, 22 insertions(+) diff --git a/docs/blog/main.mdx b/docs/blog/main.mdx index 8326cf6ed4..4f77259937 100644 --- a/docs/blog/main.mdx +++ b/docs/blog/main.mdx @@ -2,7 +2,29 @@ title: "Changelog" --- +```mdx-code-block +import { Stream } from '@cloudflare/stream-react'; +``` +
+### Evaluator Testing Playground and a New Evaluation View + +_22 September 2024_ + +**v0.25.0** + + +
+Many users faced challenges configuring evaluators in the web UI. Some +evaluators, such as `LLM as a judge`, `custom code`, or RAG evaluators can be +tricky to set up correctly on the first try. Until now, users needed to setup, +run an evaluation, check the errors, then do it again. + +To address this, we've introduced a new evaluator test/debug playground. This feature allows you to test the evaluator live on real data, helping you test the configuration before committing to it and using it for evaluations. + +Additionally, we have improved and redesigned the evaluation view. Both automatic and human evaluations are now within the same view but in different tabs. We're moving towards unifying all evaluator results and consolidating them in one view, allowing you to quickly get an overview of what's working. + +--- ### UI Redesign and Configuration Management and Overview View