
Feat: implemented human evaluations #2047

Conversation

ashrafchowdury
Collaborator

Description

Implemented human evaluations on the evaluations page by reusing the SingleModelEvalOverview.tsx and AbTestingEvalOverview.tsx components from the overview page. The components were made conditionally adjustable so they can be used on both the overview and evaluations pages.
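The "conditionally adjustable" reuse described above can be sketched as a component that switches behavior on a single prop. This is only an illustration of the pattern, not code from the PR: the names ViewType, EvalOverviewProps, and getViewConfig are hypothetical.

```typescript
// Hypothetical sketch: one overview component serves both the overview
// and evaluations pages by switching on a prop. Names are illustrative
// and do not come from this PR.
type ViewType = "overview" | "evaluation";

interface EvalOverviewProps {
  viewType: ViewType;
}

// Derive page-specific behavior from a single prop so a component like
// SingleModelEvalOverview can be reused on both pages.
function getViewConfig(viewType: ViewType) {
  return {
    // the overview page might show a compact summary,
    // while the evaluations page shows the full table
    compact: viewType === "overview",
    showFullTable: viewType === "evaluation",
  };
}
```

The component itself would then read `props.viewType` and spread the resulting config into its rendering logic.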


vercel bot commented Aug 30, 2024

The latest updates on your projects:

Name                  Status    Updated (UTC)
agenta                ✅ Ready  Sep 3, 2024 11:55am
agenta-documentation  ✅ Ready  Sep 3, 2024 11:55am

@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. Frontend labels Aug 30, 2024
@ashrafchowdury ashrafchowdury requested a review from bekossy August 30, 2024 16:29
Member

@bekossy bekossy left a comment


Thanks for the work on this PR. Here are some changes to address:

  1. Let's make it clear that both SingleModelEvaluation and AbTestingEvaluation are reusable components, and import them where needed.
  2. The "Compare" button in AB Testing should navigate to the evaluation compare view.
  3. There are currently no new changes in the Automatic Evaluation view; is that expected?
  4. Please resolve the merge conflicts.

@dosubot dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Sep 2, 2024
@ashrafchowdury
Collaborator Author

Changes summary:

  • Made it clear that both SingleModel and AbTesting are reusable components.
  • Removed the compare button from SingleModel and AbTesting after discussing with Ahmed.
  • Made some UI changes in the 'Create Human Eval Model'.
  • Resolved the merge conflicts.

Member

@bekossy bekossy left a comment


Let's also remove the previous Annotation code:
http://localhost/apps/APP_ID/annotations/single_model_test should not be a valid path.

Please also fix the failing Cypress tests.
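Once the old Annotation pages are removed, stale links like the one above could be mapped to the new evaluations route instead of 404ing. A minimal sketch, assuming the path shapes from the URL quoted above; redirectLegacyAnnotationPath is a hypothetical helper, not code from this PR:

```typescript
// Hypothetical helper: map a removed /annotations/... path to the new
// evaluations page. The route shapes are assumptions based on the URL
// quoted in the review comment above.
function redirectLegacyAnnotationPath(path: string): string | null {
  const match = path.match(/^\/apps\/([^/]+)\/annotations(\/.*)?$/);
  if (!match) return null; // not a legacy annotation URL, leave it alone
  return `/apps/${match[1]}/evaluations`;
}
```

Such a mapping could be wired into the app's routing or redirect configuration so old bookmarks keep working.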

@dosubot dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Sep 3, 2024
@ashrafchowdury
Collaborator Author

ashrafchowdury commented Sep 3, 2024

Changes summary:

  • Removed the single_model_test.tsx and human_a_b_testing.tsx pages.
  • Removed the AutomaticResultsEvaluation.tsx and HumanEvaluationResults.tsx components.
  • Adjusted the Cypress tests to the new changes.
  • Made a minor UI update in the 'create new human eval model'.

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Sep 3, 2024
@bekossy bekossy merged commit 60d237e into AGE-587/-implement-evaluation-main-page Sep 3, 2024
5 of 6 checks passed
@bekossy bekossy deleted the feat/implement-human-evaluations branch September 3, 2024 12:00
@bekossy bekossy restored the feat/implement-human-evaluations branch September 3, 2024 12:12