
feat: add RankingQuestion in the Python client #3275

Merged 20 commits from feat/add-ranking-question into develop on Jun 28, 2023

Conversation

@alvarobartt (Member) commented Jun 27, 2023

Description

This PR adds the RankingQuestion, introduced in #3232, to the Python client, so that users can create datasets with RankingQuestions and submit responses for them.

Usage

import argilla as rg

rg.init(
    api_url="<ARGILLA_API_URL>",
    api_key="<ARGILLA_API_KEY>",
)

ds = rg.FeedbackDataset(
    fields=[
        rg.TextField(name="prompt-1"),
        rg.TextField(name="prompt-2"),  
    ],
    questions=[
        rg.RankingQuestion(
            name="prompt-ranking",
            description="Rank the prompts from most to least natural.",
            required=True,
            values=["prompt-1", "prompt-2"],
        ),
    ],
)

ds.add_records(
    [
        rg.FeedbackRecord(
            fields={
                "prompt-1": "Explain to a broad audience why banana bread is so fluffy.",
                "prompt-2": "Explain banana banana banana.",
            },
            responses=[
                {
                    "values": {
                        "prompt-ranking": {"value": [{"value": "prompt-1", "rank": 1}, {"value": "prompt-2", "rank": 2}]},
                    },
                    "status": "submitted"
                },
            ],
        ), 
    ]
)

ds.push_to_argilla(name="new-dataset", workspace="new-workspace")
ds = rg.FeedbackDataset.from_argilla(name="new-dataset", workspace="new-workspace")

ds.push_to_huggingface(repo_id="<HUGGINGFACE_REPO_ID>", token="<HUGGINGFACE_TOKEN>")
ds = rg.FeedbackDataset.from_huggingface(repo_id="<HUGGINGFACE_REPO_ID>", token="<HUGGINGFACE_TOKEN>")
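
As a side note (not part of the PR), the ranking payload can also be built programmatically instead of being written by hand. The sketch below only reuses the response structure shown in the example above; the variable names are illustrative and not part of the Argilla client API.

# Hypothetical sketch: build the "prompt-ranking" response value from an
# ordered list of field names. This mirrors the dict structure of the usage
# example above; `ranked_prompts` and `ranking_value` are illustrative names.
ranked_prompts = ["prompt-1", "prompt-2"]  # ordered from most to least natural
ranking_value = {
    "value": [
        {"value": prompt, "rank": position}
        for position, prompt in enumerate(ranked_prompts, start=1)
    ]
}

responses = [{"values": {"prompt-ranking": ranking_value}, "status": "submitted"}]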

Type of change

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested

  • Added unit tests to cover the new RankingQuestion

Checklist

  • I added relevant documentation
  • My code follows the style guidelines of this project
  • I did a self-review of my code
  • I made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • I filled out the contributor form (see text above)
  • I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

@gabrielmbmb (Member) left a comment

LGTM! Just two minor comments

Review threads (resolved):
  • src/argilla/client/feedback/dataset.py (outdated)
  • src/argilla/client/feedback/schemas.py

codecov bot commented Jun 27, 2023

Codecov Report

Patch coverage: 84.03% and project coverage change: -0.86 ⚠️

Comparison is base (51751ac) 90.91% compared to head (32555c9) 90.05%.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #3275      +/-   ##
===========================================
- Coverage    90.91%   90.05%   -0.86%     
===========================================
  Files          215      233      +18     
  Lines        11304    12410    +1106     
===========================================
+ Hits         10277    11176     +899     
- Misses        1027     1234     +207     
Flag Coverage Δ
pytest 90.05% <84.03%> (-0.86%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
src/argilla/__init__.py 86.66% <ø> (+3.33%) ⬆️
...illa/client/feedback/training/frameworks/openai.py 0.00% <0.00%> (ø)
...rgilla/client/feedback/training/frameworks/peft.py 0.00% <0.00%> (ø)
...client/feedback/training/frameworks/span_marker.py 0.00% <0.00%> (ø)
src/argilla/server/contexts/datasets.py 96.01% <ø> (ø)
src/argilla/server/seeds.py 0.00% <ø> (ø)
src/argilla/tasks/users/create.py 91.11% <ø> (-4.45%) ⬇️
src/argilla/training/autotrain_advanced.py 0.00% <0.00%> (ø)
src/argilla/training/peft.py 0.00% <0.00%> (ø)
src/argilla/training/openai.py 42.66% <50.00%> (+0.20%) ⬆️
... and 60 more

... and 5 files with indirect coverage changes


@alvarobartt (Member Author)

The failing tests are unrelated (as is usually the case). I guess we could work on a PR to add HTTP retries, or split the integration tests from the unit tests, so the CI/CD runs stop failing so often. WDYT @frascuchon @gabrielmbmb?
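
For reference, a minimal sketch of what "HTTP retries" could look like on the client side, using the standard requests/urllib3 retry machinery; this is not Argilla's actual client code and all parameter values are illustrative.

# Hypothetical sketch: retry transient HTTP failures with requests + urllib3.
# Not Argilla's client implementation; parameters are illustrative only.
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

retries = Retry(total=3, backoff_factor=0.5, status_forcelist=[502, 503, 504])
session = requests.Session()
session.mount("http://", HTTPAdapter(max_retries=retries))
session.mount("https://", HTTPAdapter(max_retries=retries))

# Requests made through `session` now retry transient server errors with backoff.
response = session.get("<ARGILLA_API_URL>")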

... labels={"cat-1": "Category 1", "cat-2": "Category 2"},
... required=False,
... visible_labels=4
... ),
Member

Do we want to add an rg.RankingQuestion here? It's the only one missing.
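
A hypothetical sketch of the entry the comment refers to, mirroring the doctest-style example quoted above; the name, description, and values are illustrative only.

... rg.RankingQuestion(
...     name="ranking",  # illustrative name, not taken from the PR
...     description="Rank the options from most to least preferred.",
...     required=False,
...     values=["option-1", "option-2"],
... ),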

@alvarobartt (Member Author)

Indeed, I think we should shorten the rest of them instead: there is so much information that it ends up being longer than the actual code and hard to navigate. I think we can tackle the docstrings in the next release with a clearer approach.

@alvarobartt alvarobartt added this to the v1.12.0 milestone Jun 28, 2023
@gabrielmbmb gabrielmbmb merged commit c1f7aac into develop Jun 28, 2023
@gabrielmbmb gabrielmbmb deleted the feat/add-ranking-question branch June 28, 2023 14:40