Releases: embeddings-benchmark/mteb

1.6.21

18 Apr 10:47

1.6.21 (2024-04-18)

Fix

  • fix: Add Croatian Sentiment Classification (#416)

  • add CroatianSentimentClassification

  • fix citation

  • actually fix citation

  • add results

  • add e5-base results

  • points!


Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com> (e7c0362)

Unknown

  • Fix: EstQA is now properly formulated as a retrieval task (#418)

  • fix: EstQA is now properly formulated

  • Ran linting (ae0da50)

1.6.20

18 Apr 10:08

1.6.20 (2024-04-18)

Documentation

  • docs: Update contributor information (#417)

Co-authored-by: rposwiata <rposwiata@opi.org.pl> (36de85e)

Fix

  • fix: add italian HateSpeech dataset (#385) (#420)

  • fix: add italian HateSpeech dataset (#385)

  • add italian HateSpeech dataset

  • add points

  • update dialect, socioeconomic status, domains and points

  • add PR review points

  • add task_domain for constructed data + rerun models

  • update points

  • minor fix

  • merge points from main

  • add review points to main


Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

  • run linting

Co-authored-by: Roberta Rocca <32483140+rbroc@users.noreply.github.com> (4ce7f35)

1.6.19

18 Apr 09:53

1.6.19 (2024-04-18)

Fix

  • fix: add custom load dataset function for MLSUM tasks (#405)

  • fix: add custom load dataset function

  • fix: fix linter

  • docs: added points


Co-authored-by: Imene Kerboua <imene.kerboua@esker.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com> (c549af2)

1.6.18

18 Apr 09:50

1.6.18 (2024-04-18)

Fix

  • fix: Added Hungarian Roma Tales bitext task and Romani Bible clustering task (#396)

  • Added Romani-Hungarian bitext task

  • Added results for Roma Tales

  • Added Romani Bible clustering task with results

  • Changed task subtype from None to [] in Roma Tales

  • Added points for Marton for datasets and Kenneth for review

  • Made Roma Tales a CrosslingualTask and readded results

  • style: Use rename_columns instead of rename_column

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • Ran linting

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com> (6f9d19f)

1.6.17

18 Apr 09:36

1.6.17 (2024-04-18)

Fix

  • fix: Id clickbait (#411)

  • add indonesian clickbait

  • Update points.md

  • update


Co-authored-by: Manan Dey <159106637+manandey-sfdx@users.noreply.github.com> (a0fa60b)

Unknown

  • Add Czech Subjectivity Classification Dataset (#413)

  • CzechSubjectivityClassification

  • add results

  • review comments

  • points

  • update results


Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com> (deebcb0)

  • Add 3 Arabic classification datasets (#410)

  • add ara-sarcasm dataset

  • add tweet emotion dataset

  • add hard dataset

  • fix formatting

  • Update mteb/tasks/Classification/ara/HotelReviewSentimentClassification.py

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • Update mteb/tasks/Classification/ara/HotelReviewSentimentClassification.py

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • Update mteb/tasks/Classification/ara/HotelReviewSentimentClassification.py

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • Update mteb/tasks/Classification/ara/TweetSarcasmClassification.py

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • add points and suggestions

  • Update points.md


Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com> (c874110)

  • Turkish multidomain product review data (#406)

  • Tur multidomain product review data

  • Update mteb/tasks/Classification/tur/TurkishProductSentimentClassification.py

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com>

  • point update

Co-authored-by: Imene Kerboua <33312980+imenelydiaker@users.noreply.github.com> (86db1b8)

  • Add Romanian Sentiment Classification Task (#404)

  • add RomanianSentimentClassification

  • add run results

  • e5-base results

  • add points

  • only sample test set

  • thanks Imene! (268324b)

  • Add points missing from PR#389 (#403)

add points missing from PR 389 (628beb4)

  • Adding Turkish Movie Sentiment Dataset (#389)

  • Adding Turkish Movie Sentiment Dataset

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update avg_character length and socioecon status

  • Updating licence

  • Update TurkishMovieSentimentClassification.py

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
    Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>

  • Update TurkishMovieSentimentClassification.py

  • Update TurkishMovieSentimentClassification.py

  • Create __init__.py

  • Update points.md

  • Update TurkishMovieSentimentClassification.py

  • Update points.md

  • Update TurkishMovieSentimentClassification.py

  • formatting

  • Update points.md

  • Update points.md

  • Update points.md


Co-authored-by: Isaac Chung <chungisaac1217@gmail.com> (b01382b)

1.6.16

17 Apr 14:08

1.6.16 (2024-04-17)

Documentation

  • docs: Added missing point for pr (#394) (a2d4705)

  • docs: Added missing point for pr (32d8979)

Fix

  • fix: Add BengaliHateSpeechClassification (#398)

  • Add BengaliHateSpeechClassification

  • Update mteb/tasks/Classification/ben/BengaliHateSpeechClassification.py

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

  • update scores

  • make lint

  • add points

  • Update points.md


Co-authored-by: Manan Dey <159106637+manandey-sfdx@users.noreply.github.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com> (2772ec8)

1.6.15

17 Apr 12:26

1.6.15 (2024-04-17)

Fix

  • fix: Add Macedonian Tweet Sentiment Classification Task (#392)

  • add MacedonianTweetSentimentClassification

  • add results from runs

  • edits from review

  • add points (206f2c9)

1.6.14

17 Apr 11:43

1.6.14 (2024-04-17)

Ci

  • ci: Ensure that linting fails when files are not linted (#394)

  • style: This should fail linting

  • ci: update linting to check if project is linted

  • style: running linting

This should pass tests

  • Add comment to ci rule to ensure that it isn't removed in the future (ff3cbfc)

Fix

  • fix: Add Dutch Book Review Sentiment Classification Task (#388)

  • add DutchBookReviewSentimentClassification and run models

  • add points

  • add name

  • minor edits from review

  • add multilingual-e5-base results

  • add pointer for reviewers (10d50b6)

1.6.13

17 Apr 09:17

1.6.13 (2024-04-17)

Fix

  • fix: Added HunSum2 dataset with results (#384)

  • Added HunSum2 dataset with results

  • Minor fixes to HunSum2

  • Updated points for Kenneth and Marton


Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com> (c65e1d7)

1.6.12

17 Apr 08:03

1.6.12 (2024-04-17)

Fix

  • fix: Added EstQA and Eesti Valentsikorpus datasets (#382)

  • Added EstQA dataset with results.

  • Added new points and e-mail address to x-tabdeveloping

  • Added Estonian Valence task + results

  • Added new points for Estonian Valence task

  • Addressed issues highlighted by Kenneth

  • Reran EstQA (282d421)