CharCut: another character-based MT evaluation metric #290

BramVanroy · 2022-09-11T09:39:10Z

Similar to CharacTER, CharCut implements a character-based evaluation metric. First proposed in CHARCUT: Human-Targeted Character-Based MT Evaluation with Loose Differences.

Specifically, this implementation uses the repackaged version of the original for usability reasons.

HuggingFaceDocBuilderDev · 2022-09-11T09:42:13Z

The documentation is not available anymore as the PR was closed or merged.

No verbosity anymore

BramVanroy · 2022-12-06T10:45:58Z

@lvwerra I fixed the doctest, and also updated the underlying charcut library so that we do not get annoying outputs printed to stdout anymore.

lvwerra

Just a few minor things, then it's good to go 🚀

metrics/charcut_mt/.gitattributes

metrics/charcut_mt/requirements.txt

setup.py

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

lvwerra · 2022-12-07T13:14:13Z

Could you also merge main into your branch again, the new CI is merged :)

BramVanroy · 2022-12-08T11:25:52Z

@lvwerra So I had a look to convert this to a multi-reference format, but I am not sure how to handle this. CharCUT is calculated on the document-level. So I do not think it is feasible to add multiple references here.

lvwerra · 2022-12-08T11:52:29Z

Ok, then let's leave it as is.

lvwerra

Looks good, you could also delete the tests here.

BramVanroy · 2022-12-08T12:21:21Z

Done!

Bram Vanroy added 2 commits September 11, 2022 11:36

add charcut

b930d6d

make style

4e07652

Bram Vanroy added 5 commits September 12, 2022 10:41

add charcut dependency

52e6f3f

Merge branch 'huggingface:main' into charcut

3256938

fix doctest example

ae6647a

update charcut to v1.1.1

34a1b5d

No verbosity anymore

fix doctest

c8e205b

lvwerra reviewed Dec 6, 2022

View reviewed changes

metrics/charcut_mt/.gitattributes Outdated Show resolved Hide resolved

metrics/charcut_mt/requirements.txt Outdated Show resolved Hide resolved

setup.py Outdated Show resolved Hide resolved

Bram Vanroy and others added 4 commits December 6, 2022 12:31

Update metrics/charcut_mt/requirements.txt

a466f18

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

Update setup.py

1576758

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

Delete .gitattributes

2a384b8

fix features: always working on strings as samples

7733098

lvwerra mentioned this pull request Dec 6, 2022

CharacTER: MT metric #286

Merged

BramVanroy added 3 commits December 7, 2022 15:10

Merge remote-tracking branch 'upstream/main' into charcut

ec8bf1a

Merge branch 'main' into charcut

1abe226

add typing

332c5a7

lvwerra approved these changes Dec 8, 2022

View reviewed changes

Bram Vanroy added 3 commits December 8, 2022 13:16

Delete tests.py

bb745b7

Update charcut_mt.py

6e62bf9

make quality

5ac8f36

Merge branch 'main' into charcut

22abf09

lvwerra merged commit 83129c0 into huggingface:main Dec 8, 2022

BramVanroy deleted the charcut branch December 8, 2022 15:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CharCut: another character-based MT evaluation metric #290

CharCut: another character-based MT evaluation metric #290

BramVanroy commented Sep 11, 2022

HuggingFaceDocBuilderDev commented Sep 11, 2022 •

edited

Loading

BramVanroy commented Dec 6, 2022

lvwerra left a comment

lvwerra commented Dec 7, 2022

BramVanroy commented Dec 8, 2022

lvwerra commented Dec 8, 2022

lvwerra left a comment

BramVanroy commented Dec 8, 2022

CharCut: another character-based MT evaluation metric #290

CharCut: another character-based MT evaluation metric #290

Conversation

BramVanroy commented Sep 11, 2022

HuggingFaceDocBuilderDev commented Sep 11, 2022 • edited Loading

BramVanroy commented Dec 6, 2022

lvwerra left a comment

Choose a reason for hiding this comment

lvwerra commented Dec 7, 2022

BramVanroy commented Dec 8, 2022

lvwerra commented Dec 8, 2022

lvwerra left a comment

Choose a reason for hiding this comment

BramVanroy commented Dec 8, 2022

HuggingFaceDocBuilderDev commented Sep 11, 2022 •

edited

Loading