BLEU with a variable number of references #130
It seems that this feature is often needed (e.g. #69), and the current workaround (duplicating another reference for a given segment) is neither user-friendly nor efficient, so I would welcome such a PR.
File format
Currently, sacreBLEU supports both a single file with tab-separated references and multiple files (each reference in a single file). The former format currently needs Do we want to allow a variable number of references also in the latter format? That would mean e.g. re-defining an empty string as a missing reference (which should not affect the brevity penalty). How would someone then represent a real empty segment as a reference? (Suppose there is a language where the best translation of "Um, heh." is to omit it.) We could disambiguate the two use cases by special options such as
The Python API
Currently, we have
Thanks for such a quick and detailed answer! I'll try to implement it then 🙂! I was actually only considering the Python API so far (for the upcoming GEM NLG shared task/benchmark), so having the possibility just via the API and not exposed on the command line would be fine for my use case. I agree that representing missing references with
Re. Command line/file format: I think that allowing variable numbers of references just in the tab-separated format is fine. Allowing a range for
Regardless of whether we want to support the multi-file format or not, would it make sense to represent real empty references with a space character instead of a truly empty line/tab column, or is that too obscure?
Hello Thanks for the issue and the PR! I am starting to think that all of these should be discussed to allow for a well-defined API, instead of yet another processing trick through Overall, I think that the range of possibilities for the
@ozancaglayan: I think this issue (variable number of references for different sentences) is almost* orthogonal to the *) There is an option to change the
Okay then. Let's also think about adding documentation for this to the README file in the future.
The current implementation of BLEU in SacreBLEU doesn't allow a variable number of references (different segments having a different number of references). Some NLG datasets (e.g. E2E, WebNLG) do have a variable number of references and it would be great if SacreBLEU worked on them.
Is that a design decision, or would you be open to a PR that would enable this?
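For illustration, the duplication workaround mentioned in the comments can be sketched roughly as follows. This is only a minimal sketch, not part of sacreBLEU: the helper name `pad_references` is hypothetical, and it assumes every segment has at least one reference. It pads each segment's reference list by repeating its first reference and then transposes the result into equal-length reference streams (one list per reference position).

```python
from typing import List, Sequence


def pad_references(refs_per_segment: Sequence[Sequence[str]]) -> List[List[str]]:
    """Hypothetical helper (not a sacreBLEU function).

    Takes one list of references per segment, pads shorter lists by
    repeating the segment's first reference, and returns reference
    streams: stream i holds the i-th reference of every segment.
    """
    if not refs_per_segment:
        return []
    if any(len(refs) == 0 for refs in refs_per_segment):
        raise ValueError("every segment needs at least one reference")

    max_refs = max(len(refs) for refs in refs_per_segment)
    # Duplicate the first reference until every segment has max_refs entries.
    padded = [
        list(refs) + [refs[0]] * (max_refs - len(refs))
        for refs in refs_per_segment
    ]
    # Transpose from per-segment lists to per-position streams.
    return [list(stream) for stream in zip(*padded)]


# Two segments: the first has two references, the second only one.
streams = pad_references([
    ["the cat sat", "a cat was sitting"],
    ["hello world"],
])
```

The resulting streams could then be passed as the second argument to `sacrebleu.corpus_bleu(hypotheses, streams)`, which is exactly the extra preprocessing step this issue asks to make unnecessary.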