Skip to content

thammegowda/007-mt-eval-macro

Repository files navigation

Macro Average: Rare Types are Important Too

This repository contains all the tools, analysis and outcomes of our "Macro-Average: Rare Types Are Important Too" paper.

This repository has git submodules, so the correct way to clone this is:

git clone --recursive git@github.com:thammegowda/007-mt-eval-macro.git

You may also refer to .gitmodules file to see the URLs of submodules.

The organization of this directory is as follows:

  + justifs/ 
     + webnlg-human-evaluation/:  WebNLG task
     + wmt-metrics/ :  WMT metrics task
     + material/     :  CLIR task CLSSTS datasets
     + mtir-lignos-etal-2019/ : CLIR task Europarl dataset
     + bleurt-bias/   : showing model-based metrics e.g. BLEUT are biased
  +  snmt-vs-unmt/  : comparison between SNMT v/s UNMT

Please refer to paper for the role of each task. Each directory have scripts required to obtain the results.

Citation

@inproceedings{gowda-etal-2021-macro,
    title = "Macro-Average: Rare Types Are Important Too",
    author = "Gowda, Thamme  and
      You, Weiqiu  and
      Lignos, Constantine  and
      May, Jonathan",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.naacl-main.90",
    doi = "10.18653/v1/2021.naacl-main.90",
    pages = "1138--1157",
}```

About

MT evaluation, use macro-average because rare types are important too.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published