
Implement the LAMBADA evaluation #6

Closed
StellaAthena opened this issue Sep 16, 2020 · 4 comments · Fixed by #101
Labels: feature request (A feature that isn't implemented yet.)

Comments

@StellaAthena (Member) commented Sep 16, 2020

The LAMBADA dataset [PKL+16] tests the modeling of long-range dependencies in text – the model is asked to predict the last word of sentences which require reading a paragraph of context. It has recently been suggested that the continued scaling of language models is yielding diminishing returns on this difficult benchmark. [BHT+20] reflect on the small 1.5% improvement achieved by a doubling of model size between two recent state-of-the-art results ([SPP+19] and [Tur20]) and argue that “continuing to expand hardware and data sizes by orders of magnitude is not the path forward”. We find that path is still promising: in a zero-shot setting GPT-3 achieves 76% on LAMBADA, a gain of 8% over the previous state of the art.
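
For reference, a minimal sketch of the protocol described above, assuming the Hugging Face `transformers` package and GPT-2 as a stand-in model (neither is prescribed by this issue): the model reads the passage minus its final word and must reproduce that word exactly under greedy decoding.

```python
# Minimal LAMBADA-style check: greedy-decode the final word of a passage.
# Assumptions (not from this issue): Hugging Face `transformers`, GPT-2.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def lambada_correct(passage: str) -> bool:
    """True iff greedy decoding recovers the final word of `passage`."""
    context, _, target = passage.rpartition(" ")
    enc = tokenizer(context, return_tensors="pt")
    # Budget exactly as many tokens as the target word occupies.
    n_target = len(tokenizer(" " + target).input_ids)
    with torch.no_grad():
        out = model.generate(
            **enc,
            max_new_tokens=n_target,
            do_sample=False,  # greedy, per the zero-shot protocol
            pad_token_id=tokenizer.eos_token_id,
        )
    pred = tokenizer.decode(out[0, enc["input_ids"].shape[1]:]).strip()
    return pred == target

# accuracy = sum(map(lambada_correct, passages)) / len(passages)
```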

@StellaAthena added the feature request label Sep 16, 2020
@StellaAthena changed the title from LAMBDA to Implement the LAMBDA evaluation Sep 16, 2020
@cfoster0 (Contributor) commented Oct 5, 2020

The original dataset is available at the link below:

https://zenodo.org/record/2630551
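
A hedged sketch of pulling that record down; the archive and member file names below are assumptions about the Zenodo release, not details stated in this thread.

```python
# Fetch and unpack the original LAMBADA release from Zenodo.
# Assumed file names: lambada-dataset.tar.gz / lambada_test_plain_text.txt.
import tarfile
import urllib.request

URL = "https://zenodo.org/record/2630551/files/lambada-dataset.tar.gz"
urllib.request.urlretrieve(URL, "lambada-dataset.tar.gz")

with tarfile.open("lambada-dataset.tar.gz") as tar:
    tar.extractall("lambada")

# One passage per line; the final word of each is the prediction target.
with open("lambada/lambada_test_plain_text.txt", encoding="utf-8") as f:
    passages = [line.strip() for line in f if line.strip()]
print(len(passages))  # the test split should hold roughly 5k passages
```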

@sdtblck (Contributor) commented Oct 5, 2020

This is the dataset OpenAI used for evaluating GPT-2; see #39.
(reference: openai/gpt-2#131)
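
The OpenAI-prepared variant circulates as a JSONL file; the mirror URL and the "text" key below are the ones evaluation code commonly uses and are assumptions here, not details quoted in this thread. Note that its detokenization differs from the Zenodo release, so scores on the two files aren't directly comparable.

```python
# Load the LAMBADA test file OpenAI used for GPT-2 evaluation.
# Assumed: mirror URL below, one JSON object per line with a "text" field.
import json
import urllib.request

URL = "https://openaipublic.blob.core.windows.net/gpt-2/data/lambada_test.jsonl"
with urllib.request.urlopen(URL) as resp:
    lines = resp.read().decode("utf-8").splitlines()
passages = [json.loads(line)["text"] for line in lines if line.strip()]
print(len(passages))
```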

@leogao2 changed the title from Implement the LAMBDA evaluation to Implement the LAMBADA evaluation Oct 6, 2020
@StellaAthena added the Eval Set label and removed the feature request label Oct 23, 2020
@StellaAthena StellaAthena pinned this issue Oct 23, 2020
@StellaAthena StellaAthena unpinned this issue Oct 23, 2020
@anishthite anishthite self-assigned this Oct 24, 2020
@leogao2 (Contributor) commented Nov 29, 2020

@anishthite is this done/are you still working on this?

@leogao2 (Contributor) commented Nov 29, 2020

I think I'll be taking over this one.

@leogao2 leogao2 assigned leogao2 and unassigned anishthite Nov 29, 2020
@leogao2 leogao2 reopened this Jan 28, 2021
@leogao2 leogao2 closed this as completed Jan 29, 2021
StellaAthena added a commit that referenced this issue Jan 29, 2021
@StellaAthena StellaAthena linked a pull request Jan 30, 2021 that will close this issue
@StellaAthena added the feature request label Jan 30, 2021
StellaAthena added a commit to dirkgr/lm-evaluation-harness that referenced this issue Apr 27, 2022
qmdnls pushed a commit to qmdnls/lm-evaluation-harness that referenced this issue Aug 17, 2023
LZY-the-boys pushed a commit to LZY-the-boys/lm-evaluation-harness-fast that referenced this issue Sep 12, 2023
lintangsutawika pushed a commit that referenced this issue Jul 8, 2024 (update metrics for afrixnli)
penfever pushed a commit to penfever/lm-evaluation-harness that referenced this issue Aug 14, 2024
5 participants