This repository has been archived by the owner on Feb 25, 2022. It is now read-only.
My understanding was that this is how OpenAI did it (comparing the last token rather than all the tokens of the last word), based on this remark at openai/gpt-2#131 (comment). But of course that could be my misinterpretation, or you might not want to follow them in that; it does seem rather odd.
Simplifying the procedure by testing for equality of the last BPE token instead of the last word, accuracy comes up to 46.89.
Hm, we had "WutheFwasthat" from OpenAI in our Discord server the other day, and he seemed to concur with my statement above. I think the other person in that thread is not OpenAI-affiliated.
We need to check whether the answer is split across multiple tokens rather than just computing accuracy from the very last token.
We should just be able to check whether the token begins with a space (i.e. starts a new word) and, if not, take the previous token as well.
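The backward walk described above could be sketched roughly like this. It's a minimal sketch operating on decoded token strings, assuming a GPT-2-style BPE where a leading space on a token marks a word boundary (real GPT-2 vocab entries use the `Ġ` marker internally); the function names are made up for illustration, not taken from the repo.

```python
def last_word_tokens(tokens):
    """Collect the span of tokens that make up the final word.

    Walks backwards from the last token, prepending previous tokens
    until one that starts with a space (a word boundary) is reached.
    """
    span = [tokens[-1]]
    i = len(tokens) - 1
    while i > 0 and not tokens[i].startswith(" "):
        i -= 1
        span.insert(0, tokens[i])
    return span


def last_word_correct(pred_tokens, target_tokens):
    """Compare the whole last word, not just the final BPE token."""
    return last_word_tokens(pred_tokens) == last_word_tokens(target_tokens)
```

For example, if "mother" tokenizes as `[" moth", "er"]`, comparing only the final token `"er"` could score a prediction ending in `" fath", "er"` as correct; comparing the full span avoids that.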
https://github.com/EleutherAI/GPTNeo/blob/master/model_fns.py#L255