Just my two cents, but might it be because these models tend to keep repeating themselves? There are issues on TRT for llama and for mixtral, or maybe I just have confirmation bias.
Is there any reason why we have an accuracy upper limit on tokens per sample for LLAMA2 but not for GPT-J? It would be good to document this reason for users.