Config settings for Llama3 and Mistral #47

Open
ehuaa opened this issue Aug 2, 2024 · 0 comments
ehuaa commented Aug 2, 2024

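In Section 4.2, the InfiniBench results in Table 1 use a window of 16K for Mistral 7B and 8K for Llama3-8B.
However, in Table 5 of the Appendix, the window sizes for LongBench change to 12K for Mistral and 6K for Llama3.

I have two questions about this:
1. Does the window size need to be chosen manually, offline, for each task?
2. For Llama3, the paper says the window size is 8K, but the config in the repo appears to give 16*128 + 4K = 6K. Did later testing show that a 6K window also works for Llama3?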
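To make the arithmetic in question 2 explicit, here is a minimal sketch of how I am computing the window from the config. It assumes the window is the number of retrieved blocks times the block size, plus the local window; the names `topk`, `block_size`, and `n_local` are my labels for those three values and may not match the repo's actual fields or how the paper counts the window.

```python
def effective_window(topk: int, block_size: int, n_local: int) -> int:
    """Tokens attended per step under the assumption stated above:
    retrieved blocks (topk * block_size) plus the local window (n_local)."""
    return topk * block_size + n_local


# Values as I read them from the Llama3 config (field names are assumptions).
llama3_window = effective_window(topk=16, block_size=128, n_local=4096)
print(llama3_window)         # 6144 tokens
print(llama3_window / 1024)  # 6.0, i.e. 6K rather than the 8K stated in the paper
```

If the paper's 8K figure counts additional tokens that this formula leaves out, that would explain the gap.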
