[BUG] GPU overflow during generation #14

Open
Lyaaaaaaaaaaaaaaa opened this issue Sep 4, 2022 · 0 comments
Labels: bug (Something isn't working)

Comments

@Lyaaaaaaaaaaaaaaa
Member

Describe the bug
The GPU runs out of memory when generating text.

If the GPU has just enough VRAM to load the model, with little left over, the application starts on the GPU without falling back to the CPU. VRAM usage then overshoots while generating the first sentence. As a result, no error occurs right after loading; the error occurs on the first generation instead.

Expected behavior
An error message, followed by a fallback to the CPU, emptying the GPU cache, or something similar.
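
A minimal sketch of what that fallback could look like, not the project's actual code. The issue does not say which framework is used, so the sketch takes the GPU and CPU generation steps as plain callables. All names here (`generate_with_fallback`, `is_out_of_memory`, `generate_on_gpu`, `generate_on_cpu`, `free_gpu_cache`) are hypothetical. If the backend is PyTorch (an assumption), `free_gpu_cache` could be `torch.cuda.empty_cache`, and the out-of-memory error would typically surface as a `RuntimeError` whose message mentions "out of memory".

```python
from typing import Callable, Optional


def is_out_of_memory(error: BaseException) -> bool:
    """Heuristic (assumption): a MemoryError, or any error whose message
    mentions 'out of memory', is treated as an out-of-memory failure."""
    return isinstance(error, MemoryError) or "out of memory" in str(error).lower()


def generate_with_fallback(
    prompt: str,
    generate_on_gpu: Callable[[str], str],
    generate_on_cpu: Callable[[str], str],
    free_gpu_cache: Optional[Callable[[], None]] = None,
) -> str:
    """Try generation on the GPU; on an out-of-memory error, report it,
    optionally free the GPU cache, and retry on the CPU."""
    try:
        return generate_on_gpu(prompt)
    except (RuntimeError, MemoryError) as error:
        # Unrelated errors are re-raised unchanged.
        if not is_out_of_memory(error):
            raise
        print(f"GPU ran out of memory during generation ({error}); falling back to CPU.")
        if free_gpu_cache is not None:
            free_gpu_cache()  # e.g. torch.cuda.empty_cache, if PyTorch is used
        return generate_on_cpu(prompt)
```

Passing the backend steps in as callables keeps the error handling independent of the framework; matching on the message text is a rough heuristic and may need adjusting to the error the backend actually raises.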

Lyaaaaaaaaaaaaaaa added the bug label Sep 4, 2022
Lyaaaaaaaaaaaaaaa self-assigned this Sep 4, 2022