Fix memory issue with llama.cpp LLM pipeline #824

Closed
davidmezzetti opened this issue Nov 30, 2024 · 0 comments

@davidmezzetti (Member)

The current behavior of the llama.cpp LLM pipeline is to always set n_ctx=0. When n_ctx=0, the context size defaults to the model's training context length (n_ctx_train), which can be very large for some models and lead to out-of-memory failures.

This change will fall back to the default n_ctx when loading with n_ctx=0 fails due to running out of memory. It will also allow n_ctx as an input parameter. If a manually set n_ctx is too large, loading will still fail, since the value is user-specified.
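A minimal sketch of that fallback logic, assuming the llama-cpp-python bindings and that a failed context allocation raises a ValueError; the `load` helper and its keyword handling are illustrative, not the actual txtai implementation:

```python
from llama_cpp import Llama

def load(path, **kwargs):
    # Honor an explicitly passed n_ctx and let any out-of-memory
    # failure propagate, since the value is user-specified
    if "n_ctx" in kwargs:
        return Llama(model_path=path, **kwargs)

    try:
        # n_ctx=0 asks llama.cpp to use the model's full training
        # context (n_ctx_train), which can be too large to allocate
        return Llama(model_path=path, n_ctx=0, **kwargs)
    except ValueError:
        # Fall back to the library's default context size
        return Llama(model_path=path, **kwargs)
```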

@davidmezzetti davidmezzetti added the bug Something isn't working label Nov 30, 2024
@davidmezzetti davidmezzetti added this to the v8.1.0 milestone Nov 30, 2024
@davidmezzetti davidmezzetti self-assigned this Nov 30, 2024