You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Environment, CPU architecture, OS, and Version:
Linux 9a4562508d46 5.15.0-88-generic #98-Ubuntu SMP Mon Oct 2 15:18:56 UTC 2023 x86_64 GNU/Linux
The build is executed in a Docker container based on golang:1.21-bookworm from https://hub.docker.com/_/golang
Describe the bug
All CMAKE_ARGS from the environment are overwritten in the Makefile target backend/cpp/llama/grpc-server:
We have to use CMAKE_ARGS="-DLLAMA_NATIVE=OFF" to fix the Error: inlining failed in call to ‘always_inline’ ‘_mm256_cvtph_ps’ as described in: ggml-org/llama.cpp#107
The text was updated successfully, but these errors were encountered:
LocalAI version:
https://github.com/mudler/LocalAI/tree/763f94ca80827981d0b5e5e41ee6a21fec5f5f67
Environment, CPU architecture, OS, and Version:
Linux 9a4562508d46 5.15.0-88-generic #98-Ubuntu SMP Mon Oct 2 15:18:56 UTC 2023 x86_64 GNU/Linux
The build is executed in a Docker container based on
golang:1.21-bookworm
from https://hub.docker.com/_/golangDescribe the bug
All
CMAKE_ARGS
from the environment are overwritten in the Makefile targetbackend/cpp/llama/grpc-server
:LocalAI/Makefile
Line 420 in 763f94c
To Reproduce
Build the following
Dockerfile
:Expected behavior
It should pass the
CMAKE_ARGS="-DLLAMA_NATIVE=OFF"
to the build context of the grpc-server.Logs
Additional context
As a quick workaround you can add the missing CMAKE_ARGS like:
This also fixes #1196
We have to use
CMAKE_ARGS="-DLLAMA_NATIVE=OFF"
to fix theError: inlining failed in call to ‘always_inline’ ‘_mm256_cvtph_ps’
as described in: ggml-org/llama.cpp#107The text was updated successfully, but these errors were encountered: