
llama : reorganize source code + improve CMake #8006

Merged: 13 commits into master from gg/reorganize-project on Jun 26, 2024
Conversation

@ggerganov (Owner) commented Jun 19, 2024:

ref #7573, #6913

Adopt a new source code structure that makes the ggml sources and build scripts easier to reuse.

Note that the build options relevant to the ggml library are now prefixed with GGML_. For example, LLAMA_CUDA and WHISPER_METAL are now GGML_CUDA and GGML_METAL. However, WHISPER_COREML and WHISPER_OPENVINO keep their names, because the CoreML and OpenVINO functionality in whisper.cpp is not part of the ggml library.
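For example, a project that vendors llama.cpp and wants CUDA enabled would now set the GGML_-prefixed option before adding the subdirectory. A minimal consumer-side sketch (the layout and target names are assumptions for illustration, not code from this PR):

```cmake
# Hypothetical consumer CMakeLists.txt: backend toggles are now GGML_-prefixed.
cmake_minimum_required(VERSION 3.14)
project(my_app C CXX)

set(GGML_CUDA  ON)   # was LLAMA_CUDA
set(GGML_METAL OFF)  # was WHISPER_METAL
add_subdirectory(llama.cpp)

add_executable(my_app main.cpp)
target_link_libraries(my_app PRIVATE llama)
```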

The Makefiles in llama.cpp and whisper.cpp have been updated to be more similar to each other.

Header files (such as ggml.h, llama.h and whisper.h) are now placed in include subfolders, while the source files are placed in src.
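As an illustrative sketch of what that layout implies on the CMake side (not the exact scripts from this PR), a library with include/ and src/ subfolders typically exposes its public headers like this:

```cmake
# Illustrative: public headers live in include/, sources in src/.
add_library(ggml src/ggml.c)
target_include_directories(ggml PUBLIC
    $<BUILD_INTERFACE:${CMAKE_CURRENT_SOURCE_DIR}/include>
    $<INSTALL_INTERFACE:include>)
```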

PRs in other projects that will be updated and merged together with this one:

TODOs:

  • update ggml repo
  • update whisper.cpp repo
  • update Makefiles
    • llama.cpp
    • whisper.cpp
  • update sync scripts
    • llama.cpp
    • ggml
    • whisper.cpp
    • add ggml/cmake
  • fix examples
  • fix CI
    • HIPBLAS
    • kompute
    • nix (maybe will work after merge to master?)
    • windows
  • update README with new GGML_ options
    • llama.cpp
    • whisper.cpp

Changes:

  • deprecate LLAMA_XXX in favor of GGML_XXX (see the sketch below)
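A minimal sketch of how such a deprecation shim could look (the warning text and the list of options here are assumptions, not the PR's code):

```cmake
# Hypothetical shim: map old LLAMA_XXX options onto the new GGML_XXX ones
# and warn, so existing build invocations keep working.
foreach (opt CUDA METAL VULKAN SYCL)
    if (DEFINED LLAMA_${opt})
        message(DEPRECATION "LLAMA_${opt} is deprecated, use GGML_${opt} instead")
        set(GGML_${opt} ${LLAMA_${opt}})
    endif()
endforeach()
```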

Resolved PRs:

TODO in follow-up PRs:

  • move relevant tests from tests to ggml/tests
  • avoid using CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS and build and link shared libraries properly
  • link shared libs to examples by default
  • simplify sync scripts to work on the ggml folder level

@github-actions bot added the build and python labels Jun 19, 2024
@mofosyne added the Review Complexity : Medium label Jun 19, 2024
@ggerganov force-pushed the gg/reorganize-project branch 10 times, most recently from ace2b97 to 8216c4c, June 21, 2024 09:33
@github-actions bot added the script label Jun 21, 2024
@github-actions bot added the devops label Jun 21, 2024
@github-actions bot added the documentation, nix, examples and SYCL labels Jun 21, 2024
@ggerganov (Owner, Author) commented:

The llama.cpp reorganization should now be mostly ready and can be tested. The ggml code, tests and cmake scripts will be shared across all 3 repos (as if it were a git submodule).

Some CI workflows are still failing; any help with resolving these is appreciated. I'll now focus on updating whisper.cpp to reuse ggml in a similar way, and then look to merge this sometime next week. Pay extra attention to the new build options (e.g. LLAMA_CUDA is now GGML_CUDA).

@slaren commented (this comment was marked as outdated)

@slaren (Collaborator) commented Jun 21, 2024:

It works with -DBUILD_SHARED_LIBS=OFF, so I am probably misunderstanding the error.

@slaren (Collaborator) commented Jun 21, 2024:

I noticed now that CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS is set in ggml when building on windows with shared libs. I think that the problem actually is that CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS is not set early enough, because configuring with -DCMAKE_WINDOWS_EXPORT_ALL_SYMBOLS=ON also works, without requiring the changes that I listed before.

The diff below seems to work. My assumption is that CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS needs to be set per project, unless it is passed on the command line, in which case it is applied to every subproject as well, but I am not sure if that's correct.

However, all of this seems like a hack: we go to the effort of using dllexport in ggml.h and llama.h to mark all the symbols that should be exported, but then we throw all of that away and use CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS to export every symbol.

```diff
diff --git a/CMakeLists.txt b/CMakeLists.txt
index 6b9b5413..96718d75 100644
--- a/CMakeLists.txt
+++ b/CMakeLists.txt
@@ -6,6 +6,7 @@ include(CheckIncludeFileCXX)
 set(CMAKE_WARN_UNUSED_CLI YES)

 set(CMAKE_EXPORT_COMPILE_COMMANDS ON)
+set(CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS ON)

 if (NOT XCODE AND NOT MSVC AND NOT CMAKE_BUILD_TYPE)
     set(CMAKE_BUILD_TYPE Release CACHE STRING "Build type" FORCE)
diff --git a/common/CMakeLists.txt b/common/CMakeLists.txt
index c6fccc02..02415f2d 100644
--- a/common/CMakeLists.txt
+++ b/common/CMakeLists.txt
@@ -1,5 +1,7 @@
 # common

+set(CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS ON)
+
 find_package(Threads REQUIRED)

 # Build info header
diff --git a/ggml/CMakeLists.txt b/ggml/CMakeLists.txt
index bdbda425..21ef7e4a 100644
--- a/ggml/CMakeLists.txt
+++ b/ggml/CMakeLists.txt
@@ -3,6 +3,7 @@ project("ggml" C CXX)
 include(CheckIncludeFileCXX)

 set(CMAKE_EXPORT_COMPILE_COMMANDS ON)
+set(CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS ON)

 if (NOT XCODE AND NOT MSVC AND NOT CMAKE_BUILD_TYPE)
     set(CMAKE_BUILD_TYPE Release CACHE STRING "Build type" FORCE)
diff --git a/ggml/src/CMakeLists.txt b/ggml/src/CMakeLists.txt
index 84bc8e19..b7c79321 100644
--- a/ggml/src/CMakeLists.txt
+++ b/ggml/src/CMakeLists.txt
@@ -825,10 +825,6 @@ endif()

 if (WIN32)
     add_compile_definitions(_CRT_SECURE_NO_WARNINGS)
-
-    if (BUILD_SHARED_LIBS)
-        set(CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS ON)
-    endif()
 endif()

 if (GGML_LTO)
```
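For comparison, CMake can also request this behavior per target via the WINDOWS_EXPORT_ALL_SYMBOLS target property, which the CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS variable merely initializes. A sketch of that alternative, not code from this PR:

```cmake
# Sketch: scope the export-all behavior to the one target that needs it,
# instead of setting the global variable in every subproject.
if (WIN32 AND BUILD_SHARED_LIBS)
    set_target_properties(ggml PROPERTIES WINDOWS_EXPORT_ALL_SYMBOLS ON)
endif()
```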

@slaren (Collaborator) reviewed Jun 21, 2024, on CMakeLists.txt (outdated):

> " to set correct LLAMA_BLAS_VENDOR")
> # override ggml options
> set(GGML_CCACHE ${LLAMA_CCACHE})
> set(GGML_BUILD_SHARED_LIBS ${LLAMA_BUILD_SHARED_LIBS})

GGML_BUILD_SHARED_LIBS and LLAMA_BUILD_SHARED_LIBS do not exist; BUILD_SHARED_LIBS is used directly.
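For reference, BUILD_SHARED_LIBS is the standard CMake switch consulted by add_library() calls that omit the STATIC/SHARED keyword, so no per-project wrapper variable is needed. A minimal sketch for illustration:

```cmake
# BUILD_SHARED_LIBS steers add_library() calls with no STATIC/SHARED keyword.
option(BUILD_SHARED_LIBS "build shared libraries" ON)
add_library(llama src/llama.cpp)  # built as SHARED iff BUILD_SHARED_LIBS is ON
```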

@mofosyne (Collaborator) commented:

If you are reorganizing the source code, I would like to suggest moving some compiled tools from the examples folder into a dedicated folder. They are not quite scripts, but in my opinion they are not examples either, considering they are more for maintainers' internal usage.

@ggerganov ggerganov merged commit f3f6542 into master Jun 26, 2024
1 check passed
@ggerganov ggerganov deleted the gg/reorganize-project branch June 26, 2024 15:33
@OuadiElfarouki (Collaborator) commented:

@slaren The LLAMA_CUDA_FORCE_CUBLAS cmake option got mistakenly removed but is still used. I believe it is intended to be mutually exclusive with GGML_CUDA_FORCE_MMQ, so some changes might be needed (cmake, mmq.cu, ggml-cuda.cu, ...).
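A hedged sketch of how the option could be restored with that mutual exclusion enforced at configure time (the names follow the comment above; the check itself is an assumption, not code from the repository):

```cmake
# Hypothetical: re-add the cuBLAS toggle and reject conflicting settings.
option(GGML_CUDA_FORCE_CUBLAS "force cuBLAS for all matrix multiplications" OFF)
if (GGML_CUDA_FORCE_CUBLAS AND GGML_CUDA_FORCE_MMQ)
    message(FATAL_ERROR "GGML_CUDA_FORCE_CUBLAS and GGML_CUDA_FORCE_MMQ are mutually exclusive")
endif()
```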

@ggerganov (Owner, Author) commented Jun 26, 2024:

There is now GGML_CUDA_FORCE_CUBLAS

Edit: nvm #8140

mudler added a commit to mudler/LocalAI that referenced this pull request Jun 27, 2024
Update build recipes with ggerganov/llama.cpp#8006

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
loonerin added a commit to loonerin/llama.cpp that referenced this pull request Jun 27, 2024
PR ggerganov#8006 changes defaults to build shared libs. However, CI for release
builds expects static builds.
loonerin added a commit to loonerin/llama.cpp that referenced this pull request Jun 27, 2024
PR ggerganov#8006 changes defaults to build shared libs. However, CI for releases
expects static builds.
slaren pushed a commit that referenced this pull request Jun 27, 2024
* CI: fix release build (Ubuntu)

PR #8006 changes defaults to build shared libs. However, CI for releases
expects static builds.

* CI: fix release build (Mac)

---------

Co-authored-by: loonerin <loonerin@users.noreply.github.com>
mudler added a commit to mudler/LocalAI that referenced this pull request Jun 27, 2024
* :arrow_up: Update ggerganov/llama.cpp

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* deps(llama.cpp): update build variables to follow upstream

Update build recipes with ggerganov/llama.cpp#8006

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Disable shared libs by default in llama.cpp

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Disable shared libs in llama.cpp Makefile

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Disable metal embedding for now, until it is tested

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix(mac): explicitly enable metal

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* debug

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix typo

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jun 30, 2024
* scripts : update sync [no ci]

* files : relocate [no ci]

* ci : disable kompute build [no ci]

* cmake : fixes [no ci]

* server : fix mingw build

ggml-ci

* cmake : minor [no ci]

* cmake : link math library [no ci]

* cmake : build normal ggml library (not object library) [no ci]

* cmake : fix kompute build

ggml-ci

* make,cmake : fix LLAMA_CUDA + replace GGML_CDEF_PRIVATE

ggml-ci

* move public backend headers to the public include directory (ggerganov#8122)

* move public backend headers to the public include directory

* nix test

* spm : fix metal header

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* scripts : fix sync paths [no ci]

* scripts : sync ggml-blas.h [no ci]

---------

Co-authored-by: slaren <slarengh@gmail.com>
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jun 30, 2024
MagnusS0 pushed two commits to MagnusS0/llama.cpp-normistral-tokenizer that referenced this pull request Jul 1, 2024
EZForever added a commit to EZForever/llama.cpp-static that referenced this pull request Jul 18, 2024
brittlewis12 added a commit to brittlewis12/llama-cpp-rs that referenced this pull request Jul 25, 2024
brittlewis12 added a commit to brittlewis12/llama-cpp-rs that referenced this pull request Jul 26, 2024