Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch to llama.cpp for gguf conversion #1625

Merged
merged 3 commits into from
Feb 8, 2025
Merged

Conversation

prabhu
Copy link
Collaborator

@prabhu prabhu commented Feb 8, 2025

Switched to llama.cpp master for gguf conversion and Q8_0 quantization. Works on my machine.

❯ ollama create cdx1-gguf -f Modelfile
gathering model components
copying file sha256:39ef2abdaad48bdbd1a8466c85b14a9646caf8049ec62e2b8d6383c56e035dc9 100%
parsing GGUF
using existing layer sha256:39ef2abdaad48bdbd1a8466c85b14a9646caf8049ec62e2b8d6383c56e035dc9
using autodetected template chatml
using existing layer sha256:a9f47bf267b28cae24d9d2b7e1d9204045da3b72568e4dcf5f66968e9d1fdf18
using existing layer sha256:de1eea7f14a061696a1490b000e7c36b45bb768c07a236acc828c69580770887
creating new layer sha256:6ee123ab45fc93d2d4223e606e0866d49b943b3826853a5db46fed97cdc50001
writing manifest
success
❯ ollama show cdx1-gguf
  Model
    architecture        llama
    parameters          14.7B
    context length      16384
    embedding length    5120
    quantization        Q8_0

  Parameters
    num_ctx        16384
    stop           "<|im_start|>"
    stop           "<|im_end|>"
    temperature    0.05
    top_k          10
    top_p          0.5

  System
    You are cdxgen, a CycloneDX and an xBOM expert.

  License
    apache-2.0
❯ ollama run cdx1-gguf
>>> tell me about cdxgen
cdxgen is a command-line tool that generates BOMs in various formats including JSON, XML, YAML, and CycloneDX. It supports multiple programming languages and frameworks.

>>>

Signed-off-by: Prabhu Subramanian <prabhu@appthreat.com>
Signed-off-by: Prabhu Subramanian <prabhu@appthreat.com>
@prabhu
Copy link
Collaborator Author

prabhu commented Feb 8, 2025

  • Detailed docs on using the model with ollama

Signed-off-by: Prabhu Subramanian <prabhu@appthreat.com>
@prabhu prabhu marked this pull request as ready for review February 8, 2025 11:12
@prabhu prabhu merged commit a00579e into master Feb 8, 2025
33 checks passed
@prabhu prabhu deleted the feature/cdx1-gguf-part2 branch February 8, 2025 11:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant