b3306 #208
Merged
Nexesenex (Owner) commented on Jul 4, 2024:
- I have read the contributing guidelines
- Self-reported review complexity:
  - Low
  - Medium
  - High
* ppl : fix n_seq_max for perplexity
* use 1 seq for kl_divergence
* llama : suppress unref var in Windows MSVC

This commit suppresses two warnings that are currently generated for src/llama.cpp when building on Windows MSVC:

```console
C:\llama.cpp\src\llama.cpp(14349,45): warning C4101: 'ex': unreferenced local variable [C:\llama.cpp\build\src\llama.vcxproj]
C:\llama.cpp\src\llama.cpp(19285,44): warning C4101: 'e': unreferenced local variable [C:\llama.cpp\build\src\llama.vcxproj]
```

* Update src/llama.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
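For context, here is a minimal sketch, not necessarily the exact patch applied in this commit, of the two standard ways to silence C4101 for an exception object that is caught but never used: drop the binding name, or explicitly void-cast it.

```cpp
#include <cstdio>
#include <stdexcept>

// Naming the caught exception without using it is what triggers
// warning C4101 under MSVC.
static void parse_or_default_a(const char * s) {
    try {
        if (s == nullptr) throw std::invalid_argument("null input");
    } catch (const std::exception &) { // no binding name -> no C4101
        std::puts("falling back to default");
    }
}

static void parse_or_default_b(const char * s) {
    try {
        if (s == nullptr) throw std::invalid_argument("null input");
    } catch (const std::exception & ex) {
        (void) ex; // explicit "unused" cast also silences the warning
        std::puts("falling back to default");
    }
}

int main() {
    parse_or_default_a(nullptr);
    parse_or_default_b(nullptr);
    return 0;
}
```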
This commit adds the compile definition `_CRT_SECURE_NO_WARNINGS` to the root cmake subproject.

The motivation for this is that currently the following warnings are displayed when compiling the tests and common cmake subprojects:

```console
test-llama-grammar.cpp
C:\llama.cpp\src\.\llama.cpp(1406,77): warning C4996: 'strerror': This function or variable may be unsafe. Consider using strerror_s instead. To disable deprecation, use _CRT_SECURE_NO_WARNINGS. See online help for details. [C:\llama.cpp\build\tests\test-llama-grammar.vcxproj]
...
```

This compile definition is currently set for the `src` subproject, and this change moves it into the root cmake project so that it is applied to all cmake subprojects.
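To illustrate what the warning is about, here is a hypothetical wrapper (not part of the PR): MSVC flags `strerror` as potentially unsafe and suggests `strerror_s`, so code that must stay warning-clean without `_CRT_SECURE_NO_WARNINGS` typically branches on the platform.

```cpp
#include <cerrno>
#include <cstddef>
#include <cstdio>
#include <cstring>

// errno_str is a hypothetical helper: use the MSVC "secure" variant on
// Windows, plain strerror() elsewhere. Defining _CRT_SECURE_NO_WARNINGS
// project-wide (as this commit does) makes the plain call warning-free too.
static const char * errno_str(int err, char * buf, std::size_t len) {
#ifdef _WIN32
    strerror_s(buf, len, err); // replacement suggested by warning C4996
    return buf;
#else
    (void) buf;
    (void) len;
    return strerror(err);      // fine on POSIX; not thread-safe
#endif
}

int main() {
    char buf[128];
    std::printf("ENOENT -> %s\n", errno_str(ENOENT, buf, sizeof(buf)));
    return 0;
}
```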
* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Not namespaced though :(
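A sketch of how a caller might use the new encoder-decoder entry points, closely following the pattern used for T5-style models. Error handling is trimmed and the signatures assumed are those of llama.h at this revision, so treat it as illustrative rather than canonical.

```cpp
#include "llama.h"

#include <cstdint>
#include <vector>

// For encoder-decoder models (T5, FLAN-T5), the prompt is first run
// through the encoder; generation then starts from the model's decoder
// start token instead of the prompt itself.
static bool prepare_generation(llama_context * ctx, const llama_model * model,
                               std::vector<llama_token> & tokens) {
    if (llama_model_has_encoder(model)) {
        const int32_t n_tokens = (int32_t) tokens.size();
        if (llama_encode(ctx, llama_batch_get_one(tokens.data(), n_tokens, 0, 0)) != 0) {
            return false; // encoder failed
        }
        llama_token dec_start = llama_model_decoder_start_token(model);
        if (dec_start == -1) {
            dec_start = llama_token_bos(model); // fall back to BOS
        }
        tokens.clear();
        tokens.push_back(dec_start);
    }
    // decoder-only models fall through unchanged; the usual
    // llama_decode() sampling loop follows in either case
    return true;
}
```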
This commit adds a new option to the tokenize example, `--show-count`. When this is set, the total number of tokens is printed to stdout.

This was added as an option as I was concerned that there might be scripts that use the output from this program, and it might be better not to print this information by default.

The motivation for this is that it can be useful to find out how many tokens a file contains, for example when trying to determine prompt input file sizes for testing.

Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>
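Typical usage might look like the following (the binary name assumes the `llama-` prefix used at this revision, and the output line is illustrative rather than a verbatim capture):

```console
$ ./llama-tokenize --model model.gguf --file prompt.txt --show-count
...
Total number of tokens: 42
```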
* Initial OpenELM support (270M only so far)
* Fill out missing entries in llama_model_type_name
* fixup! Initial OpenELM support (270M only so far)

  Fix formatting

* llama : support all OpenELM models
* llama : add variable GQA and variable FFN sizes

  Some metadata keys can now also be arrays to support setting their value per-layer for models like OpenELM.

* llama : minor spacing changes

  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* llama : use std::array for per-layer hparams
* llama : fix save/load state
* llama : do not print hparams for vocab-only models
* llama : handle n_head == 0
* llama : use const ref for print_f and fix division by zero
* llama : fix t5 uses of n_head and n_ff
* llama : minor comment

Co-authored-by: Francis Couture-Harpin <git@compilade.net>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
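A simplified sketch of the std::array approach and the n_head == 0 / division-by-zero guards mentioned above; the struct and sizes are illustrative, not the real llama.cpp hparams.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Per-layer hyperparameters stored as fixed-size arrays so values can
// differ by layer, as OpenELM's variable GQA and FFN sizes require.
struct hparams_sketch {
    static constexpr std::size_t MAX_LAYERS = 512; // illustrative cap

    uint32_t n_layer = 0;
    std::array<uint32_t, MAX_LAYERS> n_head_arr{};    // attention heads per layer
    std::array<uint32_t, MAX_LAYERS> n_head_kv_arr{}; // KV heads per layer
    std::array<uint32_t, MAX_LAYERS> n_ff_arr{};      // FFN width per layer

    uint32_t n_head   (uint32_t il) const { return n_head_arr[il]; }
    uint32_t n_head_kv(uint32_t il) const { return n_head_kv_arr[il]; }
    uint32_t n_ff     (uint32_t il) const { return n_ff_arr[il]; }

    // grouped-query attention ratio; guard the division so layers with
    // no KV heads (n_head_kv == 0) cannot divide by zero
    uint32_t n_gqa(uint32_t il) const {
        const uint32_t n_kv = n_head_kv(il);
        return n_kv == 0 ? 0 : n_head(il) / n_kv;
    }
};
```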
* main: add need_insert_eot
* do not format system prompt if it is empty
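The empty-system-prompt guard could look like the sketch below; the names are hypothetical, while the real change lives in the main example's chat handling.

```cpp
#include <string>

// Only run chat-template formatting when a system prompt was provided;
// an empty prompt should contribute nothing to the formatted output.
static std::string format_system_prompt(const std::string & sys) {
    if (sys.empty()) {
        return ""; // do not format an empty system prompt
    }
    // stand-in for the real chat-template call
    return "<|system|>\n" + sys + "\n";
}
```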