This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Pull upstream changes into GptNeox #270

Merged (3 commits, May 24, 2023)

Conversation

LLukas22 (Contributor)

Fixes #246.

The tokenization behaviour has to be changed to make this work with every GPT-NeoX-based model: RedPajama, for example, doesn't like the added padding token at the start of a sequence.
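
Roughly, the change boils down to making the leading token opt-in per model instead of unconditional. A minimal sketch of that idea (hypothetical helper and arbitrary token IDs, not this crate's actual API):

```rust
/// Hypothetical helper: prepend the BOS/padding token only when the model
/// expects one, since some GPT-NeoX variants (e.g. RedPajama) reject it.
fn build_input(bos_token: u32, mut tokens: Vec<u32>, prepend_bos: bool) -> Vec<u32> {
    if prepend_bos {
        tokens.insert(0, bos_token);
    }
    tokens
}

fn main() {
    // Arbitrary token IDs, purely for illustration.
    assert_eq!(build_input(0, vec![12092, 1533], true), vec![0, 12092, 1533]);
    assert_eq!(build_input(0, vec![12092, 1533], false), vec![12092, 1533]);
}
```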

@philpax (Collaborator) left a comment

Great work! Minor style nits but apart from that this is good to go from my end.

Regarding the BOS: does upstream have the same issue?

Review threads on crates/models/gptneox/src/lib.rs (resolved)
@LLukas22 (Contributor, Author)

Nope, upstream is not affected by the BOS token shenanigans. Seems like they don't prepend anything at the start of a sequence.

@philpax (Collaborator) commented May 23, 2023

Maybe we should just set bos=false for GPT-NeoX? That feels like a weird hack, but I can't see anything in the HF config that controls this.

@Narsil sorry about summoning you here, but for a given HF model, how do we know if the BOS token should be added / what I presume is add_special_tokens here (we're switching to HF in #271) should be turned on? Does it have to be bounced up to the user?
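
For context, with the Rust `tokenizers` crate the switch lives on `encode` itself. A minimal sketch, assuming a `tokenizer.json` is available locally (not this repo's actual code):

```rust
use tokenizers::Tokenizer;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Assumption: a tokenizer.json downloaded from the hub sits next to the
    // binary; `Tokenizer::from_pretrained` would need the crate's `http` feature.
    let tokenizer = Tokenizer::from_file("tokenizer.json")?;

    // The second argument is `add_special_tokens`: passing `false` stops the
    // post-processor from inserting BOS/EOS-style special tokens.
    let with_specials = tokenizer.encode("Hello world", true)?;
    let without_specials = tokenizer.encode("Hello world", false)?;

    println!("with:    {:?}", with_specials.get_ids());
    println!("without: {:?}", without_specials.get_ids());
    Ok(())
}
```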

philpax marked this pull request as ready for review May 23, 2023 19:22
@Narsil commented May 24, 2023

@philpax No worries for the summon, happy to help.

add_special_tokens should be True by default, yes.

But I think what you are also looking for is post_process.

The post_processor in tokenizers is the method that takes a vec of encodings (usually a single tokenized input or a pair, like question and context for question answering) and merges them into a single sequence to be consumed by the model.

Here is an example:
https://github.com/Narsil/smelte-rs/blob/main/examples/gpt2.rs#L261-L262

Unfortunately, some tokenizers are not necessarily correctly configured on the Hub, because some special tokens are sent directly within the string itself (especially useful for multilingual models like Whisper, or for translation models). So some knowledge might remain outside the tokenizer itself.

encode + post_process should be the go-to, I think.
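
A rough illustration of that encode + post_process flow with the Rust `tokenizers` crate, assuming a local `tokenizer.json` whose post-processor is configured with a pair template (e.g. BERT-style `[CLS] ... [SEP] ... [SEP]`):

```rust
use tokenizers::Tokenizer;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Assumption: tokenizer.json has a pair-aware post-processor configured.
    let tokenizer = Tokenizer::from_file("tokenizer.json")?;

    // Encode the two inputs separately, without special tokens yet.
    let question = tokenizer.encode("Who wrote the book?", false)?;
    let context = tokenizer.encode("The book was written by Jane Doe.", false)?;

    // post_process merges the pair into one sequence and lets the configured
    // post-processor insert its special tokens.
    let merged = tokenizer.post_process(question, Some(context), true)?;
    println!("{:?}", merged.get_tokens());
    Ok(())
}
```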

philpax merged commit 0d36ab7 into rustformers:main May 24, 2023
hhamud mentioned this pull request Aug 7, 2023
Successfully merging this pull request may close these issues:

Update to latest upstream GPT-NeoX implementation / Fix GPT-NeoX inference
3 participants