Struggling to get LLMs to follow your instructions? LogitsProcessorZoo offers a zoo of tools to use LLMs for specific tasks, beyond just grammar enforcement!
pip install logits-processor-zoo
- transformers
- vLLM
- TensorRT-LLM
import vllm
from logits_processor_zoo.vllm import GenLengthLogitsProcessor, CiteFromPromptLogitsProcessor, ForceLastPhraseLogitsProcessor
model = vllm.LLM(
model_name,
trust_remote_code=True,
dtype="half",
enforce_eager=True
)
tokenizer = model.get_tokenizer()
logits_processors = [
CiteFromPromptLogitsProcessor(tokenizer, boost_factor=2.0),
GenLengthLogitsProcessor(tokenizer, boost_factor=-0.2, p=1),
ForceLastPhraseLogitsProcessor("\n\nReferences:\n", tokenizer)
]
gen_output = model.generate(
prompts,
vllm.SamplingParams(
n=1,
temperature=0,
seed=0,
skip_special_tokens=True,
max_tokens=64,
logits_processors=logits_processors
),
use_tqdm=False
)
For the detailed examples in each framework, please have a look at example_notebook directory.
A logits processor that adjusts the likelihood of the end-of-sequence (EOS) token based on the length of the generated sequence, encouraging or discouraging shorter answers.
A logits processor which boosts or diminishes the likelihood of tokens present in the prompt (and optionally EOS token) to encourage the model to generate tokens similar to those seen in the prompt or vice versa.
A logits processor which forces LLMs to use the given phrase before they finalize their answers. Most common use cases can be providing references, thanking user with context etc.
A logits processor to answer multiple choice questions with one of the choices. A multiple choice question is like:
I am getting a lot of calls during the day. What is more important for me to consider when I buy a new phone?
0. Camera
1. Screen resolution
2. Operating System
3. Battery
The goal is to make LLM generate "3" as an answer.