FIX: Generating with mixed adapter batches and with beam search enabled #2287

BenjaminBossan · 2024-12-17T12:40:43Z

Right now, using mixed adapter batches (introduced in #1558) together with beam search generation does not work. This is because users need to pass the adapter name associated with each sample, i.e. the number of adapter names must be identical to the number of samples in the input.
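To make the requirement concrete, here is a minimal sketch of the one-name-per-sample rule. The helper `check_adapter_names` and the adapter names `adapter_a` and `adapter_b` are hypothetical and only serve as illustration; `"__base__"` is the name used to route a sample through the base model without any adapter.

```python
def check_adapter_names(adapter_names, batch_size):
    """Raise if there is not exactly one adapter name per input sample.

    Hypothetical helper for illustration, not part of PEFT.
    """
    if len(adapter_names) != batch_size:
        raise ValueError(
            f"Got {len(adapter_names)} adapter names for {batch_size} samples; "
            "the two numbers must match."
        )


prompts = ["first prompt", "second prompt", "third prompt"]
# One entry per prompt; "__base__" means "use the base model for this sample".
adapter_names = ["__base__", "adapter_a", "adapter_b"]
check_adapter_names(adapter_names, len(prompts))
```

With beam search, the batch that reaches the adapter layers is larger than `len(prompts)`, which is why a list that satisfies this check on the input can still fail during generation.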

When applying beam search, transformers internally repeats the samples once per beam (or so it looks like). Therefore, we have more samples during generation than samples in the input. Consequently, the adapter names have to be extended accordingly. This is now taken care of.
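The extension can be sketched as follows. This assumes that each sample is repeated `num_beams` times consecutively, so that all beams of one sample sit next to each other in the batch; that ordering is an assumption here, and `expand_adapter_names` is a hypothetical helper rather than the function used in this PR.

```python
def expand_adapter_names(adapter_names, num_beams):
    """Repeat each adapter name once per beam, keeping beams of a sample adjacent.

    Hypothetical helper for illustration; assumes interleaved repetition.
    """
    if num_beams is None or num_beams <= 1:
        # No beam search: the batch size is unchanged.
        return list(adapter_names)
    return [name for name in adapter_names for _ in range(num_beams)]


names = ["__base__", "adapter_a"]
expanded = expand_adapter_names(names, num_beams=3)
# expanded now has len(names) * 3 entries, one per (sample, beam) pair.
```

The key point is that the expanded list has `len(adapter_names) * num_beams` entries, matching the number of rows the model sees during beam search.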

For encoder-decoder models, we need to be careful. It seems like only the decoder inputs are repeated, whereas the encoder receives the original number of inputs. Therefore, when an encoder-decoder model is identified, the extension is only applied to the decoder part.
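As a rough sketch of that distinction: the encoder keeps the original list, while the decoder gets one entry per sample and beam. The function name, the `is_encoder_decoder` flag, and the dictionary keys are all hypothetical and chosen for illustration; the actual implementation in this PR may be structured differently.

```python
def adapter_names_per_part(adapter_names, num_beams, is_encoder_decoder):
    """Return the adapter names each part of the model should receive.

    Hypothetical helper for illustration. Assumes beams of one sample are
    adjacent in the expanded batch.
    """
    uses_beam_search = isinstance(num_beams, int) and num_beams > 1
    if uses_beam_search:
        expanded = [name for name in adapter_names for _ in range(num_beams)]
    else:
        expanded = list(adapter_names)

    if is_encoder_decoder:
        # The encoder runs once on the original batch; only the decoder
        # sees the batch repeated per beam.
        return {"encoder": list(adapter_names), "decoder": expanded}
    # Decoder-only model: every layer sees the expanded batch.
    return {"decoder": expanded}


parts = adapter_names_per_part(
    ["adapter_a", "adapter_b"], num_beams=2, is_encoder_decoder=True
)
```

Here `parts["encoder"]` keeps two entries while `parts["decoder"]` has four, which mirrors the size mismatch that occurs if the same list is passed to both parts.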

See huggingface#2283

Right now, using mixed adapter batches with beam search generations does not work. This is because users need to pass the adapter names associated with each sample, i.e. the number of adapter names should be identical to the number of samples in the input.

When applying beam search, transformers internally repeats the samples once per beam (or so it looks like). Therefore, we have more samples during generation than samples in the input. Consequently, the adapter names have to be extended accordingly. This is now taken care of.

Unfortunately, this does not work for encoder-decoder models yet. With these models, there is always a size mismatch, whether adapter names are extended or not. What I suspect is happening is that only the decoder needs to be extended, but right now I don't see a way to implement this distinction in PEFT. Therefore, encoder-decoder + beam search generations is not supported for the time being.

HuggingFaceDocBuilderDev · 2024-12-17T12:47:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

BenjaminBossan added 3 commits December 17, 2024 13:16

Add support for encoder-decoder models

6580e90

Add comment to test, remove unnecessary seeds

509f09e

BenjaminBossan mentioned this pull request Dec 17, 2024

TypeError when inference with different LoRA adapters in the same batch #2283

Open

4 tasks

Remove obsolete import

d6bd49b

BenjaminBossan requested a review from githubnemo December 17, 2024 16:51

Correct task type in test

eefbc15
