Feature request
Exporting decoder-only text-generation models with sequence-classification heads (e.g., LlamaForSequenceClassification) was disabled in #1308.
Per https://arxiv.org/abs/2310.01208, these models can outperform typical encoder models on sequence classification (I can confirm this on my own datasets).
What would it take to support this in transformers? CC @fxmarty since it was mentioned specifically in the PR above.
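For reference, this is the kind of export that currently errors out. A minimal sketch using optimum's programmatic export entry point; the checkpoint name is a placeholder, not a real model:

```python
from optimum.exporters.onnx import main_export

# Hypothetical checkpoint; any decoder-only sequence-classification
# fine-tune currently hits the same restriction from #1308.
main_export(
    "my-org/llama-seq-cls-finetune",  # placeholder model id
    output="llama_seq_cls_onnx",
    task="text-classification",
)
```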
Motivation
I would like to export to ONNX fine-tuned sequence-classification models that use a decoder-only model as their base architecture.
Your contribution
Happy to submit a PR to transformers with guidance on what would be needed.
Ah, I see what's happening: ONNX doesn't support int64 inputs to argmax, which is how these models compute the sequence lengths for pooling. I'll open a PR over in transformers and leave this open until we can enable export in this repo.
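To make the failure mode concrete, here is a minimal sketch of the pooling pattern in question (assumed to match the LlamaForSequenceClassification logic in transformers; tensor values are illustrative), plus one possible export-friendly rewrite via a dtype cast. The cast is an assumption about the fix, not the confirmed patch:

```python
import torch

pad_token_id = 0
input_ids = torch.tensor([[11, 23, 42, 0, 0],
                          [ 7, 19,  0, 0, 0]])

# Pattern that trips the ONNX export: argmax over an int64 tensor.
# The model locates the last non-padding token so it can pool its hidden state.
mask = torch.eq(input_ids, pad_token_id).long()            # int64 equality mask
sequence_lengths = mask.argmax(-1) - 1                     # first pad position - 1
sequence_lengths = sequence_lengths % input_ids.shape[-1]  # handle rows with no padding
print(sequence_lengths)  # tensor([2, 1])

# Possible workaround: cast the mask to int32 before argmax so the traced
# ArgMax op sees a dtype the ONNX backend supports.
sequence_lengths_safe = torch.eq(input_ids, pad_token_id).int().argmax(-1) - 1
sequence_lengths_safe = sequence_lengths_safe % input_ids.shape[-1]
print(sequence_lengths_safe)  # tensor([2, 1])
```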